NFDI4DS | UHH-SEMS - Publication Details

Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity

FOS: Computer and information sciences Computer Science - Computation and Language Computation and Language (cs.CL)

DOI: 10.48550/arxiv.2310.07521 Publication Date: 2023-01-01

Abstract Supplemental Material References Cited by

AUTHORS (16)

Wang, Cunxiang

Liu, Xiaoze

Yue, Yuanhao

Tang, Xiangru

Zhang, Tianhang

Jiayang, Cheng

Yao, Yunzhi

Gao, Wenyang

Hu, Xuming

Qi, Zehan

Wang, Yidong

Yang, Linyi

Wang, Jindong

Xie, Xing

Zhang, Zheng

Zhang, Yue

ABSTRACT

62 pages; 300+ references<br/>This survey addresses the crucial issue of factuality in Large Language Models (LLMs). As LLMs find applications across diverse domains, the reliability and accuracy of their outputs become vital. We define the Factuality Issue as the probability of LLMs to produce content inconsistent with established facts. We first delve into the implications of these inaccuracies, highlighting the potential consequences and challenges posed by factual errors in LLM outputs. Subsequently, we analyze the mechanisms through which LLMs store and process facts, seeking the primary causes of factual errors. Our discussion then transitions to methodologies for evaluating LLM factuality, emphasizing key metrics, benchmarks, and studies. We further explore strategies for enhancing LLM factuality, including approaches tailored for specific domains. We focus two primary LLM configurations standalone LLMs and Retrieval-Augmented LLMs that utilizes external data, we detail their unique challenges and potential enhancements. Our survey offers a structured guide for researchers aiming to fortify the factual reliability of LLMs.<br/>

SUPPLEMENTAL MATERIAL

Coming soon ....

REFERENCES ()

CITATIONS ()

EXTERNAL LINKS

OPENAIRE - Products

PlumX Metrics

Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity

RECOMMENDATIONS

FAIR ASSESSMENT

Coming soon ....

JUPYTER LAB

Coming soon ....