1 code implementation • 3 Mar 2024 • Shiqi Chen, Miao Xiong, Junteng Liu, Zhengxuan Wu, Teng Xiao, Siyang Gao, Junxian He
Large language models (LLMs) frequently hallucinate and produce factual errors, yet our understanding of why they make these errors remains limited.
1 code implementation • NeurIPS 2023 • Shiqi Chen, Yiran Zhao, Jinghan Zhang, I-Chun Chern, Siyang Gao, PengFei Liu, Junxian He
In this benchmark, we collect responses generated from LLMs and annotate factuality labels in a fine-grained manner.
2 code implementations • 23 May 2023 • Shiqi Chen, Siyang Gao, Junxian He
Detecting factual errors in summaries has been an important and challenging subject in summarization research.
no code implementations • 27 Nov 2022 • Yanwen Li, Siyang Gao, Zhongshun Shi
In this paper, we theoretically analyze these myopic procedures and prove that they also satisfy the optimality conditions of R&S, just like some other popular R&S methods.
no code implementations • 27 Nov 2022 • Yanwen Li, Siyang Gao
It builds the optimality conditions for the number of samples allocated to each design, and the sample allocation that satisfies the optimality conditions is shown to asymptotically maximize the probability of correct selection for the best design.
no code implementations • 14 Jun 2022 • Yanwen Li, Siyang Gao
Due to the complex calculation of KG, theoretical analysis of this algorithm is difficult, and existing results are mostly about the asymptotic performance of it, e. g., consistency, asymptotic sample allocation, etc.