no code implementations • 28 Jan 2024 • Zhumin Chu, Qingyao Ai, Yiteng Tu, Haitao Li, Yiqun Liu
Existing paradigms rely on either human annotators or model-based evaluators to evaluate the performance of LLMs on different tasks.
Language Modelling Large Language Model +1