Search Results for author: Eunsu Kim

Found 3 papers, 2 papers with code

CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean

1 code implementation11 Mar 2024 Eunsu Kim, Juyoung Suk, Philhoon Oh, Haneul Yoo, James Thorne, Alice Oh

Despite the rapid development of large language models (LLMs) for the Korean language, there remains an obvious lack of benchmark datasets that test the requisite Korean cultural and linguistic knowledge.

Hate Speech Detection

Multi-FAct: Assessing Multilingual LLMs' Multi-Regional Knowledge using FActScore

1 code implementation28 Feb 2024 Sheikh Shafayat, Eunsu Kim, Juhyun Oh, Alice Oh

Large Language Models (LLMs) are prone to factuality hallucination, generating text that contradicts established knowledge.

Hallucination

The Generative AI Paradox on Evaluation: What It Can Solve, It May Not Evaluate

no code implementations9 Feb 2024 Juhyun Oh, Eunsu Kim, Inha Cha, Alice Oh

This paper explores the assumption that Large Language Models (LLMs) skilled in generation tasks are equally adept as evaluators.

Question Answering TriviaQA

Cannot find the paper you are looking for? You can Submit a new open access paper.