Search Results for author: Charles Clarke

Found 4 papers, 1 papers with code

Rumour Evaluation with Very Large Language Models

1 code implementation11 Apr 2024 Dahlia Shehata, Robin Cohen, Charles Clarke

To the end, we employ two prompting-based LLM variants (GPT-3. 5-turbo and GPT-4) to extend the two RumourEval subtasks: (1) veracity prediction, and (2) stance classification.

Misinformation Prompt Engineering +2

Towards better Human-Agent Alignment: Assessing Task Utility in LLM-Powered Applications

no code implementations14 Feb 2024 Negar Arabzadeh, Julia Kiseleva, Qingyun Wu, Chi Wang, Ahmed Awadallah, Victor Dibia, Adam Fourney, Charles Clarke

The rapid development in the field of Large Language Models (LLMs) has led to a surge in applications that facilitate collaboration among multiple agents to assist humans in their daily tasks.

Math

Perspectives on Large Language Models for Relevance Judgment

no code implementations13 Apr 2023 Guglielmo Faggioli, Laura Dietz, Charles Clarke, Gianluca Demartini, Matthias Hagen, Claudia Hauff, Noriko Kando, Evangelos Kanoulas, Martin Potthast, Benno Stein, Henning Wachsmuth

When asked, large language models (LLMs) like ChatGPT claim that they can assist with relevance judgments but it is not clear whether automated judgments can reliably be used in evaluations of retrieval systems.

Retrieval

Cannot find the paper you are looking for? You can Submit a new open access paper.