no code implementations • 19 Feb 2024 • Jonathan Zheng, Alan Ritter, Wei Xu
The performance of Large Language Models (LLMs) degrades from the temporal drift between data used for model training and newer text seen during inference.
no code implementations • 6 Feb 2024 • Anton Lavrouk, Ian Ligon, Tarek Naous, Jonathan Zheng, Alan Ritter, Wei Xu
The Stanceosaurus corpus (Zheng et al., 2022) was designed to provide high-quality, annotated, 5-way stance data extracted from Twitter, suitable for analyzing cross-cultural and cross-lingual misinformation.
no code implementations • 28 Oct 2022 • Jonathan Zheng, Ashutosh Baheti, Tarek Naous, Wei Xu, Alan Ritter
We present Stanceosaurus, a new corpus of 28, 033 tweets in English, Hindi, and Arabic annotated with stance towards 251 misinformation claims.