no code implementations • 23 Mar 2024 • Amrita Bhattacharjee, Raha Moraffah, Joshua Garland, Huan Liu
With the advancement in capabilities of Large Language Models (LLMs), one major step in the responsible and safe use of such LLMs is to be able to detect text generated by these models.
no code implementations • 12 Mar 2024 • Tharindu Kumarage, Amrita Bhattacharjee, Joshua Garland
Large language models (LLMs) excel in many diverse applications beyond language generation, e.g., translation, summarization, and sentiment analysis.
no code implementations • 2 Mar 2024 • Tharindu Kumarage, Garima Agrawal, Paras Sheth, Raha Moraffah, Aman Chadha, Joshua Garland, Huan Liu
We have recently witnessed a rapid proliferation of advanced Large Language Models (LLMs) capable of generating high-quality text.
no code implementations • 8 Oct 2023 • Tharindu Kumarage, Paras Sheth, Raha Moraffah, Joshua Garland, Huan Liu
The novel universal evasive prompt is achieved in two steps: First, we create an evasive soft prompt tailored to a specific PLM through prompt tuning; and then, we leverage the transferability of soft prompts to transfer the learned evasive soft prompt from one PLM to another.
no code implementations • 23 Sep 2023 • Amrita Bhattacharjee, Raha Moraffah, Joshua Garland, Huan Liu
Inspired by recent endeavors to utilize Large Language Models (LLMs) as experts, in this work, we aim to leverage the instruction-following and textual understanding capabilities of recent state-of-the-art LLMs to facilitate causal explainability via counterfactual explanation generation for black-box text classifiers.
2 code implementations • 6 Sep 2023 • Tharindu Kumarage, Amrita Bhattacharjee, Djordje Padejski, Kristy Roschke, Dan Gillmor, Scott Ruston, Huan Liu, Joshua Garland
The rapid proliferation of AI-generated text online is profoundly reshaping the information landscape.
1 code implementation • 7 Mar 2023 • Tharindu Kumarage, Joshua Garland, Amrita Bhattacharjee, Kirill Trapeznikov, Scott Ruston, Huan Liu
However, tweets are inherently short, thus making it difficult for current state-of-the-art pre-trained language model-based detectors to accurately detect at what point the AI starts to generate tweets in a given Twitter timeline.
no code implementations • 16 Sep 2020 • Joshua Garland, Keyan Ghazi-Zahedi, Jean-Gabriel Young, Laurent Hébert-Dufresne, Mirta Galesic
Citizen-generated counter speech is a promising way to fight hate speech and promote peaceful, non-polarized discourse.
no code implementations • EMNLP (ALW) 2020 • Joshua Garland, Keyan Ghazi-Zahedi, Jean-Gabriel Young, Laurent Hébert-Dufresne, Mirta Galesic
Hateful rhetoric is plaguing online discourse, fostering extreme societal movements and possibly giving rise to real-world violence.