no code implementations • 17 Apr 2024 • Paras Sheth, Tharindu Kumarage, Raha Moraffah, Aman Chadha, Huan Liu
Content moderation faces a challenging task as social media's ability to spread hate speech contrasts with its role in promoting global connectivity.
no code implementations • 12 Mar 2024 • Tharindu Kumarage, Amrita Bhattacharjee, Joshua Garland
Large language models (LLMs) excel in many diverse applications beyond language generation, e. g., translation, summarization, and sentiment analysis.
no code implementations • 2 Mar 2024 • Tharindu Kumarage, Garima Agrawal, Paras Sheth, Raha Moraffah, Aman Chadha, Joshua Garland, Huan Liu
We have witnessed lately a rapid proliferation of advanced Large Language Models (LLMs) capable of generating high-quality text.
no code implementations • 14 Nov 2023 • Garima Agrawal, Tharindu Kumarage, Zeyad Alghamdi, Huan Liu
The contemporary LLMs are prone to producing hallucinations, stemming mainly from the knowledge gaps within the models.
no code implementations • 8 Oct 2023 • Tharindu Kumarage, Paras Sheth, Raha Moraffah, Joshua Garland, Huan Liu
The novel universal evasive prompt is achieved in two steps: First, we create an evasive soft prompt tailored to a specific PLM through prompt tuning; and then, we leverage the transferability of soft prompts to transfer the learned evasive soft prompt from one PLM to another.
1 code implementation • 7 Sep 2023 • Amrita Bhattacharjee, Tharindu Kumarage, Raha Moraffah, Huan Liu
Given the potential malicious nature in which these LLMs can be used to generate disinformation at scale, it is important to build effective detectors for such AI-generated text.
2 code implementations • 6 Sep 2023 • Tharindu Kumarage, Amrita Bhattacharjee, Djordje Padejski, Kristy Roschke, Dan Gillmor, Scott Ruston, Huan Liu, Joshua Garland
The rapid proliferation of AI-generated text online is profoundly reshaping the information landscape.
1 code implementation • 14 Aug 2023 • Tharindu Kumarage, Huan Liu
Large language models (LLMs) such as GPT-4, PaLM, and Llama have significantly propelled the generation of AI-crafted text.
1 code implementation • 3 Aug 2023 • Paras Sheth, Tharindu Kumarage, Raha Moraffah, Aman Chadha, Huan Liu
By disentangling input into platform-dependent features (useful for predicting hate targets) and platform-independent features (used to predict the presence of hate), we learn invariant representations resistant to distribution shifts.
1 code implementation • 15 Jun 2023 • Paras Sheth, Tharindu Kumarage, Raha Moraffah, Aman Chadha, Huan Liu
Hate speech detection refers to the task of detecting hateful content that aims at denigrating an individual or a group based on their religion, gender, sexual orientation, or other characteristics.
1 code implementation • 7 Mar 2023 • Tharindu Kumarage, Joshua Garland, Amrita Bhattacharjee, Kirill Trapeznikov, Scott Ruston, Huan Liu
However, tweets are inherently short, thus making it difficult for current state-of-the-art pre-trained language model-based detectors to accurately detect at what point the AI starts to generate tweets in a given Twitter timeline.
1 code implementation • 31 Jan 2023 • Melanie Subbiah, Amrita Bhattacharjee, Yilun Hua, Tharindu Kumarage, Huan Liu, Kathleen McKeown
Manipulated news online is a growing problem which necessitates the use of automated systems to curtail its spread.