Search Results for author: Tharindu Kumarage

Found 12 papers, 7 papers with code

Cross-Platform Hate Speech Detection with Weakly Supervised Causal Disentanglement

no code implementations • 17 Apr 2024 • Paras Sheth, Tharindu Kumarage, Raha Moraffah, Aman Chadha, Huan Liu

Content moderation faces a challenging task as social media's ability to spread hate speech contrasts with its role in promoting global connectivity.

Disentanglement Hate Speech Detection

Paper
Add Code

Harnessing Artificial Intelligence to Combat Online Hate: Exploring the Challenges and Opportunities of Large Language Models in Hate Speech Detection

no code implementations • 12 Mar 2024 • Tharindu Kumarage, Amrita Bhattacharjee, Joshua Garland

Large language models (LLMs) excel in many diverse applications beyond language generation, e. g., translation, summarization, and sentiment analysis.

Hate Speech Detection Sentiment Analysis +3

Paper
Add Code

A Survey of AI-generated Text Forensic Systems: Detection, Attribution, and Characterization

no code implementations • 2 Mar 2024 • Tharindu Kumarage, Garima Agrawal, Paras Sheth, Raha Moraffah, Aman Chadha, Joshua Garland, Huan Liu

We have witnessed lately a rapid proliferation of advanced Large Language Models (LLMs) capable of generating high-quality text.

Misinformation Text Generation

Paper
Add Code

Can Knowledge Graphs Reduce Hallucinations in LLMs? : A Survey

no code implementations • 14 Nov 2023 • Garima Agrawal, Tharindu Kumarage, Zeyad Alghamdi, Huan Liu

The contemporary LLMs are prone to producing hallucinations, stemming mainly from the knowledge gaps within the models.

Knowledge Graphs

Paper
Add Code

How Reliable Are AI-Generated-Text Detectors? An Assessment Framework Using Evasive Soft Prompts

no code implementations • 8 Oct 2023 • Tharindu Kumarage, Paras Sheth, Raha Moraffah, Joshua Garland, Huan Liu

The novel universal evasive prompt is achieved in two steps: First, we create an evasive soft prompt tailored to a specific PLM through prompt tuning; and then, we leverage the transferability of soft prompts to transfer the learned evasive soft prompt from one PLM to another.

Paper
Add Code

ConDA: Contrastive Domain Adaptation for AI-generated Text Detection

1 code implementation • 7 Sep 2023 • Amrita Bhattacharjee, Tharindu Kumarage, Raha Moraffah, Huan Liu

Given the potential malicious nature in which these LLMs can be used to generate disinformation at scale, it is important to build effective detectors for such AI-generated text.

Contrastive Learning Text Detection +1

Paper
Code

J-Guard: Journalism Guided Adversarially Robust Detection of AI-generated News

2 code implementations • 6 Sep 2023 • Tharindu Kumarage, Amrita Bhattacharjee, Djordje Padejski, Kristy Roschke, Dan Gillmor, Scott Ruston, Huan Liu, Joshua Garland

The rapid proliferation of AI-generated text online is profoundly reshaping the information landscape.

Adversarial Robustness Misinformation

Paper
Code

Neural Authorship Attribution: Stylometric Analysis on Large Language Models

1 code implementation • 14 Aug 2023 • Tharindu Kumarage, Huan Liu

Large language models (LLMs) such as GPT-4, PaLM, and Llama have significantly propelled the generation of AI-crafted text.

Authorship Attribution Language Modelling +1

Paper
Code

Causality Guided Disentanglement for Cross-Platform Hate Speech Detection

1 code implementation • 3 Aug 2023 • Paras Sheth, Tharindu Kumarage, Raha Moraffah, Aman Chadha, Huan Liu

By disentangling input into platform-dependent features (useful for predicting hate targets) and platform-independent features (used to predict the presence of hate), we learn invariant representations resistant to distribution shifts.

Disentanglement Hate Speech Detection

Paper
Code

PEACE: Cross-Platform Hate Speech Detection- A Causality-guided Framework

1 code implementation • 15 Jun 2023 • Paras Sheth, Tharindu Kumarage, Raha Moraffah, Aman Chadha, Huan Liu

Hate speech detection refers to the task of detecting hateful content that aims at denigrating an individual or a group based on their religion, gender, sexual orientation, or other characteristics.

Hate Speech Detection

Paper
Code

Stylometric Detection of AI-Generated Text in Twitter Timelines

1 code implementation • 7 Mar 2023 • Tharindu Kumarage, Joshua Garland, Amrita Bhattacharjee, Kirill Trapeznikov, Scott Ruston, Huan Liu

However, tweets are inherently short, thus making it difficult for current state-of-the-art pre-trained language model-based detectors to accurately detect at what point the AI starts to generate tweets in a given Twitter timeline.

Language Modelling Misinformation +1

Paper
Code

Towards Detecting Harmful Agendas in News Articles

1 code implementation • 31 Jan 2023 • Melanie Subbiah, Amrita Bhattacharjee, Yilun Hua, Tharindu Kumarage, Huan Liu, Kathleen McKeown

Manipulated news online is a growing problem which necessitates the use of automated systems to curtail its spread.

Misinformation

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.