The latest generative large language models (LLMs) have found their application in data augmentation tasks, where small numbers of text samples are LLM-paraphrased and then used to fine-tune downstream models.

12 Jan 2024

Paper
Code

From Big to Small Without Losing It All: Text Augmentation with ChatGPT for Efficient Sentiment Analysis

clarin-pl/text-augumentation-with-chatgpt • • 7 Dec 2023

In the era of artificial intelligence, data is gold but costly to annotate.

07 Dec 2023

Paper
Code

Teaching Specific Scientific Knowledge into Large Language Models through Additional Training

kanhatakeyama/Additional-training-Llama2 • 6 Dec 2023

Through additional training, we explore embedding specialized scientific knowledge into the Llama 2 Large Language Model (LLM).

06 Dec 2023

Paper
Code

COVID-19 Vaccine Misinformation in Middle Income Countries

zzoliman/covid-vaccine-misinfo-mic • 30 Nov 2023

This paper introduces a multilingual dataset of COVID-19 vaccine misinformation, consisting of annotated tweets from three middle-income countries: Brazil, Indonesia, and Nigeria.

30 Nov 2023

Paper
Code

Pretraining Language Models with Text-Attributed Heterogeneous Graphs

hope-rita/thlm • • 19 Oct 2023

In many real-world scenarios (e. g., academic networks, social platforms), different types of entities are not only associated with texts but also connected by various relationships, which can be abstracted as Text-Attributed Heterogeneous Graphs (TAHGs).

19 Oct 2023

Paper
Code

Distributional Data Augmentation Methods for Low Resource Language

mosh98/text_aug_low_res • 9 Sep 2023

One of the current state-of-the-art text augmentation techniques is easy data augmentation (EDA), which augments the training data by injecting and replacing synonyms and randomly permuting sentences.

09 Sep 2023

Paper
Code

Story Visualization by Online Text Augmentation with Context Memory

yonseivnl/cmota • • ICCV 2023

Story visualization (SV) is a challenging text-to-image generation task for the difficulty of not only rendering visual details from the text descriptions but also encoding a long-term context across multiple sentences.

15 Aug 2023

Paper
Code

Text Augmentation

Benchmarks Add a Result

Libraries

Latest papers

Content

Benchmarks

Add a Result