In this work, we propose a simple and effective method to cover a much larger proportion of the attack search space, called Adversarial and Mixup Data Augmentation (AMDA).

GPT3Mix: Leveraging Large-scale Language Models for Text Augmentation

naver-ai/hypermix • Findings (EMNLP) 2021

Large-scale language models such as GPT-3 are excellent few-shot learners, allowing them to be controlled via natural text prompts.

Self-training Improves Pre-training for Few-shot Learning in Task-oriented Dialog Systems

mifei/st-tod • EMNLP 2021

In this paper, we devise a self-training approach to utilize the abundant unlabeled dialog data to further improve state-of-the-art pre-trained models in few-shot learning scenarios for ToD systems.

Linguistic Knowledge in Data Augmentation for Natural Language Processing: An Example on Chinese Question Matching

jaaack-wang/linguistic-knowledge-in-DA-for-NLP • 29 Nov 2021

To investigate the role of linguistic knowledge in data augmentation (DA) for Natural Language Processing (NLP), we designed two adapted DA programs and applied them to LCQMC (a Large-scale Chinese Question Matching Corpus) for a binary Chinese question matching classification task.

UCD-CS at TREC 2021 Incident Streams Track

wangcongcong123/crisis-mtl • 7 Dec 2021

In recent years, the task of mining important information from social media posts during crises has become a focus of research for the purposes of assisting emergency response (ES).

Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning

snap-research/mmvid • CVPR 2022

In addition, our model can extract visual information as suggested by the text prompt, e.g., "an object in image one is moving northeast", and generate corresponding videos.

BAN-Cap: A Multi-Purpose English-Bangla Image Descriptions Dataset

faiyazkhan11/ban-cap • LREC 2022

As computers have become efficient at understanding visual information and transforming it into a written representation, research interest in tasks like automatic image captioning has seen a significant leap over the last few years.

Selective Text Augmentation with Word Roles for Low-Resource Text Classification

beyondguo/STA • 4 Sep 2022

Different words may play different roles in text classification, which inspires us to strategically select the proper roles for text augmentation.

DoubleMix: Simple Interpolation-Based Data Augmentation for Text Classification

declare-lab/doublemix • COLING 2022

This paper proposes a simple yet effective interpolation-based data augmentation approach termed DoubleMix, to improve the robustness of models in text classification.

Adaptation of domain-specific transformer models with text oversampling for sentiment analysis of social media posts on Covid-19 vaccines

ace117mc/transformer-models-covid • 22 Sep 2022

Covid-19 has spread across the world and several vaccines have been developed to counter its surge.

Text Augmentation
