Paraphrase Generation

68 papers with code • 3 benchmarks • 16 datasets

Paraphrase Generation involves transforming a natural language sentence to a new sentence, that has the same semantic meaning but a different syntactic or lexical surface form.

Benchmarks

Add a Result

These leaderboards are used to track progress in Paraphrase Generation

Dataset	Best Model	Compare
Paralex	HRQ-VAE	See all
Quora Question Pairs	Separator	See all
MSCOCO	HRQ-VAE	See all

Datasets

Subtasks

Multilingual Paraphrase Generation

Latest papers

Most implemented Social Latest No code

TESS: Text-to-Text Self-Conditioned Simplex Diffusion

allenai/tess-diffusion • • 15 May 2023

Diffusion models have emerged as a powerful paradigm for generation, obtaining strong performance in various continuous domains.

15 May 2023

Paper
Code

Lost in Translationese? Reducing Translation Effect Using Abstract Meaning Representation

shirawein/amr-translationese • 23 Apr 2023

Though individual translated texts are often fluent and preserve meaning, at a large scale, translated texts have statistical tendencies which distinguish them from text originally written in the language ("translationese") and can affect model performance.

23 Apr 2023

Paper
Code

Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense

martiansideofthemoon/ai-detection-paraphrases • • NeurIPS 2023

To increase the robustness of AI-generated text detection to paraphrase attacks, we introduce a simple defense that relies on retrieving semantically-similar generations and must be maintained by a language model API provider.

110

23 Mar 2023

Paper
Code

kNN-BOX: A Unified Framework for Nearest Neighbor Generation

njunlp/knn-box • • 27 Feb 2023

Augmenting the base neural model with a token-level symbolic datastore is a novel generation paradigm and has achieved promising results in machine translation (MT).

27 Feb 2023

Paper
Code

Syntactically Robust Training on Partially-Observed Data for Open Information Extraction

qijimrc/robustoie • • 17 Jan 2023

In this paper, we propose a syntactically robust training framework that enables models to be trained on a syntactic-abundant distribution based on diverse paraphrase generation.

17 Jan 2023

Paper
Code

Language as a Latent Sequence: deep latent variable models for semi-supervised paraphrase generation

jialin-yu/latent-sequence-paraphrase • • 5 Jan 2023

To leverage information from text pairs, we additionally introduce a novel supervised model we call dual directional learning (DDL), which is designed to integrate with our proposed VSAR model.

05 Jan 2023

Paper
Code

How Large Language Models are Transforming Machine-Paraphrased Plagiarism

jpwahle/emnlp22-transforming • • 7 Oct 2022

The recent success of large language models for text generation poses a severe threat to academic integrity, as plagiarists can generate realistic paraphrases indistinguishable from original work.

07 Oct 2022

Paper
Code

Continuous Decomposition of Granularity for Neural Paraphrase Generation

guxd/c-dnpg • • COLING 2022

While Transformers have had significant success in paragraph generation, they treat sentences as linear sequences of tokens and often neglect their hierarchical information.

05 Sep 2022

Paper
Code

PCC: Paraphrasing with Bottom-k Sampling and Cyclic Learning for Curriculum Data Augmentation

hongyuanluke/pcc • • 17 Aug 2022

This paper presents \textbf{PCC}: \textbf{P}araphrasing with Bottom-k Sampling and \textbf{C}yclic Learning for \textbf{C}urriculum Data Augmentation, a novel CDA framework via paraphrasing, which exploits the textual paraphrase similarity as the curriculum difficulty measure.

17 Aug 2022

Paper
Code

'John ate 5 apples' != 'John ate some apples': Self-Supervised Paraphrase Quality Detection for Algebraic Word Problems

ads-ai/paraqd • 16 Jun 2022

There is a need for paraphrase scoring methods in the context of AWP to enable the training of good paraphrasers.

16 Jun 2022

Paper
Code

Paraphrase Generation

Benchmarks Add a Result

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result