Search Results for author: Zhengxiang Wang

Found 5 papers, 4 papers with code

Clustering Document Parts: Detecting and Characterizing Influence Campaigns from Documents

1 code implementation • 27 Feb 2024 • Zhengxiang Wang, Owen Rambow

We propose a novel clustering pipeline to detect and characterize influence campaigns from documents.

Paper
Code

Probabilistic Linguistic Knowledge and Token-level Text Augmentation

no code implementations • 29 Jun 2023 • Zhengxiang Wang

This paper investigates the effectiveness of token-level text augmentation and the role of probabilistic linguistic knowledge within a linguistically-motivated evaluation context.

Text Augmentation

Paper
Add Code

Learning Transductions and Alignments with RNN Seq2seq Models

1 code implementation • 13 Mar 2023 • Zhengxiang Wang

The paper studies the capabilities of Recurrent-Neural-Network sequence to sequence (RNN seq2seq) models in learning four transduction tasks: identity, reversal, total reduplication, and quadratic copying.

Out-of-Distribution Generalization

Paper
Code

Random Text Perturbations Work, but not Always

1 code implementation • 2 Sep 2022 • Zhengxiang Wang

We present three large-scale experiments on binary text matching classification task both in Chinese and English to evaluate the effectiveness and generalizability of random text perturbations as a data augmentation approach for NLP.

Classification Data Augmentation +1

Paper
Code

Linguistic Knowledge in Data Augmentation for Natural Language Processing: An Example on Chinese Question Matching

1 code implementation • 29 Nov 2021 • Zhengxiang Wang

To investigate the role of linguistic knowledge in data augmentation (DA) for Natural Language Processing (NLP), we designed two adapted DA programs and applied them to LCQMC (a Large-scale Chinese Question Matching Corpus) for a binary Chinese question matching classification task.

Language Modelling Question Similarity +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.