1 code implementation • 27 Feb 2024 • Zhengxiang Wang, Owen Rambow
We propose a novel clustering pipeline to detect and characterize influence campaigns from documents.
no code implementations • 29 Jun 2023 • Zhengxiang Wang
This paper investigates the effectiveness of token-level text augmentation and the role of probabilistic linguistic knowledge within a linguistically-motivated evaluation context.
1 code implementation • 13 Mar 2023 • Zhengxiang Wang
The paper studies the capabilities of Recurrent-Neural-Network sequence to sequence (RNN seq2seq) models in learning four transduction tasks: identity, reversal, total reduplication, and quadratic copying.
1 code implementation • 2 Sep 2022 • Zhengxiang Wang
We present three large-scale experiments on binary text matching classification task both in Chinese and English to evaluate the effectiveness and generalizability of random text perturbations as a data augmentation approach for NLP.
1 code implementation • 29 Nov 2021 • Zhengxiang Wang
To investigate the role of linguistic knowledge in data augmentation (DA) for Natural Language Processing (NLP), we designed two adapted DA programs and applied them to LCQMC (a Large-scale Chinese Question Matching Corpus) for a binary Chinese question matching classification task.