no code implementations • EACL (WASSA) 2021 • Wazir Ali, Naveed Ali, Yong Dai, Jay Kumar, Saifullah Tumrani, Zenglin Xu
In this paper, we develop a Sindhi subjective lexicon by merging existing English resources: the NRC lexicon, a list of opinion words, SentiWordNet, a Sindhi-English bilingual dictionary, and a collection of Sindhi modifiers.
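The paper itself ships no code, but the core merging step can be pictured as a dictionary join that projects English polarity labels onto Sindhi words through the bilingual dictionary. The Python sketch below is purely illustrative: the word lists, translations, and polarity labels are toy stand-ins, not the actual NRC, SentiWordNet, or dictionary entries.

```python
# Toy stand-ins for the merged English resources (NRC lexicon, opinion
# words, SentiWordNet): English subjective words with polarity labels.
english_polarity = {
    "good": "positive",
    "happy": "positive",
    "bad": "negative",
    "sad": "negative",
}

# Toy Sindhi-English bilingual dictionary: English headword -> Sindhi translations.
bilingual_dict = {
    "good": ["sutho"],
    "bad": ["kharab"],
    "sad": ["udaas"],
}

def build_sindhi_lexicon(polarity, bilingual):
    """Project English polarity labels onto Sindhi words via the bilingual dictionary."""
    lexicon = {}
    for en_word, label in polarity.items():
        for sd_word in bilingual.get(en_word, []):
            lexicon[sd_word] = label
    return lexicon

print(build_sindhi_lexicon(english_polarity, bilingual_dict))
# {'sutho': 'positive', 'kharab': 'negative', 'udaas': 'negative'}
```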
no code implementations • Findings (ACL) 2022 • Yong Dai, Linyang Li, Cong Zhou, Zhangyin Feng, Enbo Zhao, Xipeng Qiu, Piji Li, Duyu Tang
The meaning of a word in Chinese is different in that a word is a compositional unit consisting of multiple characters.
1 code implementation • 16 Apr 2024 • Pengyu Cheng, Tianhao Hu, Han Xu, Zhisong Zhang, Yong Dai, Lei Han, Nan Du
Hence, we are curious about whether LLMs' reasoning ability can be further enhanced by Self-Play in this Adversarial language Game (SPAG).
no code implementations • 14 Apr 2024 • Quanxiu Wang, Hui Huang, Mingjie Wang, Yong Dai, Jinzuomu Zhong, Benlai Tang
Furthermore, in the second stage, a parallelized TTS frontend model is carefully devised to execute the TN, PD, and PBP prediction tasks.
no code implementations • 26 Feb 2024 • Anchun Gui, Jian Li, Yong Dai, Nan Du, Han Xiao
Meanwhile, we propose a novel tool sampling strategy to enhance the generalizability of LLMs over unseen tools.
no code implementations • 8 Feb 2024 • Mingjie Wang, Jun Zhou, Yong Dai, Eric Buys, Minglun Gong
Recently, the Class-Agnostic Counting (CAC) problem has garnered increasing attention owing to its intriguing generality and superior efficiency compared to Category-Specific Counting (CSC).
1 code implementation • 30 Jan 2024 • Bang Yang, Yong Dai, Xuxin Cheng, Yaowei Li, Asif Raza, Yuexian Zou
To alleviate CF caused by covariate shift and lexical overlap, we further propose a novel approach that ensures an identical distribution of all token embeddings at initialization and regularizes token embedding learning during training.
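The excerpt only names the two ingredients: distribution-matched initialization of token embeddings and a regularizer on embedding learning. The numpy sketch below shows one plausible reading of them; everything beyond the excerpt (the Gaussian fitting, the specific penalty) is an assumption for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Pretend this is the existing embedding table of the original vocabulary.
old_embeddings = rng.normal(0.0, 0.02, size=(30000, 768))

# Assumed reading of "identical distribution at initialization": draw
# new-token embeddings from a Gaussian fitted to the existing table, so old
# and new tokens start from the same distribution.
mean = old_embeddings.mean(axis=0)
std = old_embeddings.std(axis=0)
new_embeddings = rng.normal(mean, std, size=(5000, 768))

def embedding_regularizer(new_emb, anchor_mean, weight=1e-4):
    """Assumed reading of the embedding regularizer: keep new embeddings
    close to the distribution they were initialized from."""
    return weight * np.mean((new_emb - anchor_mean) ** 2)

print(embedding_regularizer(new_embeddings, mean))
```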
1 code implementation • 25 Jan 2024 • Hongliang He, Wenlin Yao, Kaixin Ma, Wenhao Yu, Yong Dai, Hongming Zhang, Zhenzhong Lan, Dong Yu
The rapid advancement of large language models (LLMs) has led to a new era marked by the development of autonomous applications in real-world scenarios, which drives innovation in creating advanced web agents.
no code implementations • 22 Dec 2023 • Zhangyin Feng, Runyi Hu, Liangxin Liu, Fan Zhang, Duyu Tang, Yong Dai, Xiaocheng Feng, Jiwei Li, Bing Qin, Shuming Shi
Compared with autoregressive baselines that need to run one thousand times, our model runs only 16 times to generate images of competitive quality with an order of magnitude lower inference latency.
1 code implementation • 12 Dec 2023 • Dun Zeng, Yong Dai, Pengyu Cheng, Longyue Wang, Tianhao Hu, Wanshun Chen, Nan Du, Zenglin Xu
Our analysis reveals a correlation between the calibration performance of reward models (RMs) and the alignment performance of LLMs.
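Calibration of a reward model can be quantified, for instance, with an expected calibration error over its pairwise preference probabilities; the sketch below is a generic ECE computation, not necessarily the exact metric used in the paper.

```python
import numpy as np

def expected_calibration_error(probs, labels, n_bins=10):
    """Generic ECE: compare predicted preference probabilities with empirical
    accuracy inside equal-width confidence bins."""
    probs, labels = np.asarray(probs), np.asarray(labels)
    confidences = np.where(probs >= 0.5, probs, 1.0 - probs)
    predictions = (probs >= 0.5).astype(int)
    bins = np.linspace(0.5, 1.0, n_bins + 1)
    ece = 0.0
    for low, high in zip(bins[:-1], bins[1:]):
        mask = (confidences >= low) & (confidences < high)
        if mask.any():
            acc = (predictions[mask] == labels[mask]).mean()
            conf = confidences[mask].mean()
            ece += mask.mean() * abs(acc - conf)
    return ece

# Toy example: RM probabilities that response A beats response B, and the
# human-labelled outcomes (1 = A preferred).
print(expected_calibration_error([0.9, 0.8, 0.55, 0.3], [1, 1, 0, 0]))
```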
1 code implementation • 14 Nov 2023 • Pengyu Cheng, Yifan Yang, Jian Li, Yong Dai, Tianhao Hu, Peixin Cao, Nan Du
Human preference alignment is essential to improve the interaction quality of large language models (LLMs).
1 code implementation • 9 Nov 2023 • Shuyi Xie, Wenlin Yao, Yong Dai, Shaobo Wang, Donlin Zhou, Lifeng Jin, Xinhua Feng, Pengzhi Wei, Yujie Lin, Zhichao Hu, Dong Yu, Zhengyou Zhang, Jing Nie, Yuhong Liu
We construct a hierarchical task tree encompassing 7 major areas, over 200 categories, and over 800 tasks, covering diverse capabilities such as question answering, reasoning, multi-turn dialogue, and text generation, to evaluate LLMs in a comprehensive and in-depth manner.
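A hierarchical task tree of this kind maps naturally onto nested dictionaries; the sketch below uses invented area, category, and task names purely to show the structure, not the benchmark's actual taxonomy.

```python
# Illustrative structure only: the area/category/task names are invented.
task_tree = {
    "language_understanding": {
        "question_answering": ["open_domain_qa", "reading_comprehension"],
        "reasoning": ["commonsense_reasoning", "math_word_problems"],
    },
    "language_generation": {
        "dialogue": ["multi_turn_chat"],
        "text_generation": ["summarization", "story_writing"],
    },
}

def iter_tasks(tree):
    """Yield (area, category, task) triples by walking the tree."""
    for area, categories in tree.items():
        for category, tasks in categories.items():
            for task in tasks:
                yield area, category, task

for triple in iter_tasks(task_tree):
    print(triple)
```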
1 code implementation • 6 Sep 2023 • Pengyu Cheng, Jiawen Xie, Ke Bai, Yong Dai, Nan Du
In addition, from the perspective of data efficiency, we propose a three-stage customized RM learning scheme and then empirically verify its effectiveness on both general preference datasets and our DSP set.
no code implementations • 25 Aug 2023 • Jiawen Xie, Pengyu Cheng, Xiao Liang, Yong Dai, Nan Du
Although dominant in natural language processing, transformer-based models remain challenged by the task of long-sequence processing, because the computational cost of self-attention operations in transformers swells quadratically with the input sequence length.
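The quadratic cost comes from materializing an n-by-n attention score matrix; the minimal numpy sketch below (query/key/value projections omitted) makes that explicit.

```python
import numpy as np

def naive_self_attention(x):
    """Plain dot-product self-attention over a length-n sequence; the score
    matrix has shape (n, n), so time and memory grow with n**2."""
    n, d = x.shape
    scores = x @ x.T / np.sqrt(d)                  # (n, n): the quadratic term
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ x                             # (n, d)

x = np.random.default_rng(0).normal(size=(512, 64))
print(naive_self_attention(x).shape)  # (512, 64); doubling n quadruples the score matrix
```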
no code implementations • 28 Jun 2023 • Zhangyin Feng, Yong Dai, Fan Zhang, Duyu Tang, Xiaocheng Feng, Shuangzhi Wu, Bing Qin, Yunbo Cao, Shuming Shi
Traditional multitask learning methods can typically exploit common knowledge only task-wise or language-wise, thereby losing either cross-language or cross-task knowledge.
1 code implementation • 20 Dec 2022 • Zhuo Zhang, Yuanhang Yang, Yong Dai, Lizhen Qu, Zenglin Xu
To facilitate research on PETuning in FL, we also develop a federated tuning framework, FedPETuning, which allows practitioners to conveniently apply different PETuning methods under the FL training paradigm.
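Parameter-efficient tuning under federated learning implies that clients keep the backbone frozen and exchange only their small tuned parameter sets with the server. The FedAvg-style sketch below is an assumption about how that exchange might look, not FedPETuning's actual API.

```python
import numpy as np

# Assumption: each client holds only its small PETuning parameters
# (e.g. adapter weights); the frozen backbone never leaves the client.
def fed_avg(client_params, client_sizes):
    """Weighted average of per-client parameter dictionaries (FedAvg-style)."""
    total = sum(client_sizes)
    keys = client_params[0].keys()
    return {
        k: sum(p[k] * (n / total) for p, n in zip(client_params, client_sizes))
        for k in keys
    }

rng = np.random.default_rng(0)
clients = [{"adapter.weight": rng.normal(size=(8, 768))} for _ in range(3)]
sizes = [1000, 2000, 500]
global_params = fed_avg(clients, sizes)
print(global_params["adapter.weight"].shape)  # (8, 768): only this is communicated
```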
no code implementations • 20 Aug 2022 • Yanjie Gou, Yinjie Lei, Lingqiao Liu, Yong Dai, Chunxu Shen, Yongqi Tong
Existing works usually formulate span detection as a 1D token tagging problem and model sentiment recognition with a 2D tagging matrix of token pairs.
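The 2D tagging view can be illustrated with a small matrix in which cell (i, j) labels the span running from token i to token j; the sentence and label scheme below are invented for illustration.

```python
import numpy as np

tokens = ["the", "battery", "life", "is", "great"]
n = len(tokens)

# 2D tagging matrix: entry (i, j) labels the span tokens[i..j].
# 0 = no span, 1 = positive, 2 = negative (label scheme is illustrative).
tag_matrix = np.zeros((n, n), dtype=int)

# Mark "battery life" (tokens 1..2) as an aspect span with positive sentiment.
tag_matrix[1, 2] = 1

def decode_spans(matrix, tokens):
    """Read span/sentiment predictions back out of the upper-triangular matrix."""
    spans = []
    for i in range(len(tokens)):
        for j in range(i, len(tokens)):
            if matrix[i, j] != 0:
                spans.append((" ".join(tokens[i:j + 1]), matrix[i, j]))
    return spans

print(decode_spans(tag_matrix, tokens))  # [('battery life', 1)]
```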
no code implementations • 3 Aug 2022 • Shuming Shi, Enbo Zhao, Duyu Tang, Yan Wang, Piji Li, Wei Bi, Haiyun Jiang, Guoping Huang, Leyang Cui, Xinting Huang, Cong Zhou, Yong Dai, Dongyang Ma
In Effidit, we significantly expand the capacities of a writing assistant by providing functions in five categories: text completion, error checking, text polishing, keywords to sentences (K2S), and cloud input methods (cloud IME).
no code implementations • 12 May 2022 • Yong Dai, Duyu Tang, Liangxin Liu, Minghuan Tan, Cong Zhou, Jingquan Wang, Zhangyin Feng, Fan Zhang, Xueyu Hu, Shuming Shi
Moreover, our model supports self-supervised pretraining in the same sparsely activated manner, resulting in better-initialized parameters for different modalities.
no code implementations • 26 Apr 2022 • Cong Zhou, Yong Dai, Duyu Tang, Enbo Zhao, Zhangyin Feng, Li Kuang, Shuming Shi
We achieve this by introducing a special token [null], whose prediction stands for the non-existence of a word.
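The role of the [null] token can be pictured as one extra vocabulary entry whose prediction means "no word at this position"; the decoding rule below is a hedged illustration with made-up scores, not the paper's model.

```python
import numpy as np

# Toy vocabulary with an extra [null] entry (index 0). The scores are made
# up; in the real model they would come from the network's output layer.
vocab = ["[null]", "apple", "banana", "cherry"]
logits_per_position = np.array([
    [0.1, 2.5, 0.3, 0.2],   # position 0: a real word is predicted
    [3.0, 0.2, 0.1, 0.4],   # position 1: [null] wins -> no word here
])

def decode(logits, vocab, null_token="[null]"):
    """Keep only positions where the argmax is not the [null] token."""
    words = []
    for row in logits:
        token = vocab[int(row.argmax())]
        if token != null_token:
            words.append(token)
    return words

print(decode(logits_per_position, vocab))  # ['apple']
```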
1 code implementation • 12 Mar 2022 • Linyang Li, Yong Dai, Duyu Tang, Xipeng Qiu, Zenglin Xu, Shuming Shi
In this work, we present a Chinese BERT model, dubbed MarkBERT, that uses word information.
no code implementations • 7 Mar 2022 • Fan Zhang, Duyu Tang, Yong Dai, Cong Zhou, Shuangzhi Wu, Shuming Shi
The key feature of our approach is that it is sparsely activated, guided by predefined skills.
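One way to read "sparsely activated, guided by predefined skills" is that each input is routed only through the module registered for its skill while the other modules stay idle; the skill-to-module lookup below is an assumption for illustration, not the paper's architecture.

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumption: one small expert module (here, a single weight matrix) per
# predefined skill; only the module matching the input's skill is activated.
skill_modules = {
    "translation": rng.normal(size=(64, 64)),
    "summarization": rng.normal(size=(64, 64)),
    "dialogue": rng.normal(size=(64, 64)),
}

def sparse_forward(hidden, skill):
    """Activate only the expert module registered for the given skill."""
    return hidden @ skill_modules[skill]

hidden_state = rng.normal(size=(1, 64))
print(sparse_forward(hidden_state, "summarization").shape)  # (1, 64)
```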
1 code implementation • ACL 2022 • Minghuan Tan, Yong Dai, Duyu Tang, Zhangyin Feng, Guoping Huang, Jing Jiang, Jiwei Li, Shuming Shi
We find that a frozen GPT achieves state-of-the-art performance on perfect pinyin.
no code implementations • 1 Mar 2022 • Yong Dai, Linyang Li, Cong Zhou, Zhangyin Feng, Enbo Zhao, Xipeng Qiu, Piji Li, Duyu Tang
The meaning of a word in Chinese is different in that a word is a compositional unit consisting of multiple characters.
no code implementations • 9 May 2021 • Yong Dai, Jian Liu, Jian Zhang, Hongguang Fu, Zenglin Xu
The first mechanism is a selective domain adaptation (SDA) method, which transfers knowledge from the closest source domain.
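Transferring knowledge from the closest source domain presupposes some measure of domain similarity; the sketch below ranks source domains by the Euclidean distance between mean feature vectors, a stand-in assumption rather than the paper's actual SDA criterion.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy feature sets: three source domains and one target domain.
sources = {f"source_{i}": rng.normal(loc=i, size=(200, 32)) for i in range(3)}
target = rng.normal(loc=1.1, size=(200, 32))

def closest_source(sources, target):
    """Pick the source domain whose mean feature vector is nearest the
    target's (a simple stand-in for a domain-similarity criterion)."""
    target_mean = target.mean(axis=0)
    distances = {
        name: float(np.linalg.norm(feats.mean(axis=0) - target_mean))
        for name, feats in sources.items()
    }
    return min(distances, key=distances.get), distances

print(closest_source(sources, target)[0])  # 'source_1' for this toy data
```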
no code implementations • EMNLP 2021 • Yanjie Gou, Yinjie Lei, Lingqiao Liu, Yong Dai, Chunxu Shen
Incorporating knowledge bases (KB) into end-to-end task-oriented dialogue systems is challenging, since it requires properly representing each KB entity, which is associated with both its KB context and the dialogue context.
Ranked #2 on Task-Oriented Dialogue Systems on KVRET
no code implementations • 10 Jun 2020 • Yong Dai, Jian Liu, Xiancong Ren, Zenglin Xu
Existing MS-UDA algorithms either exploit only the shared features, i.e., the domain-invariant information, or rely on weak assumptions in NLP, e.g., the smoothness assumption.