Search Results for author: Jia-Chen Gu

Found 32 papers, 23 papers with code

Conversation- and Tree-Structure Losses for Dialogue Disentanglement

no code implementations • dialdoc (ACL) 2022 • Tianda Li, Jia-Chen Gu, Zhen-Hua Ling, Quan Liu

When multiple conversations occur simultaneously, a listener must decide which conversation each utterance is part of in order to interpret and respond to it appropriately.

Disentanglement

Multiscale Matching Driven by Cross-Modal Similarity Consistency for Audio-Text Retrieval

no code implementations • 15 Mar 2024 • Qian Wang, Jia-Chen Gu, Zhen-Hua Ling

Audio-text retrieval (ATR), which retrieves a relevant caption given an audio clip (A2T) and vice versa (T2A), has recently attracted much research attention.

AudioCaps Contrastive Learning +2

Neighboring Perturbations of Knowledge Editing on Large Language Models

1 code implementation • 31 Jan 2024 • Jun-Yu Ma, Jia-Chen Gu, Ningyu Zhang, Zhen-Hua Ling

Despite their exceptional capabilities, large language models (LLMs) are prone to generating unintended text due to false or outdated knowledge.

knowledge editing

Corrective Retrieval Augmented Generation

1 code implementation • 29 Jan 2024 • Shi-Qi Yan, Jia-Chen Gu, Yun Zhu, Zhen-Hua Ling

Experiments on four datasets covering short- and long-form generation tasks show that CRAG can significantly improve the performance of RAG-based approaches.

Retrieval

Leveraging Large Language Models for NLG Evaluation: A Survey

1 code implementation • 13 Jan 2024 • Zhen Li, Xiaohan Xu, Tao Shen, Can Xu, Jia-Chen Gu, Chongyang Tao

In the rapidly evolving domain of Natural Language Generation (NLG) evaluation, introducing Large Language Models (LLMs) has opened new avenues for assessing generated content quality, e.g., coherence, creativity, and context relevance.

nlg evaluation Specificity +1

Model Editing Can Hurt General Abilities of Large Language Models

1 code implementation • 9 Jan 2024 • Jia-Chen Gu, Hao-Xiang Xu, Jun-Yu Ma, Pan Lu, Zhen-Hua Ling, Kai-Wei Chang, Nanyun Peng

One critical challenge that has emerged is the presence of hallucinations in the output of large language models (LLMs) due to false or outdated knowledge.

Model Editing Question Answering

A Comprehensive Study of Knowledge Editing for Large Language Models

2 code implementations • 2 Jan 2024 • Ningyu Zhang, Yunzhi Yao, Bozhong Tian, Peng Wang, Shumin Deng, Mengru Wang, Zekun Xi, Shengyu Mao, Jintian Zhang, Yuansheng Ni, Siyuan Cheng, Ziwen Xu, Xin Xu, Jia-Chen Gu, Yong Jiang, Pengjun Xie, Fei Huang, Lei Liang, Zhiqiang Zhang, Xiaowei Zhu, Jun Zhou, Huajun Chen

In this paper, we first define the knowledge editing problem and then provide a comprehensive review of cutting-edge approaches.

Ranked #1 on knowledge editing on zsRE (using extra training data)

knowledge editing

Is ChatGPT a Good Multi-Party Conversation Solver?

1 code implementation • 25 Oct 2023 • Chao-Hong Tan, Jia-Chen Gu, Zhen-Hua Ling

Large Language Models (LLMs) have emerged as influential instruments within the realm of natural language processing; nevertheless, their capacity to handle multi-party conversations (MPCs) -- a scenario marked by the presence of multiple interlocutors involved in intricate information exchanges -- remains uncharted.

Zero-Shot Learning

Untying the Reversal Curse via Bidirectional Language Model Editing

1 code implementation • 16 Oct 2023 • Jun-Yu Ma, Jia-Chen Gu, Zhen-Hua Ling, Quan Liu, Cong Liu

A new evaluation metric of reversibility is introduced, and a benchmark dubbed as Bidirectional Assessment for Knowledge Editing (BAKE) is constructed to evaluate the reversibility of edited models in recalling knowledge in the reverse direction of editing.

knowledge editing Language Modelling +1

MADNet: Maximizing Addressee Deduction Expectation for Multi-Party Conversation Generation

1 code implementation • 22 May 2023 • Jia-Chen Gu, Chao-Hong Tan, Caiyuan Chu, Zhen-Hua Ling, Chongyang Tao, Quan Liu, Cong Liu

Given an MPC with a few addressee labels missing, existing methods fail to build a consecutively connected conversation graph, but only a few separate conversation fragments instead.

SHINE: Syntax-augmented Hierarchical Interactive Encoder for Zero-shot Cross-lingual Information Extraction

no code implementations • 21 May 2023 • Jun-Yu Ma, Jia-Chen Gu, Zhen-Hua Ling, Quan Liu, Cong Liu, Guoping Hu

The proposed encoder is capable of interactively capturing complementary information between features and contextual information, to derive language-agnostic representations for various IE tasks.

DiffuSIA: A Spiral Interaction Architecture for Encoder-Decoder Text Diffusion

no code implementations • 19 May 2023 • Chao-Hong Tan, Jia-Chen Gu, Zhen-Hua Ling

In fact, the encoder-decoder architecture is naturally more flexible for its detachable encoder and decoder modules, which is extensible to multilingual and multimodal generation tasks for conditions and target texts.

Conditional Text Generation Dialogue Generation +4

GIFT: Graph-Induced Fine-Tuning for Multi-Party Conversation Understanding

1 code implementation • 16 May 2023 • Jia-Chen Gu, Zhen-Hua Ling, Quan Liu, Cong Liu, Guoping Hu

Addressing the issue of who says what to whom in multi-party conversations (MPCs) has recently attracted a lot of research attention.

Speaker Identification

USTC-NELSLIP at SemEval-2023 Task 2: Statistical Construction and Dual Adaptation of Gazetteer for Multilingual Complex NER

no code implementations • 4 May 2023 • Jun-Yu Ma, Jia-Chen Gu, Jiajun Qi, Zhen-Hua Ling, Quan Liu, Xiaoyi Zhao

A method named Statistical Construction and Dual Adaptation of Gazetteer (SCDAG) is proposed for Multilingual Complex NER.

named-entity-recognition Named Entity Recognition +4

Multi-Stage Coarse-to-Fine Contrastive Learning for Conversation Intent Induction

no code implementations • 9 Mar 2023 • Caiyuan Chu, Ya Li, Yifan Liu, Jia-Chen Gu, Quan Liu, Yongxin Ge, Guoping Hu

The key to automatic intention induction is that, for any given set of new data, the sentence representation obtained by the model can be well distinguished from different labels.

Clustering Contrastive Learning +3

WIDER & CLOSER: Mixture of Short-channel Distillers for Zero-shot Cross-lingual Named Entity Recognition

1 code implementation • 7 Dec 2022 • Jun-Yu Ma, Beiduo Chen, Jia-Chen Gu, Zhen-Hua Ling, Wu Guo, Quan Liu, Zhigang Chen, Cong Liu

In this study, a mixture of short-channel distillers (MSD) method is proposed to fully interact the rich hierarchical information in the teacher model and to transfer knowledge to the student model sufficiently and efficiently.

Cross-Lingual NER Domain Adaptation +3

HeterMPC: A Heterogeneous Graph Neural Network for Response Generation in Multi-Party Conversations

1 code implementation • ACL 2022 • Jia-Chen Gu, Chao-Hong Tan, Chongyang Tao, Zhen-Hua Ling, Huang Hu, Xiubo Geng, Daxin Jiang

To address these challenges, we present HeterMPC, a heterogeneous graph-based neural network for response generation in MPCs which models the semantics of utterances and interlocutors simultaneously with two types of nodes in a graph.

Response Generation

TegTok: Augmenting Text Generation via Task-specific and Open-world Knowledge

1 code implementation • Findings (ACL) 2022 • Chao-Hong Tan, Jia-Chen Gu, Chongyang Tao, Zhen-Hua Ling, Can Xu, Huang Hu, Xiubo Geng, Daxin Jiang

To address the problem, we propose augmenting TExt Generation via Task-specific and Open-world Knowledge (TegTok) in a unified framework.

Dialogue Generation Knowledge Graphs +4

Detecting Speaker Personas from Conversational Texts

1 code implementation • EMNLP 2021 • Jia-Chen Gu, Zhen-Hua Ling, Yu Wu, Quan Liu, Zhigang Chen, Xiaodan Zhu

This is a many-to-many semantic matching task because both contexts and personas in SPD are composed of multiple sentences.

MPC-BERT: A Pre-Trained Language Model for Multi-Party Conversation Understanding

1 code implementation • ACL 2021 • Jia-Chen Gu, Chongyang Tao, Zhen-Hua Ling, Can Xu, Xiubo Geng, Daxin Jiang

Recently, various neural models for multi-party conversation (MPC) have achieved impressive improvements on a variety of tasks such as addressee recognition, speaker identification and response prediction.

Language Modelling Speaker Identification

Partner Matters! An Empirical Study on Fusing Personas for Personalized Response Selection in Retrieval-Based Chatbots

1 code implementation • 19 May 2021 • Jia-Chen Gu, Hui Liu, Zhen-Hua Ling, Quan Liu, Zhigang Chen, Xiaodan Zhu

Empirical studies on the Persona-Chat dataset show that the partner personas neglected in previous studies can improve the accuracy of response selection in the IMN- and BERT-based models.

Retrieval

Learning to Retrieve Entity-Aware Knowledge and Generate Responses with Copy Mechanism for Task-Oriented Dialogue Systems

1 code implementation • 22 Dec 2020 • Chao-Hong Tan, Xiaoyu Yang, Zi'ou Zheng, Tianda Li, Yufei Feng, Jia-Chen Gu, Quan Liu, Dan Liu, Zhen-Hua Ling, Xiaodan Zhu

Task-oriented conversational modeling with unstructured knowledge access, Track 1 of the 9th Dialogue System Technology Challenges (DSTC 9), requires building a system that generates a response given the dialogue history and knowledge access.

Response Generation Task-Oriented Dialogue Systems

Filtering before Iteratively Referring for Knowledge-Grounded Response Selection in Retrieval-Based Chatbots

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Jia-Chen Gu, Zhen-Hua Ling, Quan Liu, Zhigang Chen, Xiaodan Zhu

The challenges of building knowledge-grounded retrieval-based chatbots lie in how to ground a conversation on its background knowledge and how to match response candidates with both context and knowledge simultaneously.

Retrieval

DialBERT: A Hierarchical Pre-Trained Model for Conversation Disentanglement

1 code implementation • 8 Apr 2020 • Tianda Li, Jia-Chen Gu, Xiaodan Zhu, Quan Liu, Zhen-Hua Ling, Zhiming Su, Si Wei

Disentanglement is a problem in which multiple conversations occur in the same channel simultaneously, and the listener should decide which utterance is part of the conversation he will respond to.

Conversation Disentanglement Disentanglement

Speaker-Aware BERT for Multi-Turn Response Selection in Retrieval-Based Chatbots

2 code implementations • 7 Apr 2020 • Jia-Chen Gu, Tianda Li, Quan Liu, Zhen-Hua Ling, Zhiming Su, Si Wei, Xiaodan Zhu

In this paper, we study the problem of employing pre-trained language models for multi-turn response selection in retrieval-based chatbots.

Ranked #2 on Conversational Response Selection on RRS Ranking Test

Conversational Response Selection Disentanglement +2

Pre-Trained and Attention-Based Neural Networks for Building Noetic Task-Oriented Dialogue Systems

no code implementations • 4 Apr 2020 • Jia-Chen Gu, Tianda Li, Quan Liu, Xiaodan Zhu, Zhen-Hua Ling, Yu-Ping Ruan

The NOESIS II challenge, Track 2 of the 8th Dialogue System Technology Challenges (DSTC 8), is the extension of DSTC 7.

Ranked #1 on Conversation Disentanglement on irc-disentanglement

Conversation Disentanglement Task-Oriented Dialogue Systems

Fine-Tuning BERT for Schema-Guided Zero-Shot Dialogue State Tracking

no code implementations • 1 Feb 2020 • Yu-Ping Ruan, Zhen-Hua Ling, Jia-Chen Gu, Quan Liu

We present our work on Track 4 in the Dialogue System Technology Challenges 8 (DSTC8).

Dialogue State Tracking Natural Language Understanding +3

Utterance-to-Utterance Interactive Matching Network for Multi-Turn Response Selection in Retrieval-Based Chatbots

1 code implementation • 16 Nov 2019 • Jia-Chen Gu, Zhen-Hua Ling, Quan Liu

The distances between context and response utterances are employed as a prior component when calculating the attention weights.

Ranked #9 on Conversational Response Selection on E-commerce

Conversational Response Selection Retrieval +1

Dually Interactive Matching Network for Personalized Response Selection in Retrieval-Based Chatbots

1 code implementation • IJCNLP 2019 • Jia-Chen Gu, Zhen-Hua Ling, Xiaodan Zhu, Quan Liu

Compared with previous persona fusion approaches which enhance the representation of a context by calculating its similarity with a given persona, the DIM model adopts a dual matching architecture, which performs interactive matching between responses and contexts and between responses and personas respectively for ranking response candidates.

Retrieval

Promoting Diversity for End-to-End Conversation Response Generation

no code implementations • 27 Jan 2019 • Yu-Ping Ruan, Zhen-Hua Ling, Quan Liu, Jia-Chen Gu, Xiaodan Zhu

At this stage, two different models are proposed, i.e., a variational generative (VariGen) model and a retrieval-based (Retrieval) model.

Response Generation Retrieval

Interactive Matching Network for Multi-Turn Response Selection in Retrieval-Based Chatbots

1 code implementation • 7 Jan 2019 • Jia-Chen Gu, Zhen-Hua Ling, Quan Liu

In this paper, we propose an interactive matching network (IMN) for the multi-turn response selection task.

Ranked #8 on Conversational Response Selection on E-commerce

Conversational Response Selection Descriptive +1

Building Sequential Inference Models for End-to-End Response Selection

1 code implementation • 3 Dec 2018 • Jia-Chen Gu, Zhen-Hua Ling, Yu-Ping Ruan, Quan Liu

This paper presents an end-to-end response selection model for Track 1 of the 7th Dialogue System Technology Challenges (DSTC7).

Ranked #5 on Conversational Response Selection on DSTC7 Ubuntu

Conversational Response Selection Descriptive +1
