Search Results for author: Huimin Wang

Found 10 papers, 3 papers with code

A Collaborative Multi-agent Reinforcement Learning Framework for Dialog Action Decomposition

no code implementations EMNLP 2021 Huimin Wang, Kam-Fai Wong

Most reinforcement learning methods for dialog policy learning train a centralized agent that selects a predefined joint action concatenating domain name, intent type, and slot name.

Multi-agent Reinforcement Learning reinforcement-learning +1

Self-DC: When to retrieve and When to generate? Self Divide-and-Conquer for Compositional Unknown Questions

no code implementations21 Feb 2024 Hongru Wang, Boyang Xue, Baohang Zhou, Tianhua Zhang, Cunxiang Wang, Guanhua Chen, Huimin Wang, Kam-Fai Wong

Retrieve-then-read and generate-then-read are two typical solutions to handle unknown and known questions in open-domain question-answering, while the former retrieves necessary external knowledge and the later prompt the large language models to generate internal known knowledge encoded in the parameters.

Binary Classification Open-Domain Question Answering +1

TPE: Towards Better Compositional Reasoning over Conceptual Tools with Multi-persona Collaboration

no code implementations28 Sep 2023 Hongru Wang, Huimin Wang, Lingzhi Wang, Minda Hu, Rui Wang, Boyang Xue, Hongyuan Lu, Fei Mi, Kam-Fai Wong

Large language models (LLMs) have demonstrated exceptional performance in planning the use of various functional tools, such as calculators and retrievers, particularly in question-answering tasks.

Question Answering Response Generation

Dialog Action-Aware Transformer for Dialog Policy Learning

no code implementations5 Sep 2023 Huimin Wang, Wai-Chung Kwan, Kam-Fai Wong

Recent works usually address Dialog policy learning DPL by training a reinforcement learning (RL) agent to determine the best dialog action.

Language Modelling Reinforcement Learning (RL)

JoTR: A Joint Transformer and Reinforcement Learning Framework for Dialog Policy Learning

1 code implementation1 Sep 2023 Wai-Chung Kwan, Huimin Wang, Hongru Wang, Zezhong Wang, Xian Wu, Yefeng Zheng, Kam-Fai Wong

In addition, JoTR employs reinforcement learning with a reward-shaping mechanism to efficiently finetune the word-level dialogue policy, which allows the model to learn from its interactions, improving its performance over time.

Action Generation

CoAD: Automatic Diagnosis through Symptom and Disease Collaborative Generation

1 code implementation17 Jul 2023 Huimin Wang, Wai-Chung Kwan, Kam-Fai Wong, Yefeng Zheng

Automatic diagnosis (AD), a critical application of AI in healthcare, employs machine learning techniques to assist doctors in gathering patient symptom information for precise disease diagnosis.

Disease Prediction Sentence

UniTRec: A Unified Text-to-Text Transformer and Joint Contrastive Learning Framework for Text-based Recommendation

1 code implementation25 May 2023 Zhiming Mao, Huimin Wang, Yiming Du, Kam-Fai Wong

Moreover, conditioned on user history encoded by Transformer encoders, our framework leverages Transformer decoders to estimate the language perplexity of candidate text items, which can serve as a straightforward yet significant contrastive signal for user-item text matching.

Contrastive Learning Text Matching

Integrating Pretrained Language Model for Dialogue Policy Learning

no code implementations2 Nov 2021 Hongru Wang, Huimin Wang, Zezhong Wang, Kam-Fai Wong

Reinforcement Learning (RL) has been witnessed its potential for training a dialogue policy agent towards maximizing the accumulated rewards given from users.

Language Modelling Reinforcement Learning (RL)

Learning Efficient Dialogue Policy from Demonstrations through Shaping

no code implementations ACL 2020 Huimin Wang, Baolin Peng, Kam-Fai Wong

Training a task-oriented dialogue agent with reinforcement learning is prohibitively expensive since it requires a large volume of interactions with users.

Domain Adaptation

Cannot find the paper you are looking for? You can Submit a new open access paper.