Search Results for author: Ruochen Zhao

Found 11 papers, 5 papers with code

Lifelong Event Detection with Embedding Space Separation and Compaction

no code implementations • 3 Apr 2024 • Chengwei Qin, Ruirui Chen, Ruochen Zhao, Wenhan Xia, Shafiq Joty

However, the simple combination of memory data and new-task samples can still result in substantial forgetting of previously acquired knowledge, which may occur due to the potential overlap between the feature distribution of new data and the previously learned embedding space.

Event Detection Transfer Learning

Paper
Add Code

How Much are LLMs Contaminated? A Comprehensive Survey and the LLMSanitize Library

1 code implementation • 31 Mar 2024 • Mathieu Ravaut, Bosheng Ding, Fangkai Jiao, Hailin Chen, Xingxuan Li, Ruochen Zhao, Chengwei Qin, Caiming Xiong, Shafiq Joty

With the rise of Large Language Models (LLMs) in recent years, new opportunities are emerging, but also new challenges, and contamination is quickly becoming critical.

Question Answering

Paper
Code

Data Augmentation using LLMs: Data Perspectives, Learning Paradigms and Challenges

no code implementations • 5 Mar 2024 • Bosheng Ding, Chengwei Qin, Ruochen Zhao, Tianze Luo, Xinze Li, Guizhen Chen, Wenhan Xia, Junjie Hu, Anh Tuan Luu, Shafiq Joty

In the rapidly evolving field of machine learning (ML), data augmentation (DA) has emerged as a pivotal technique for enhancing model performance by diversifying training examples without the need for additional data collection.

Data Augmentation

Paper
Add Code

ChatGPT's One-year Anniversary: Are Open-Source Large Language Models Catching up?

1 code implementation • 28 Nov 2023 • Hailin Chen, Fangkai Jiao, Xingxuan Li, Chengwei Qin, Mathieu Ravaut, Ruochen Zhao, Caiming Xiong, Shafiq Joty

Upon its release in late 2022, ChatGPT has brought a seismic shift in the entire landscape of AI, both in research and commerce.

Language Modelling Large Language Model

Paper
Code

PromptSum: Parameter-Efficient Controllable Abstractive Summarization

no code implementations • 6 Aug 2023 • Mathieu Ravaut, Hailin Chen, Ruochen Zhao, Chengwei Qin, Shafiq Joty, Nancy Chen

Prompt tuning (PT), a parameter-efficient technique that only tunes the additional prompt embeddings while keeping the backbone pre-trained language model (PLM) frozen, has shown promising results in language understanding tasks, especially in low-resource scenarios.

Abstractive Text Summarization Language Modelling

Paper
Add Code

Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources

1 code implementation • 22 May 2023 • Xingxuan Li, Ruochen Zhao, Yew Ken Chia, Bosheng Ding, Shafiq Joty, Soujanya Poria, Lidong Bing

Specifically, CoK consists of three stages: reasoning preparation, dynamic knowledge adapting, and answer consolidation.

Hallucination Language Modelling +1

Paper
Code

Randomized Smoothing with Masked Inference for Adversarially Robust Text Classifications

1 code implementation • 11 May 2023 • Han Cheol Moon, Shafiq Joty, Ruochen Zhao, Megh Thakkar, Xu Chi

Large-scale pre-trained language models have shown outstanding performance in a variety of NLP tasks.

Adversarial Robustness

Paper
Code

Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought Framework

1 code implementation • 5 May 2023 • Ruochen Zhao, Xingxuan Li, Shafiq Joty, Chengwei Qin, Lidong Bing

As large language models (LLMs) have become the norm in NLP, demonstrating good performance in generation and reasoning tasks, one of its most fatal disadvantages is the lack of factual correctness.

Open-Domain Question Answering

Paper
Code

Explaining Language Models' Predictions with High-Impact Concepts

no code implementations • 3 May 2023 • Ruochen Zhao, Shafiq Joty, Yongjie Wang, Tan Wang

The emergence of large-scale pretrained language models has posed unprecedented challenges in deriving explanations of why the model has made some predictions.

Fairness Vocal Bursts Intensity Prediction

Paper
Add Code

Retrieving Multimodal Information for Augmented Generation: A Survey

no code implementations • 20 Mar 2023 • Ruochen Zhao, Hailin Chen, Weishi Wang, Fangkai Jiao, Xuan Long Do, Chengwei Qin, Bosheng Ding, Xiaobao Guo, Minzhi Li, Xingxuan Li, Shafiq Joty

As Large Language Models (LLMs) become popular, there emerged an important trend of using multimodality to augment the LLMs' generation ability, which enables LLMs to better interact with the world.

Retrieval

Paper
Add Code

Learning to Initialize: Can Meta Learning Improve Cross-task Generalization in Prompt Tuning?

no code implementations • 16 Feb 2023 • Chengwei Qin, Qian Li, Ruochen Zhao, Shafiq Joty

Despite this, PT has been shown to rely heavily on good initialization of the prompt embeddings.

Few-Shot Learning Language Modelling +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.