Search Results for author: Shichao Sun

Found 7 papers, 6 papers with code

Dissecting Human and LLM Preferences

1 code implementation17 Feb 2024 Junlong Li, Fan Zhou, Shichao Sun, Yikai Zhang, Hai Zhao, PengFei Liu

As a relative quality comparison of model responses, human and Large Language Model (LLM) preferences serve as common alignment goals in model fine-tuning and criteria in evaluation.

Language Modelling Large Language Model

The Critique of Critique

1 code implementation9 Jan 2024 Shichao Sun, Junlong Li, Weizhe Yuan, Ruifeng Yuan, Wenjie Li, PengFei Liu

In this paper, we pioneer the critique of critique, termed MetaCritique, which is a framework to evaluate the critique from two aspects, i. e., factuality as precision score and comprehensiveness as recall score.

Question Answering

Evolving Large Language Model Assistant with Long-Term Conditional Memory

no code implementations22 Dec 2023 Ruifeng Yuan, Shichao Sun, Zili Wang, Ziqiang Cao, Wenjie Li

It focuses on preserving the knowledge and experience from the history dialogue between the user and AI assistant, which can be applied to future dialogue for generating a better response.

Language Modelling Large Language Model +1

Generative Judge for Evaluating Alignment

1 code implementation9 Oct 2023 Junlong Li, Shichao Sun, Weizhe Yuan, Run-Ze Fan, Hai Zhao, PengFei Liu

The rapid development of Large Language Models (LLMs) has substantially expanded the range of tasks they can address.

Aligning Language Models with Human Preferences via a Bayesian Approach

1 code implementation NeurIPS 2023 Jiashuo Wang, Haozhao Wang, Shichao Sun, Wenjie Li

For this alignment, current popular methods leverage a reinforcement learning (RL) approach with a reward model trained on feedback from humans.

Contrastive Learning Reinforcement Learning (RL) +1

Improving Sentence Similarity Estimation for Unsupervised Extractive Summarization

1 code implementation24 Feb 2023 Shichao Sun, Ruifeng Yuan, Wenjie Li, Sujian Li

Unsupervised extractive summarization aims to extract salient sentences from a document as the summary without labeled data.

Contrastive Learning Extractive Summarization +3

Alleviating Exposure Bias via Contrastive Learning for Abstractive Text Summarization

1 code implementation26 Aug 2021 Shichao Sun, Wenjie Li

During the training stage, with teacher forcing these models are optimized to maximize the likelihood of the gold summary given the gold summary tokens as input to the decoder, while at inference the given tokens are replaced by the generated tokens.

Abstractive Text Summarization Contrastive Learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.