Search Results for author: Shichao Sun

Found 7 papers, 6 papers with code

Dissecting Human and LLM Preferences

1 code implementation • 17 Feb 2024 • Junlong Li, Fan Zhou, Shichao Sun, Yikai Zhang, Hai Zhao, PengFei Liu

As a relative quality comparison of model responses, human and Large Language Model (LLM) preferences serve as common alignment goals in model fine-tuning and criteria in evaluation.

Language Modelling Large Language Model

Paper
Code

The Critique of Critique

1 code implementation • 9 Jan 2024 • Shichao Sun, Junlong Li, Weizhe Yuan, Ruifeng Yuan, Wenjie Li, PengFei Liu

In this paper, we pioneer the critique of critique, termed MetaCritique, which is a framework to evaluate the critique from two aspects, i. e., factuality as precision score and comprehensiveness as recall score.

Question Answering

Paper
Code

Evolving Large Language Model Assistant with Long-Term Conditional Memory

no code implementations • 22 Dec 2023 • Ruifeng Yuan, Shichao Sun, Zili Wang, Ziqiang Cao, Wenjie Li

It focuses on preserving the knowledge and experience from the history dialogue between the user and AI assistant, which can be applied to future dialogue for generating a better response.

Language Modelling Large Language Model +1

Paper
Add Code

Generative Judge for Evaluating Alignment

1 code implementation • 9 Oct 2023 • Junlong Li, Shichao Sun, Weizhe Yuan, Run-Ze Fan, Hai Zhao, PengFei Liu

The rapid development of Large Language Models (LLMs) has substantially expanded the range of tasks they can address.

178

Paper
Code

Aligning Language Models with Human Preferences via a Bayesian Approach

1 code implementation • NeurIPS 2023 • Jiashuo Wang, Haozhao Wang, Shichao Sun, Wenjie Li

For this alignment, current popular methods leverage a reinforcement learning (RL) approach with a reward model trained on feedback from humans.

Contrastive Learning Reinforcement Learning (RL) +1

Paper
Code

Improving Sentence Similarity Estimation for Unsupervised Extractive Summarization

1 code implementation • 24 Feb 2023 • Shichao Sun, Ruifeng Yuan, Wenjie Li, Sujian Li

Unsupervised extractive summarization aims to extract salient sentences from a document as the summary without labeled data.

Contrastive Learning Extractive Summarization +3

Paper
Code

Alleviating Exposure Bias via Contrastive Learning for Abstractive Text Summarization

1 code implementation • 26 Aug 2021 • Shichao Sun, Wenjie Li

During the training stage, with teacher forcing these models are optimized to maximize the likelihood of the gold summary given the gold summary tokens as input to the decoder, while at inference the given tokens are replaced by the generated tokens.

Abstractive Text Summarization Contrastive Learning +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.