14 Nov 2023 • Pengyu Cheng, Yifan Yang, Jian Li, Yong Dai, Tianhao Hu, Peixin Cao, Nan Du
Human preference alignment is essential for improving the interaction quality of large language models (LLMs).