Search Results for author: Fengshuo Bai

Found 3 papers, 0 papers with code

Incentive Compatibility for AI Alignment in Sociotechnical Systems: Positions and Prospects

no code implementations20 Feb 2024 Zhaowei Zhang, Fengshuo Bai, Mingzhi Wang, Haoyang Ye, Chengdong Ma, Yaodong Yang

The burgeoning integration of artificial intelligence (AI) into human society brings forth significant implications for societal governance and safety.

Measuring Value Understanding in Language Models through Discriminator-Critique Gap

no code implementations30 Sep 2023 Zhaowei Zhang, Fengshuo Bai, Jun Gao, Yaodong Yang

We argue that truly understanding values in LLMs requires considering both "know what" and "know why".

Zero-shot Preference Learning for Offline RL via Optimal Transport

no code implementations6 Jun 2023 Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li

In this paper, we propose a novel zero-shot preference-based RL algorithm that leverages labeled preference data from source tasks to infer labels for target tasks, eliminating the requirement for human queries.

Offline RL

Cannot find the paper you are looking for? You can Submit a new open access paper.