Search Results for author: Zibin Dong

Found 3 papers, 1 papers with code

Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback

1 code implementation4 Feb 2024 Yifu Yuan, Jianye Hao, Yi Ma, Zibin Dong, Hebin Liang, Jinyi Liu, Zhixin Feng, Kai Zhao, Yan Zheng

It is crucial to consider diverse human feedback types and various learning methods in different environments.

DiffuserLite: Towards Real-time Diffusion Planning

no code implementations27 Jan 2024 Zibin Dong, Jianye Hao, Yifu Yuan, Fei Ni, Yitian Wang, Pengyi Li, Yan Zheng

Diffusion planning has been recognized as an effective decision-making paradigm in various domains.

D4RL Decision Making

AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model

no code implementations3 Oct 2023 Zibin Dong, Yifu Yuan, Jianye Hao, Fei Ni, Yao Mu, Yan Zheng, Yujing Hu, Tangjie Lv, Changjie Fan, Zhipeng Hu

Aligning agent behaviors with diverse human preferences remains a challenging problem in reinforcement learning (RL), owing to the inherent abstractness and mutability of human preferences.

Attribute Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.