1 code implementation • 4 Jan 2024 • Qian Lin, Chao Yu, Zongkai Liu, Zifan Wu
In this paper, we aim to utilize only offline trajectory data to train a policy for multi-objective RL.
Multi-Objective Reinforcement Learning Offline RL +1