Search Results for author: Bohao Qu

Found 2 papers, 0 papers with code

Transductive Reward Inference on Graph

no code implementations • 6 Feb 2024 • Bohao Qu, Xiaofeng Cao, Qing Guo, Yi Chang, Ivor W. Tsang, Chengqi Zhang

In this study, we present a transductive inference approach on that reward information propagation graph, which enables the effective estimation of rewards for unlabelled data in offline reinforcement learning.

reinforcement-learning

Paper
Add Code

Policy Dispersion in Non-Markovian Environment

no code implementations • 28 Feb 2023 • Bohao Qu, Xiaofeng Cao, Jielong Yang, Hechang Chen, Chang Yi, Ivor W. Tsang, Yew-Soon Ong

To resolve this problem, this paper tries to learn the diverse policies from the history of state-action pairs under a non-Markovian environment, in which a policy dispersion scheme is designed for seeking diverse policy representation.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.