Search Results for author: Wenjie Qiu

Found 2 papers, 0 papers with code

Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics

no code implementations17 Feb 2024 Xinyu Zhang, Wenjie Qiu, Yi-Chen Li, Lei Yuan, Chengxing Jia, Zongzhang Zhang, Yang Yu

DORA incorporates an information bottleneck principle that maximizes mutual information between the dynamics encoding and the environmental data, while minimizing mutual information between the dynamics encoding and the actions of the behavior policy.

Representation Learning

Programmatic Reinforcement Learning without Oracles

no code implementations ICLR 2022 Wenjie Qiu, He Zhu

Our first contribution is a programmatically interpretable RL framework that conducts program architecture search on top of a continuous relaxation of the architecture space defined by programming language grammar rules.

Bilevel Optimization Policy Gradient Methods +2

Cannot find the paper you are looking for? You can Submit a new open access paper.