no code implementations • 19 Jan 2024 • Dayang Liang, Yaru Zhang, Yunlong Liu
As a result, our method is able to simultaneously achieve the full utilization of retrieval information and the better evaluation of state values by a Temporal Difference (TD) loss.
1 code implementation • 22 Sep 2023 • Dayang Liang, Qihang Chen, Yunlong Liu
Specifically, we propose a Sequential Action--induced invariant Representation (SAR) method, in which the encoder is optimized by an auxiliary learner to only preserve the components that follow the control signals of sequential actions, so the agent can be induced to learn the robust representation against distractions.