1 code implementation • 12 Mar 2024 • Chengxing Jia, Fuxiang Zhang, Yi-Chen Li, Chen-Xiao Gao, Xu-Hui Liu, Lei Yuan, Zongzhang Zhang, Yang Yu
Specifically, the objective of adversarial data augmentation is not merely to generate data resembling the offline data distribution; instead, it aims to create adversarial examples designed to confound learned task representations and lead to incorrect task identification.
1 code implementation • 26 Dec 2023 • Renzhe Zhou, Chen-Xiao Gao, Zongzhang Zhang, Yang Yu
GENTLE employs a Task Auto-Encoder (TAE), an encoder-decoder architecture that extracts the characteristics of the tasks.
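As a rough illustration of what an encoder-decoder over task data can look like, the sketch below compresses a batch of transition vectors into one task embedding and decodes it back. It is not GENTLE's implementation: the class name `TinyTaskAutoEncoder`, the linear layers, the mean pooling, and the choice to reconstruct the input vectors are all assumptions made for the example.

```python
import random
from typing import List

Vector = List[float]
Matrix = List[List[float]]


def matvec(W: Matrix, x: Vector) -> Vector:
    """Multiply matrix W (rows x cols) by vector x (length cols)."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def init_matrix(rows: int, cols: int, rng: random.Random) -> Matrix:
    """Small random weights; a stand-in for a learned parameter matrix."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


class TinyTaskAutoEncoder:
    """Hypothetical linear encoder-decoder (assumption, not the paper's TAE)."""

    def __init__(self, input_dim: int, latent_dim: int, seed: int = 0) -> None:
        rng = random.Random(seed)
        self.W_enc = init_matrix(latent_dim, input_dim, rng)
        self.W_dec = init_matrix(input_dim, latent_dim, rng)

    def encode(self, transitions: List[Vector]) -> Vector:
        """Encode each transition, then mean-pool into one task embedding."""
        codes = [matvec(self.W_enc, x) for x in transitions]
        n = len(codes)
        return [sum(c[i] for c in codes) / n for i in range(len(codes[0]))]

    def decode(self, z: Vector) -> Vector:
        """Map a task embedding back to the input space."""
        return matvec(self.W_dec, z)

    def reconstruction_loss(self, transitions: List[Vector]) -> float:
        """Mean squared error between the decoded embedding and each input."""
        recon = self.decode(self.encode(transitions))
        total = 0.0
        for x in transitions:
            total += sum((r - xi) ** 2 for r, xi in zip(recon, x))
        return total / len(transitions)


# Usage: two made-up 3-dimensional transition vectors from one task.
batch = [[1.0, 0.0, 2.0], [0.5, 1.0, 0.0]]
tae = TinyTaskAutoEncoder(input_dim=3, latent_dim=2)
task_embedding = tae.encode(batch)
loss = tae.reconstruction_loss(batch)
```

Training would minimize `reconstruction_loss` over the weights, so that the embedding has to retain whatever distinguishes one task's data from another's; the optimizer is omitted here to keep the sketch short.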
1 code implementation • 12 Sep 2023 • Chen-Xiao Gao, Chenyang Wu, Mingjun Cao, Rui Kong, Zongzhang Zhang, Yang Yu
Third, we train an Advantage-Conditioned Transformer (ACT) to generate actions conditioned on the estimated advantages.
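One way to picture conditioning action generation on advantages is through the input sequence a sequence model would see. The sketch below interleaves (advantage, state, action) tokens per timestep and selects the context available when predicting an action. The token layout, the function names, and the sample values are assumptions for illustration; the excerpt does not specify how ACT arranges its inputs.

```python
from typing import List, Sequence, Tuple

# (kind, timestep, value); a hypothetical token format for this sketch only.
Token = Tuple[str, int, object]


def build_advantage_conditioned_sequence(
    states: Sequence[object],
    actions: Sequence[object],
    advantages: Sequence[float],
) -> List[Token]:
    """Interleave advantage, state, and action tokens for each timestep."""
    if not (len(states) == len(actions) == len(advantages)):
        raise ValueError("states, actions, and advantages must have equal length")
    tokens: List[Token] = []
    for t, (adv, s, a) in enumerate(zip(advantages, states, actions)):
        tokens.append(("advantage", t, adv))
        tokens.append(("state", t, s))
        tokens.append(("action", t, a))
    return tokens


def prediction_context(tokens: Sequence[Token], t: int) -> List[Token]:
    """Tokens visible when predicting the action at timestep t.

    Includes every token from earlier timesteps plus the advantage and
    state at timestep t, but not the action at t or anything later.
    """
    return [
        tok
        for tok in tokens
        if tok[1] < t or (tok[1] == t and tok[0] != "action")
    ]


# Usage with made-up values: two timesteps of a trajectory.
tokens = build_advantage_conditioned_sequence(
    states=["s0", "s1"],
    actions=["a0", "a1"],
    advantages=[0.5, -0.2],
)
context_for_a1 = prediction_context(tokens, t=1)
```

Under this layout, the model predicts each action from a context whose most recent tokens are the advantage and state at that step, so a desired advantage value can be supplied at inference time to steer the generated action.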