no code implementations • 26 Feb 2023 • Hsiang-Chun Wang, Shang-Fu Chen, Ming-Hao Hsu, Chun-Mao Lai, Shao-Hua Sun
Most existing imitation learning methods that do not require interacting with environments either model the expert distribution as the conditional probability p(a|s) (e. g., behavioral cloning, BC) or the joint probability p(s, a).