no code implementations • 19 Apr 2024 • Jianliang He, Han Zhong, Zhuoran Yang
Moreover, for infinite-horizon average-reward Markov decision processes (AMDPs), we propose a novel complexity measure -- the average-reward generalized eluder coefficient (AGEC) -- which captures the challenge of exploration in AMDPs with general function approximation.