7 Apr 2024 • Zetong Xuan, Alper Kamil Bozkurt, Miroslav Pajic, Yu Wang
In a widely adopted surrogate reward approach, two discount factors are used to ensure that the expected return approximates the satisfaction probability of the LTL objective.
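As a rough illustration of how two discount factors can tie the return to satisfaction, the sketch below assumes one common formulation that the text above does not spell out: the LTL objective is reduced to a Büchi (repeated reachability) condition, a reward of `1 - gamma_b` is given in accepting states, and the discount applied is `gamma_b` in accepting states and `gamma` elsewhere. The function name `surrogate_return`, the parameter values, and the boolean path encoding are all hypothetical choices made for this example.

```python
def surrogate_return(accepting_flags, gamma_b=0.99, gamma=0.9999):
    """Return of a finite path prefix under an assumed two-discount scheme.

    accepting_flags[t] is True when the state at step t is accepting.
    Assumed reward: 1 - gamma_b in accepting states, 0 elsewhere.
    Assumed discount: gamma_b in accepting states, gamma elsewhere.
    """
    total = 0.0
    discount = 1.0  # product of the discounts of all earlier states
    for is_accepting in accepting_flags:
        if is_accepting:
            total += discount * (1.0 - gamma_b)
            discount *= gamma_b
        else:
            discount *= gamma
    return total


# A path that stays in accepting states: the return approaches 1.
always = surrogate_return([True] * 2000)

# A path that never reaches an accepting state: the return is exactly 0.
never = surrogate_return([False] * 2000)

# A path that sees accepting states only finitely often: the return stays small.
finitely_often = surrogate_return([True] * 5 + [False] * 2000)

print(always, never, finitely_often)
```

Under these assumptions, a path that visits accepting states infinitely often earns a return close to 1 and a path that stops visiting them earns a return close to 0 once both discount factors are near 1, so the expected return over paths tracks the probability of satisfying the objective.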