no code implementations • 18 Dec 2020 • Sultan Javed Majeed, Marcus Hutter
In this work we show how action-binarization in the non-MDP case can significantly improve Extreme State Aggregation (ESA) bounds.
no code implementations • 9 Nov 2018 • Sultan Javed Majeed, Marcus Hutter
However, we show that near-optimal performance is sometimes guaranteed even if the homomorphism is non-Markovian.