no code implementations • NeurIPS 2020 • Severin Berger, Christian K. Machens
More specifically, we focus on MDPs whose state is based on action-observation histories, and we show how to compress the state space such that unnecessary redundancy is eliminated, while task-relevant information is preserved.