1 code implementation • 2 Jun 2022 • Kevin Esslinger, Robert Platt, Christopher Amato
Such tasks typically require some form of memory, where the agent has access to multiple past observations, in order to perform well.
Partially Observable Reinforcement Learning reinforcement-learning +1