1 code implementation • 25 Jun 2022 • Igor Kuznetsov
We address these challenges by proposing a novel guided exploration method that uses a differential directional controller to incorporate scalable exploratory action correction.
1 code implementation • 16 Jun 2021 • Igor Kuznetsov, Andrey Filchenkov
The application of episodic memory for continuous control with a large action space is not trivial.
no code implementations • 13 Jun 2019 • Arip Asadulaev, Igor Kuznetsov, Andrey Filchenkov
It is important to develop mathematically tractable models than can interpret knowledge extracted from the data and provide reasonable predictions.
no code implementations • 13 Jun 2019 • Arip Asadulaev, Igor Kuznetsov, Gideon Stein, Andrey Filchenkov
In this paper, we try to answer the following question: Can information about policy conditioning help to shape a more stable and general policy of reinforcement learning agents?