25 Apr 2024 • Bram De Cooman, Johan Suykens
In this work, we try to unify these existing techniques and bridge the gap to classical optimization and control theory, using a generic primal-dual framework for value-based and actor-critic reinforcement learning methods.