no code implementations • 7 Feb 2024 • Carlo Alfano, Sebastian Towers, Silvia Sapora, Chris Lu, Patrick Rebeschini
Policy Mirror Descent (PMD) is a popular framework in reinforcement learning, serving as a unifying perspective that encompasses numerous algorithms.
no code implementations • 16 Mar 2023 • Chris Lu, Sebastian Towers, Jakob Foerster
Meta-learning, the notion of learning to learn, enables learning systems to quickly and flexibly solve new tasks.