no code implementations • 5 Feb 2024 • Bahareh Tasdighi, Nicklas Werge, Yi-Shan Wu, Melih Kandemir
We introduce Probabilistic Actor-Critic (PAC), a novel reinforcement learning algorithm with improved continuous control performance thanks to its ability to mitigate the exploration-exploitation trade-off.
no code implementations • 30 Jan 2023 • Bahareh Tasdighi, Abdullah Akgül, Kenny Kazimirzak Brink, Melih Kandemir
Actor-critic algorithms address the dual goals of reinforcement learning (RL), policy evaluation and improvement, via two separate function approximators.