no code implementations • 11 Mar 2024 • Narim Jeong, Donghwan Lee
We hope that our analysis will deepen the current understanding of soft Q-learning by establishing connections with switching system models and may even pave the way for new frameworks in the finite-time analysis of other reinforcement learning algorithms.