no code implementations • 12 Mar 2024 • Motoki Omura, Takayuki Osa, Yusuke Mukuta, Tatsuya Harada
In deep reinforcement learning, estimating the value function to evaluate the quality of states and actions is essential.
Continuous Control • Q-Learning