Entropy regularization is used to improve optimization performance in reinforcement learning tasks. A common form of regularization maximizes the policy entropy, which helps avoid premature convergence and leads to more stochastic policies that explore the action space...
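As a rough illustration of the idea described above, the sketch below adds an entropy bonus to the expected return of a softmax policy over a small discrete action set. It is a minimal example under stated assumptions, not the method of the paper: the function names, the coefficient `beta`, and the per-action values in `action_values` are all hypothetical and chosen only for illustration.

```python
import numpy as np


def softmax(logits):
    """Turn unnormalized logits into action probabilities."""
    shifted = logits - np.max(logits)  # subtract the max for numerical stability
    exp = np.exp(shifted)
    return exp / exp.sum()


def entropy(probs, eps=1e-12):
    """Shannon entropy (in nats) of a discrete action distribution."""
    return float(-np.sum(probs * np.log(probs + eps)))


def regularized_objective(logits, action_values, beta=0.01):
    """Expected return plus an entropy bonus weighted by beta (assumed form)."""
    probs = softmax(logits)
    expected_return = float(np.dot(probs, action_values))
    return expected_return + beta * entropy(probs)


# Hypothetical example: four actions with made-up values.
action_values = np.array([1.0, 0.9, 0.2, 0.1])
uniform_logits = np.zeros(4)                      # maximally stochastic policy
peaked_logits = np.array([10.0, 0.0, 0.0, 0.0])   # nearly deterministic policy

print(entropy(softmax(uniform_logits)), entropy(softmax(peaked_logits)))
print(regularized_objective(peaked_logits, action_values, beta=0.0))
print(regularized_objective(peaked_logits, action_values, beta=0.5))
```

With `beta = 0` the objective reduces to the plain expected return. A larger `beta` rewards policies that keep probability mass spread across actions, which is the mechanism by which the entropy term discourages premature convergence to a near-deterministic policy.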
| METHOD | TYPE |
|---|---|
| | Regularization |