no code implementations • 16 Jun 2023 • Guangyu Wang, Guoxing Yang, Zongxin Du, Longjun Fan, Xiaohu Li
Large language models have exhibited exceptional performance on various Natural Language Processing (NLP) tasks, leveraging techniques such as pre-training and instruction fine-tuning.
no code implementations • ICML 2020 • John D. Martin, Michal Lyskawinski, Xiaohu Li, Brendan Englot
We describe a new approach for managing aleatoric uncertainty in the Reinforcement Learning (RL) paradigm.
Distributional Reinforcement Learning • Reinforcement Learning