no code implementations • 10 Jul 2023 • Vincent Liu, Han Wang, Ruo Yu Tao, Khurram Javed, Adam White, Martha White
Lastly, we outline a class of algorithms, which we call online-aware, that are designed to mitigate interference, and show that they reduce interference according to our measure and improve stability and performance in several classic control environments.
1 code implementation • 15 Nov 2022 • Ruo Yu Tao, Adam White, Marlos C. Machado
Finally, we show that this approach is complementary to state-of-the-art methods such as recurrent neural networks and truncated back-propagation through time, and acts as a heuristic that facilitates longer temporal credit assignment, leading to better performance.
1 code implementation • NeurIPS 2020 • Ruo Yu Tao, Vincent François-Lavet, Joelle Pineau
We then leverage these intrinsic rewards for sample-efficient exploration with planning routines in representational space for hard exploration tasks with sparse rewards.
2 code implementations • 3 Dec 2018 • Ruo Yu Tao, Marc-Alexandre Côté, Xingdi Yuan, Layla El Asri
To solve a text-based game, an agent needs to formulate valid text commands for a given context and find the ones that lead to success.
1 code implementation • 29 Jun 2018 • Marc-Alexandre Côté, Ákos Kádár, Xingdi Yuan, Ben Kybartas, Tavian Barnes, Emery Fine, James Moore, Ruo Yu Tao, Matthew Hausknecht, Layla El Asri, Mahmoud Adada, Wendy Tay, Adam Trischler
We introduce TextWorld, a sandbox learning environment for the training and evaluation of RL agents on text-based games.