21 Jul 2022 • Adam Villaflor, Zhe Huang, Swapnil Pande, John Dolan, Jeff Schneider
Impressive results in natural language processing (NLP) based on the Transformer neural network architecture have inspired researchers to explore viewing offline reinforcement learning (RL) as a generic sequence modeling problem.