1 code implementation • 27 Apr 2020 • Jerrod Parker, Jerry Zikun Chen
Recent algorithms in machine translation have included a value network to assist the policy network when deciding which word to output at each step of the translation.
1 code implementation • 18 Apr 2020 • Jerrod Parker, Shakti Kumar, Joe Roussy
Recent developments in Transformers for language modeling have opened new areas of research in computer vision.
1 code implementation • 8 Apr 2020 • Shakti Kumar, Jerrod Parker, Panteha Naderian
In this work, we first partially replicate the results shown in "Stabilizing Transformers in RL" on both reactive and memory-based environments.
Partially Observable Reinforcement Learning, reinforcement-learning