no code implementations • 8 Jun 2023 • Shmuel Bar-David, Itamar Zimerman, Eliya Nachmani, Lior Wolf
Recently, sequence learning methods have been applied to the problem of off-policy Reinforcement Learning, including the seminal work on Decision Transformers, which employs transformers for this task.