Search Results for author: Vihang P. Patil

Found 1 paper, 1 paper with code

Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution

1 code implementation • 29 Sep 2020 • Vihang P. Patil, Markus Hofmarcher, Marius-Constantin Dinu, Matthias Dorfer, Patrick M. Blies, Johannes Brandstetter, Jose A. Arjona-Medina, Sepp Hochreiter

For such complex tasks, the recently proposed RUDDER uses reward redistribution to leverage steps in the Q-function that are associated with accomplishing sub-tasks.
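As a rough illustration of the idea in the sentence above, the sketch below turns a sequence of return predictions into per-step rewards by taking the difference between consecutive predictions, so that a jump ("step") in the predicted return becomes an immediate reward at the step where it occurs. This is a minimal sketch under stated assumptions, not the authors' implementation: the function name `redistribute_reward`, the zero baseline before the first step, and the example prediction values are all hypothetical.

```python
from typing import List


def redistribute_reward(return_predictions: List[float]) -> List[float]:
    """Convert per-step return predictions into per-step rewards.

    Each redistributed reward is the change in the predicted return
    relative to the previous step, so steps in the prediction show up
    as immediate rewards.
    """
    rewards = []
    previous = 0.0  # assumption: the prediction before the first step is 0
    for prediction in return_predictions:
        rewards.append(prediction - previous)
        previous = prediction
    return rewards


# Hypothetical predictions for a 5-step episode in which two sub-tasks
# are completed (at the third and fifth steps).
predictions = [0.0, 0.0, 0.5, 0.5, 1.0]
redistributed = redistribute_reward(predictions)
```

With these made-up values, reward is assigned only at the two steps where the prediction jumps, and the redistributed rewards sum to the final predicted return.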

General Reinforcement Learning • Multiple Sequence Alignment • +1

