Search Results for author: Zac Wellmer

Found 3 papers, 1 papers with code

Dropout's Dream Land: Generalization from Learned Simulators to Reality

1 code implementation17 Sep 2021 Zac Wellmer, James T. Kwok

By training the World Model using dropout, the dream environment is capable of creating a nearly infinite number of different dream environments.

Policy Tree Network

no code implementations25 Sep 2019 Zac Wellmer, Sepanta Zeighami, James Kwok

However, decision-time planning with implicit dynamics models in continuous action space has proven to be a difficult problem.

Model-based Reinforcement Learning Policy Gradient Methods +3

Policy Prediction Network: Model-Free Behavior Policy with Model-Based Learning in Continuous Action Space

no code implementations15 Sep 2019 Zac Wellmer, James Kwok

This paper proposes a novel deep reinforcement learning architecture that was inspired by previous tree structured architectures which were only useable in discrete action spaces.

Continuous Control Model-based Reinforcement Learning +2

Cannot find the paper you are looking for? You can Submit a new open access paper.