1 code implementation • 31 Aug 2015 • Hao Yi Ong
The problem is to decompose a matrix that encodes the true value function into low-rank and sparse components, and we achieve this using Robust Principal Component Analysis (PCA).
no code implementations • 18 Aug 2015 • Hao Yi Ong, Kevin Chavez, Augustus Hong
We propose a distributed deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning.