no code implementations • 18 Aug 2023 • Eric Hsiung, Joydeep Biswas, Swarat Chaudhuri
Reward machines have shown great promise at capturing non-Markovian reward functions for learning tasks that involve complex action sequencing.
no code implementations • 11 Oct 2021 • Eric Hsiung, Hiloni Mehta, Junchi Chu, Xinyu Liu, Roma Patel, Stefanie Tellex, George Konidaris
We compare our method of mapping natural language task specifications to intermediate contextual queries against state-of-the-art CopyNet models capable of translating natural language to LTL, by evaluating whether correct LTL for manipulation and navigation task specifications can be output, and show that our method outperforms the CopyNet model on unseen object references.
no code implementations • 28 Jul 2021 • Sreehari Rammohan, Shangqun Yu, Bowen He, Eric Hsiung, Eric Rosen, Stefanie Tellex, George Konidaris
Learning continuous control in high-dimensional sparse reward settings, such as robotic manipulation, is a challenging problem due to the number of samples often required to obtain accurate optimal value and policy estimates.