no code implementations • 1 Dec 2023 • Rohan Choudhury, Koichiro Niinuma, Kris M. Kitani, László A. Jeni
We propose to answer zero-shot questions about videos by generating short procedural programs that derive a final answer from solving a sequence of visual subtasks.
Ranked #6 on Zero-Shot Video Question Answer on NExT-QA
no code implementations • ICCV 2023 • Rohan Choudhury, Kris Kitani, Laszlo A. Jeni
In doing so, our model is able to use spatiotemporal context to predict more accurate human poses without sacrificing efficiency.
no code implementations • 4 Jan 2019 • Gokul Swamy, Jens Schulz, Rohan Choudhury, Dylan Hadfield-Menell, Anca Dragan
Fundamental to robotics is the debate between model-based and model-free learning: should the robot build an explicit model of the world, or learn a policy directly?