Search Results for author: Diwei Huang

Found 1 paper, 1 paper with code

Learning Vision-and-Language Navigation from YouTube Videos

1 code implementation • ICCV 2023 • Kunyang Lin, Peihao Chen, Diwei Huang, Thomas H. Li, Mingkui Tan, Chuang Gan

In this paper, we propose to learn an agent from YouTube house tour videos by creating a large-scale dataset of reasonable path-instruction pairs extracted from them and pre-training the agent on it.

Navigate • Vision and Language Navigation

