Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments

CVPR 2018 Peter AndersonQi WuDamien TeneyJake BruceMark JohnsonNiko SünderhaufIan ReidStephen GouldAnton van den Hengel

A robot that can carry out a natural-language instruction has been a dream since before the Jetsons cartoon series imagined a life of leisure mediated by a fleet of attentive robot helpers. It is a dream that remains stubbornly distant... (read more)

PDF Abstract

Results from the Paper


TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK RESULT BENCHMARK
Visual Navigation Room-to-Room Seq2Seq baseline spl 0.18 # 3

Methods used in the Paper


METHOD TYPE
🤖 No Methods Found Help the community by adding them if they're not listed; e.g. Deep Residual Learning for Image Recognition uses ResNet