Visual Navigation
105 papers with code • 6 benchmarks • 16 datasets
Visual Navigation is the problem of navigating an agent, e.g. a mobile robot, in an environment using camera input only. The agent is given a target image (the image it will see from the target position), and its goal is to move from its current position to the target by applying a sequence of actions chosen from the camera observations alone.
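The loop described above (observe, compare against the target image, act, repeat) can be sketched in a few lines. This is a toy illustration, not any particular paper's method: grid coordinates stand in for camera images, and a hand-written rule stands in for the learned policy that a real agent would use.

```python
from dataclasses import dataclass


@dataclass
class GridEnv:
    """Toy environment: the 'camera image' is stood in for by the (x, y) cell."""
    size: int = 5
    pos: tuple = (0, 0)
    goal: tuple = (4, 3)

    ACTIONS = {"up": (0, 1), "down": (0, -1), "left": (-1, 0), "right": (1, 0)}

    def observe(self):
        return self.pos  # stand-in for the current camera observation

    def target_image(self):
        return self.goal  # stand-in for the image seen from the target position

    def step(self, action):
        dx, dy = self.ACTIONS[action]
        x = min(max(self.pos[0] + dx, 0), self.size - 1)
        y = min(max(self.pos[1] + dy, 0), self.size - 1)
        self.pos = (x, y)
        return self.observe()


def policy(obs, target):
    """Hand-written stand-in for a learned policy network pi(obs, target) -> action."""
    if obs[0] != target[0]:
        return "right" if obs[0] < target[0] else "left"
    return "up" if obs[1] < target[1] else "down"


def navigate(env, max_steps=50):
    """Image-goal navigation loop: act until the observation matches the target."""
    target = env.target_image()
    obs = env.observe()
    for t in range(max_steps):
        if obs == target:
            return t  # number of actions taken
        obs = env.step(policy(obs, target))
    return None  # failed to reach the goal within the step budget
```

For example, `navigate(GridEnv())` drives the agent from `(0, 0)` to `(4, 3)` in 7 steps. A real visual-navigation agent replaces the coordinate comparison and the hand-written rule with learned perception and control over raw images.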
Source: Vision-based Navigation Using Deep Reinforcement Learning
Libraries
Use these libraries to find Visual Navigation models and implementations
Latest papers
Offline Reinforcement Learning for Visual Navigation
Reinforcement learning can enable robots to navigate to distant goals while optimizing user-specified reward functions, including preferences for following lanes, staying on paved paths, or avoiding freshly mowed grass.
BEVBert: Multimodal Map Pre-training for Language-guided Navigation
Concretely, we build a local metric map to explicitly aggregate incomplete observations and remove duplicates, while modeling navigation dependency in a global topological map.
Last-Mile Embodied Visual Navigation
Realistic long-horizon tasks like image-goal navigation involve exploratory and exploitative phases.
Towards Versatile Embodied Navigation
With the emergence of varied visual navigation tasks (e.g., image-goal, object-goal, audio-goal, and vision-language navigation) that specify the target in different ways, the community has made appealing advances in training specialized agents that handle individual navigation tasks well.
ViNL: Visual Navigation and Locomotion Over Obstacles
ViNL consists of: (1) a visual navigation policy that outputs linear and angular velocity commands that guide the robot to a goal coordinate in unfamiliar indoor environments; and (2) a visual locomotion policy that controls the robot's joints to avoid stepping on obstacles while following the provided velocity commands.
Learning from Unlabeled 3D Environments for Vision-and-Language Navigation
Our resulting HM3D-AutoVLN dataset is an order of magnitude larger than existing VLN datasets in terms of navigation environments and instructions.
Visual Pre-training for Navigation: What Can We Learn from Noise?
One powerful paradigm in visual navigation is to predict actions from observations directly.
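The observation-to-action paradigm can be made concrete with a minimal sketch: fit a policy on demonstration pairs and query it on a new observation. This is an illustrative toy, not the paper's method; a 1-nearest-neighbour lookup over hypothetical 2-D feature vectors stands in for the deep network a real system would train on images.

```python
# Demonstration pairs: (observation feature vector, expert action).
# The features and actions here are made up for illustration.
demos = [
    ((0.0, 0.0), "forward"),
    ((1.0, 0.0), "left"),
    ((0.0, 1.0), "right"),
]


def predict_action(obs):
    """Predict an action directly from an observation via nearest demonstration."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    # Return the expert action attached to the closest stored observation.
    return min(demos, key=lambda d: sq_dist(d[0], obs))[1]
```

Querying with an unseen observation, e.g. `predict_action((0.9, 0.1))`, returns the action of its nearest demonstration (`"left"`); learned policies generalize in the same observation-in, action-out shape, just with a network instead of a lookup.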
Good Time to Ask: A Learning Framework for Asking for Help in Embodied Visual Navigation
In reality, it is often more efficient to ask for help than to search the entire space to find an object with an unknown location.
What do navigation agents learn about their environment?
We use iSEE to probe the dynamic representations produced by these agents for the presence of information about the agent as well as the environment.
SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning
We introduce SoundSpaces 2.0, a platform for on-the-fly geometry-based audio rendering for 3D environments.