GaussNav: Gaussian Splatting for Visual Navigation

Xiaohan Lei

Min Wang

Wengang Zhou

Houqiang Li

In embodied vision, Instance ImageGoal Navigation (IIN) requires an agent to locate a specific object depicted in a goal image within an unexplored environment. The primary challenge of IIN arises from the need to recognize the target object across varying viewpoints while ignoring potential distractors. Existing map-based navigation methods typically use Bird's Eye View (BEV) maps, which lack detailed texture representation of a scene. Consequently, while BEV maps are effective for semantic-level visual navigation, they are struggling for instance-level tasks. To this end, we propose a new framework for IIN, Gaussian Splatting for Visual Navigation (GaussNav), which constructs a novel map representation based on 3D Gaussian Splatting (3DGS). The GaussNav framework enables the agent to memorize both the geometry and semantic information of the scene, as well as retain the textural features of objects. By matching renderings of similar objects with the target, the agent can accurately identify, ground, and navigate to the specified object. Our GaussNav framework demonstrates a significant performance improvement, with Success weighted by Path Length (SPL) increasing from 0.347 to 0.578 on the challenging Habitat-Matterport 3D (HM3D) dataset. The source code is publicly available at the link: https://github.com/XiaohanLei/GaussNav.

PDF URL

Featured

Platforms

NVIDIA Announces NuRec Gaussian Splatting Libraries at SIGGRAPH

Michael Rubloff

Aug 11, 2025

Platforms

NVIDIA Updates vkSplatting Playground

3DGRT support is here.

Michael Rubloff

Aug 11, 2025

Platforms

CG Nomads Release GSOPs 2.7 for Houdini

GSOPs 2.7 is here!

Michael Rubloff

Aug 10, 2025

Platforms

NVIDIA Release Kaolin V0.18.0

Bringing Simplicits Physics, Collisions, and More.

Michael Rubloff

Aug 10, 2025