SceneSplat: Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining

Yue Li

Qi Ma

Runyi Yang

Huapeng Li

Mengjiao Ma

Bin Ren

Nikola Popovic

Nicu Sebe

Ender Konukoglu

Luc Van Gool

Martin R. Oswald

Recognizing arbitrary or previously unseen categories is essential for comprehensive real-world 3D scene understanding. Currently, all existing methods rely on 2D or textual modalities during training, or together at inference. This highlights a clear absence of a model capable of processing 3D data alone for learning semantics end-to-end, along with the necessary data to train such a model. Meanwhile, 3D Gaussian Splatting (3DGS) has emerged as the de facto standard for 3D scene representation across various vision tasks. However, effectively integrating semantic reasoning into 3DGS in a generalizable fashion remains an open challenge. To address these limitations we introduce SceneSplat, to our knowledge the first large-scale 3D indoor scene understanding approach that operates natively on 3DGS. Furthermore, we propose a self-supervised learning scheme that unlocks rich 3D feature learning from unlabeled scenes. In order to power the proposed methods, we introduce SceneSplat-7K, the first large-scale 3DGS dataset for indoor scenes, comprising of 6868 scenes derived from 7 established datasets like ScanNet, Matterport3D, etc. Generating SceneSplat-7K required computational resources equivalent to 119 GPU-days on an L4 GPU, enabling standardized benchmarking for 3DGS-based reasoning for indoor scenes. Our exhaustive experiments on SceneSplat-7K demonstrate the significant benefit of the proposed methods over the established baselines.

PDF URL

Featured

Platforms

SplatCapture 3.0.0 Adds Spline Capture

SplatCapture 3.0.0 adds a spline capture mode, Unreal Engine 5.8 support, and square-aspect camera rigs that give Gaussian splatting trainers clean intrinsics.

Michael Rubloff

Jul 17, 2026

Platforms

Marble x Nuke Loads World Labs Marble Splats Straight Into Nuke's 3D Viewport

Marble x Nuke is a free Nuke 17 toolset that turns World Labs Marble text or image prompts into Gaussian splats loaded straight into Nuke's native 3D viewport.

Michael Rubloff

Jul 15, 2026

Platforms

Houdini 22 Ships, Making Native Gaussian Splats Generally Available

Houdini 22 is now generally available, making SideFX's native Gaussian splatting pipeline, PDG training and Copernicus compositing, production ready today.

Michael Rubloff

Jul 15, 2026

360 Gaussian v1.4.5 Adds Per-Clip Extraction Settings

360 Gaussian v1.4.5 updates the 360 video to 3DGS tool with per-clip extraction settings and fixes swapped GPS coordinates for accurate scale.

Michael Rubloff

Jul 15, 2026