VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction

Ziyue Zhu

Shenlong Wang

Jin Xie

Jiang-jiang Liu

Jingdong Wang

Jian Yang

Recent work in camera-based occupancy prediction has focused on jointly predicting 3D semantics and scene flow, a task made difficult by factors such as occlusions and unbalanced dynamic environments. In this paper, we analyze these challenges and their underlying causes. To address them, we propose a novel regularization framework called VoxelSplat. This framework leverages recent developments in 3D Gaussian Splatting to enhance model performance in two key ways: (i) Enhanced Semantics Supervision through 2D Projection: During training, our method decodes sparse semantic 3D Gaussians from 3D representations and projects them onto the 2D camera view. This provides additional supervision signals in the camera-visible space, allowing 2D labels to improve the learning of 3D semantics. (ii) Scene Flow Learning: Our framework uses the predicted scene flow to model the motion of the Gaussians, and can therefore learn the scene flow of moving objects in a self-supervised manner from the labels of adjacent frames. Our method can be seamlessly integrated into various existing occupancy models, enhancing performance without increasing inference time. Extensive experiments on benchmark datasets demonstrate the effectiveness of VoxelSplat in improving the accuracy of both semantic occupancy and scene flow estimation. The project page and code are available at https://zzy816.github.io/VoxelSplat-Demo/.
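To make the two ideas in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes each Gaussian is reduced to its center in the camera frame, that the predicted scene flow is a per-Gaussian velocity (displacement per unit time), and that the camera is an ideal pinhole. The names `warp_centers` and `project_pinhole` and the parameters `fx`, `fy`, `cx`, `cy`, `dt` are hypothetical, introduced only for illustration. Real Gaussian splatting also projects covariances and alpha-blends the splats, which this sketch omits.

```python
from typing import List, Optional, Tuple

Vec3 = Tuple[float, float, float]
Pixel = Optional[Tuple[float, float]]


def warp_centers(centers: List[Vec3], flows: List[Vec3], dt: float) -> List[Vec3]:
    """Move each Gaussian center along its flow vector for a time step dt.

    Assumption: flow is a velocity, so the displacement is flow * dt.
    """
    return [
        (x + vx * dt, y + vy * dt, z + vz * dt)
        for (x, y, z), (vx, vy, vz) in zip(centers, flows)
    ]


def project_pinhole(
    centers: List[Vec3], fx: float, fy: float, cx: float, cy: float
) -> List[Pixel]:
    """Project camera-frame points to pixel coordinates with a pinhole model.

    Points at or behind the image plane (z <= 0) are not visible and map to None.
    """
    pixels: List[Pixel] = []
    for x, y, z in centers:
        if z <= 0.0:
            pixels.append(None)
        else:
            pixels.append((fx * x / z + cx, fy * y / z + cy))
    return pixels


# Two Gaussians: the first moves along +x, the second is static.
centers = [(0.0, 0.0, 10.0), (1.0, 0.0, 5.0)]
flows = [(1.0, 0.0, 0.0), (0.0, 0.0, 0.0)]

# Warp to the adjacent frame, then project into that frame's image.
warped = warp_centers(centers, flows, dt=0.5)
pixels = project_pinhole(warped, fx=100.0, fy=100.0, cx=50.0, cy=40.0)
```

In a training setup along the lines the abstract describes, the projected positions for the adjacent frame would be rendered and compared against that frame's 2D labels, so that errors in the predicted flow show up as a misaligned rendering. The loss itself is not shown here because the abstract does not specify it.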

