Gaussian2Scene: 3D Scene Representation Learning via Self-supervised Learning with 3D Gaussian Splatting

Keyi Liu

Weidong Yang

Ben Fei

Ying He

Self-supervised learning (SSL) for point cloud pre-training has become a cornerstone for many 3D vision tasks, enabling effective learning from large-scale unannotated data. At the scene level, existing SSL methods often incorporate volume rendering into the pre-training framework, using RGB-D images as reconstruction signals to facilitate cross-modal learning. This strategy promotes alignment between 2D and 3D modalities and enables the model to benefit from rich visual cues in the RGB-D inputs. However, these approaches are limited by their reliance on implicit scene representations and high memory demands. Furthermore, since their reconstruction objectives are applied only in 2D space, they often fail to capture underlying 3D geometric structures. To address these challenges, we propose Gaussian2Scene, a novel scene-level SSL framework that leverages the efficiency and explicit nature of 3D Gaussian Splatting (3DGS) for pre-training. The use of 3DGS not only alleviates the computational burden associated with volume rendering but also supports direct 3D scene reconstruction, thereby enhancing the geometric understanding of the backbone network. Our approach follows a progressive two-stage training strategy. In the first stage, a dual-branch masked autoencoder learns both 2D and 3D scene representations. In the second stage, we initialize training with reconstructed point clouds and further supervise learning using the geometric locations of Gaussian primitives and rendered RGB images. This process reinforces both geometric and cross-modal learning. We demonstrate the effectiveness of Gaussian2Scene across several downstream 3D object detection tasks, showing consistent improvements over existing pre-training methods.

PDF URL

Featured

Platforms

COLMAP 4.0 Introduces Major Performance and Infrastructure Updates

The indispensable library just released a massive update.

Michael Rubloff

Mar 15, 2026

Platforms

StorySplat 2.2 Adds Image to Splat, Animation, and Measurement Tools

StorySplat just shipped V2.2 with a bunch of updates.

Michael Rubloff

Mar 13, 2026

Platforms

Postshot Adds Photometric Compensation and More in Latest Update

Several nice updates to Postshot just released.

Michael Rubloff

Mar 13, 2026

Platforms

Radiancefields.com Announces Gaussian SplatKing for Mobile Capture

I've been working to bring a pipeline agnostic capture tool for gaussian splatting.

Michael Rubloff

Mar 13, 2026