Efficient Camera Pose Augmentation for View Generalization in Robotic Policy Learning

Sen Wang

Huaiyi Dong

Jingyi Tian

Jiayi Li

Zhuo Yang

Tongtong Cao

Anlin Chen

Shuang Wu

Le Wang

Prevailing 2D-centric visuomotor policies exhibit a pronounced deficiency in novel view generalization, as their reliance on static observations hinders consistent action mapping across unseen views. In response, we introduce GenSplat, a feed-forward 3D Gaussian Splatting framework that facilitates view-generalized policy learning through novel view rendering. GenSplat employs a permutation-equivariant architecture to reconstruct high-fidelity 3D scenes from sparse, uncalibrated inputs in a single forward pass. To ensure structural integrity, we design a 3D-prior distillation strategy that regularizes the 3DGS optimization, preventing the geometric collapse typical of purely photometric supervision. By rendering diverse synthetic views from these stable 3D representations, we systematically augment the observational manifold during training. This augmentation forces the policy to ground its decisions in underlying 3D structures, thereby ensuring robust execution under severe spatial perturbations where baselines severely degrade.

PDF URL

Featured

Platforms

World Labs Releases Marble 1.1 and Marble 1.1 Plus

Two nice new updates are here from World Labs.

Michael Rubloff

Apr 2, 2026

Platforms

Babylon.js Releases V9.1

Some nice updates from the release last week have been merged.

Michael Rubloff

Apr 2, 2026

Platforms

SplatRenderer v1.1.0 Adds Level Sequencer for 4DGS in Unreal Engine 5

A quick new update for SplatRenderer just dropped.

Michael Rubloff

Apr 2, 2026

Artwork

The Olympic Exhibition That Almost Disappeared

How artist Josette Seitz used a Gaussian Splat to preserve an Olympic exhibition that was only accessible for two weeks — and is now explorable by anyone.

Michael Rubloff

Apr 1, 2026