GaussFly: Contrastive Reinforcement Learning for Visuomotor Policies in 3D Gaussian Fields

Yuhang Zhang

Mingsheng Li

Yujing Shang

Zhuoyuan Yu

Chao Yan

Jiaping Xiao

Mir Feroskhan

Learning visuomotor policies for Autonomous Aerial Vehicles (AAVs) relying solely on monocular vision is an attractive yet highly challenging paradigm. Existing end-to-end learning approaches directly map high-dimensional RGB observations to action commands, which frequently suffer from low sample efficiency and severe sim-to-real gaps due to the visual discrepancy between simulation and physical domains. To address these long-standing challenges, we propose GaussFly, a novel framework that explicitly decouples representation learning from policy optimization through a cohesive real-to-sim-to-real paradigm. First, to achieve a high-fidelity real-to-sim transition, we reconstruct training scenes using 3D Gaussian Splatting (3DGS) augmented with explicit geometric constraints. Second, to ensure robust sim-to-real transfer, we leverage these photorealistic simulated environments and employ contrastive representation learning to extract compact, noise-resilient latent features from the rendered RGB images. By utilizing this pre-trained encoder to provide low-dimensional feature inputs, the computational burden on the visuomotor policy is significantly reduced while its resistance against visual noise is inherently enhanced. Extensive experiments in simulated and real-world environments demonstrate that GaussFly achieves superior sample efficiency and asymptotic performance compared to baselines. Crucially, it enables robust and zero-shot policy transfer to unseen real-world environments with complex textures, effectively bridging the sim-to-real gap.

PDF URL

Featured

Platforms

How Gaussian Splats Helped Bring Madrid’s Calle Alcalá Onto an LED Volume for Netflix’s Berlin

Netflix’s Berlin and the Lady with an Ermine used Volinga’s Gaussian splatting workflow to bring Madrid’s Calle Alcalá into a cinematic LED volume virtual production scene.

Michael Rubloff

May 19, 2026

Platforms

Spatial Studio Adds AI Authoring Layer

Real Horizons has shipped AI Reframe virtual staging, Spatial Props object-level splat generation, Smart Hotspots, Auto Translate, PlayCanvas LOD streaming, and multi-splat tours to Spatial Studio.

Michael Rubloff

May 18, 2026

Platforms

A Major Championship, Reconstructed: Gaussian Splatting Goes to the PGA

The PGA showed both static and dynamic gaussian splatting last week during the

Michael Rubloff

May 18, 2026

Platforms

Gauss Cannon v1.2.0 Fixes Point Cloud Performance for Large Blender Scenes

Gauss Cannon v1.2.0 replaces per frame per mesh BVH rebuilding with a single world-space BVH in Blender.

Michael Rubloff

May 15, 2026