Radiance Field Learners As UAV First-Person Viewers

Liqi Yan

Qifan Wang

Junhan Zhao

Qiang Guan

Zheng Tang

Jianhui Zhang

Dongfang Liu

First-Person-View (FPV) holds immense potential for revolutionizing the trajectory of Unmanned Aerial Vehicles (UAVs), offering an exhilarating avenue for navigating complex building structures. Yet, traditional Neural Radiance Field (NeRF) methods face challenges such as sampling single points per iteration and requiring an extensive array of views for supervision. UAV videos exacerbate these issues with limited viewpoints and significant spatial scale variations, resulting in inadequate detail rendering across diverse scales. In response, we introduce FPV-NeRF, addressing these challenges through three key facets: (1) Temporal consistency. Leveraging spatio-temporal continuity ensures seamless coherence between frames; (2) Global structure. Incorporating various global features during point sampling preserves space integrity; (3) Local granularity. Employing a comprehensive framework and multi-resolution supervision for multi-scale scene feature representation tackles the intricacies of UAV video spatial scales. Additionally, due to the scarcity of publicly available FPV videos, we introduce an innovative view synthesis method using NeRF to generate FPV perspectives from UAV footage, enhancing spatial perception for drones. Our novel dataset spans diverse trajectories, from outdoor to indoor environments, in the UAV domain, differing significantly from traditional NeRF scenarios. Through extensive experiments encompassing both interior and exterior building structures, FPV-NeRF demonstrates a superior understanding of the UAV flying space, outperforming state-of-the-art methods in our curated UAV dataset. Explore our project page for further insights: https://fpv-nerf.github.io/.

PDF URL

Featured

Platforms

SplatCapture 3.0.0 Adds Spline Capture

SplatCapture 3.0.0 adds a spline capture mode, Unreal Engine 5.8 support, and square-aspect camera rigs that give Gaussian splatting trainers clean intrinsics.

Michael Rubloff

Jul 17, 2026

Platforms

Marble x Nuke Loads World Labs Marble Splats Straight Into Nuke's 3D Viewport

Marble x Nuke is a free Nuke 17 toolset that turns World Labs Marble text or image prompts into Gaussian splats loaded straight into Nuke's native 3D viewport.

Michael Rubloff

Jul 15, 2026

Platforms

Houdini 22 Ships, Making Native Gaussian Splats Generally Available

Houdini 22 is now generally available, making SideFX's native Gaussian splatting pipeline, PDG training and Copernicus compositing, production ready today.

Michael Rubloff

Jul 15, 2026

360 Gaussian v1.4.5 Adds Per-Clip Extraction Settings

360 Gaussian v1.4.5 updates the 360 video to 3DGS tool with per-clip extraction settings and fixes swapped GPS coordinates for accurate scale.

Michael Rubloff

Jul 15, 2026