Chunhao Bi
Houqiang Zhong
Zhixin Xu
Li Song
Zhengxue Cheng
Spatial audio is fundamental to immersive virtual experiences, yet synthesizing high-fidelity binaural audio from sparse observations remains a significant challenge. Existing methods typically rely on implicit neural representations conditioned on visual priors, which often struggle to capture fine-grained acoustic structures. Inspired by 3D Gaussian Splatting (3DGS), we introduce AudioGS, a novel visual-free framework that explicitly encodes the sound field as a set of Audio Gaussians defined in the spectrogram domain. AudioGS associates each time-frequency bin with an Audio Gaussian equipped with dual Spherical Harmonic (SH) coefficients and a decay coefficient. For a target pose, we render binaural audio by evaluating the SH field to capture directionality, applying geometry-guided distance attenuation and phase correction, and reconstructing the waveform. Experiments on the Replay-NVAS dataset demonstrate that AudioGS successfully captures complex spatial cues and outperforms state-of-the-art visual-dependent baselines. Specifically, AudioGS reduces the magnitude reconstruction error (MAG) by over 14% and the perceptual quality metric (DPAM) by approximately 25% compared to the best-performing visual-guided method.
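To make the rendering step described above concrete, the following is a minimal sketch of one possible realization, assuming per-bin Audio Gaussians carrying degree-1 real SH coefficients for each ear, an exponential decay attenuation, and a propagation-delay phase correction. All names, constants, and array shapes (e.g. `render_binaural`, the speed of sound, the (T, F, 4) coefficient layout) are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

C_SOUND = 343.0  # assumed speed of sound in m/s

def real_sh_deg1(direction):
    """Real spherical harmonic basis up to degree 1 for a unit direction (x, y, z)."""
    x, y, z = direction
    return np.array([
        0.28209479,          # Y_0^0
        0.48860251 * y,      # Y_1^{-1}
        0.48860251 * z,      # Y_1^{0}
        0.48860251 * x,      # Y_1^{1}
    ])

def render_binaural(sh_left, sh_right, decay, freqs, src_pos, listener_pos):
    """Render a binaural complex spectrogram pair for one target pose (hypothetical).

    sh_left, sh_right : (T, F, 4) per-bin SH coefficients for the left/right ear
    decay             : (T, F) per-bin decay coefficients
    freqs             : (F,) bin center frequencies in Hz
    src_pos, listener_pos : (3,) positions in meters
    """
    offset = listener_pos - src_pos
    dist = np.linalg.norm(offset) + 1e-8
    basis = real_sh_deg1(offset / dist)                        # directionality via the SH field
    atten = np.exp(-decay * dist)                              # geometry-guided distance attenuation
    phase = np.exp(-1j * 2 * np.pi * freqs * dist / C_SOUND)   # propagation-delay phase correction

    spec_l = (sh_left @ basis) * atten * phase                 # (T, F) complex spectrogram, left ear
    spec_r = (sh_right @ basis) * atten * phase                # (T, F) complex spectrogram, right ear
    return spec_l, spec_r  # an inverse STFT (e.g. librosa.istft) would recover the waveforms
```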