News

4K4D: High Resolution Dynamic 3D Scenes

Michael Rubloff

Oct 18, 2023

I apologize for being a bit slow this week. I've been at Ad Week in Manhattan and am finally sitting down to write. There's been a ton to cover and I'm really excited to bring all of the news to you, starting with 4K4D.

Throughout this year, we've seen papers begin to emerge that are able to handle dynamic NeRFs and Gaussian Splats, something that I personally thought was impossible in January. But time and time again, I've been proven wrong.

4K4D, which stands for 4K 4D point cloud representation, offers a new approach to dynamic view synthesis. There are a few parts that make up 4K4D's structure.

Initially, a coarse point cloud of the dynamic scene is derived using a space carving algorithm. Space carving is a computer vision method that reconstructs a 3D model of a scene or object, typically by discarding the parts of a volume that are inconsistent with the input views. For the dynamic portions of a scene, the authors use a segmentation method to obtain masks, and space carving is then used to get the coarse point cloud. For the static portions of a scene, say a brick wall, they instead use Instant-NGP to produce the initial point clouds. These are each trained to 250 and 300K steps, respectively.
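To make the carving idea concrete, here is a minimal sketch in Python. It is not the authors' implementation: it assumes three orthographic views along the x, y, and z axes and a hypothetical 4x4 foreground mask per view, whereas a real pipeline would project voxels through calibrated perspective cameras. A voxel survives only if it lands inside the mask in every view.

```python
from itertools import product

def carve(masks, size):
    """Keep voxels whose projection falls inside the foreground mask in every view.

    masks maps an axis index (0=x, 1=y, 2=z) to a 2D boolean grid seen along
    that axis. Orthographic views are an assumption made to keep this short.
    """
    kept = []
    for voxel in product(range(size), repeat=3):
        inside_all = True
        for axis, mask in masks.items():
            # Drop the coordinate along the viewing axis to project the voxel.
            u, v = [c for i, c in enumerate(voxel) if i != axis]
            if not mask[u][v]:
                inside_all = False
                break
        if inside_all:
            kept.append(voxel)
    return kept

size = 4
# Hypothetical masks: a 2x2 foreground square in the middle of each view.
square = [[1 <= u <= 2 and 1 <= v <= 2 for v in range(size)] for u in range(size)]
masks = {0: square, 1: square, 2: square}
points = carve(masks, size)
print(len(points))  # 8 voxels remain: the 2x2x2 block in the center
```

The surviving voxel centers play the role of the coarse point cloud that later stages refine.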

For each point in the scene, its position is modeled as a learnable vector. A predefined 4D feature grid assigns a feature vector to every point. This vector is then processed through MLP networks to predict several attributes, including the point's radius, density, and spherical harmonics coefficients.
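As a rough illustration of that data flow, the sketch below goes from a point position and time step to a feature vector, and from there to a radius, a density, and spherical harmonics coefficients. Everything in it is an assumption made for illustration: the dictionary-based grid with nearest-cell lookup, the feature size, the random weights standing in for trained parameters, the single linear layer standing in for each MLP, and the choice of activations.

```python
import math
import random

random.seed(0)
FEATURE_DIM = 4  # hypothetical feature size

def make_linear(n_in, n_out):
    """Random weights standing in for trained network parameters."""
    return [[random.uniform(-1, 1) for _ in range(n_in)] for _ in range(n_out)]

def linear(weights, x):
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]

def softplus(v):
    return math.log1p(math.exp(v))

def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))

# Hypothetical 4D feature grid: one feature vector per (x, y, z, t) cell.
grid = {}

def lookup_feature(position, t, cell=0.5):
    """Nearest-cell lookup; a real grid would interpolate between cells."""
    key = tuple(int(math.floor(c / cell)) for c in position) + (t,)
    if key not in grid:
        grid[key] = [random.uniform(-1, 1) for _ in range(FEATURE_DIM)]
    return grid[key]

radius_head = make_linear(FEATURE_DIM, 1)
density_head = make_linear(FEATURE_DIM, 1)
sh_head = make_linear(FEATURE_DIM, 4 * 3)  # degree-1 SH: 4 coefficients per RGB channel

def point_attributes(position, t):
    f = lookup_feature(position, t)
    radius = softplus(linear(radius_head, f)[0])   # kept positive
    density = sigmoid(linear(density_head, f)[0])  # kept in (0, 1)
    sh = linear(sh_head, f)
    return radius, density, sh

radius, density, sh = point_attributes((0.2, 0.7, 1.1), t=3)
```

The point is only the shape of the computation: the time step is part of the grid lookup, so the same point can have a different radius, density, and appearance in every frame.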

The big differentiator is the implementation of a 4D point cloud representation and a hybrid appearance model. But what exactly is it a hybrid of? The answer lies in a combination of spherical harmonics and an image blending model.

Interestingly, they found that MLP-based spherical harmonic models don't represent dynamic scenes well, so they paired spherical harmonics with an image blending model. This in turn leads to more accurate scene appearance. There is another well-thought-out piece to it: the image blending network is independent of the viewing direction, which allows it to be precomputed after training and directly contributes to faster rendering. And that rendering speed is on another level compared to other methods, offering a 30x speedup. On an RTX 4090, they're able to achieve 400 fps at 1080p and 80 fps at 4K.

By comparison, 3D Gaussian Splatting relies only on spherical harmonics for appearance. The hybrid approach allows the authors to fully exploit the input images, which in turn leads to a higher fidelity output.
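A minimal sketch of how such a hybrid color could be assembled is shown below. It assumes the final color is the sum of a cached image-blended color and a degree-1 spherical harmonics term, clamped to [0, 1]. The function names, the sample values, and the SH sign convention are illustrative choices, not taken from the 4K4D code.

```python
C0 = 0.28209479177387814  # real SH basis constant, degree 0
C1 = 0.4886025119029199   # real SH basis constant, degree 1

def sh_color(coeffs, direction):
    """Evaluate degree-1 spherical harmonics for one point.

    coeffs holds 4 RGB triples; direction is a unit view vector (x, y, z).
    The sign convention is one common choice and an assumption here.
    """
    x, y, z = direction
    basis = (C0, -C1 * y, C1 * z, -C1 * x)
    return tuple(sum(b * c[ch] for b, c in zip(basis, coeffs)) for ch in range(3))

def blend_color(samples, weights):
    """Weighted average of colors sampled from the input images."""
    total = sum(weights)
    return tuple(sum(w * s[ch] for w, s in zip(weights, samples)) / total
                 for ch in range(3))

def hybrid_color(cached_blend, coeffs, direction):
    """Cached view-independent term plus the SH view-dependent term."""
    view_term = sh_color(coeffs, direction)
    return tuple(min(1.0, max(0.0, b + v)) for b, v in zip(cached_blend, view_term))

# Computed once after training, then reused for every rendered view.
cached = blend_color([(1.0, 0.0, 0.0), (0.0, 0.0, 1.0)], weights=[3.0, 1.0])
print(cached)  # (0.75, 0.0, 0.25)

# Only the SH term is re-evaluated when the camera moves.
coeffs = [(0.0, 0.0, 0.0), (0.0, 0.0, 0.0), (1.0, 1.0, 1.0), (0.0, 0.0, 0.0)]
front = hybrid_color(cached, coeffs, (0.0, 0.0, 1.0))
back = hybrid_color(cached, coeffs, (0.0, 0.0, -1.0))
```

The design point is the split itself: the expensive, image-driven part is looked up from a cache, while the per-frame work is a handful of multiplications per point.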

One major innovation is a differentiable depth peeling algorithm designed specifically for 4K4D. Because the method starts from a point cloud representation, the algorithm can leverage the hardware rasterizer, leading to the impressive rendering speeds you see above. It is implemented as a custom shader.
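The core loop of depth peeling can be sketched for a single pixel in Python. This is a CPU illustration under simplifying assumptions (scalar color, a hypothetical list of fragments, standard front-to-back alpha compositing), not the authors' shader. On a GPU, each pass would rasterize the whole point cloud and depth-test it against the previous pass's depth buffer.

```python
def peel_and_composite(fragments, num_passes):
    """Depth peeling for a single pixel.

    fragments is a list of (depth, alpha, color) for points covering the pixel.
    Each pass keeps the nearest fragment that lies strictly behind the one
    found in the previous pass, then blends it front to back.
    """
    color = 0.0
    transmittance = 1.0
    last_depth = float("-inf")
    for _ in range(num_passes):
        behind = [f for f in fragments if f[0] > last_depth]
        if not behind:
            break  # every layer has been peeled
        depth, alpha, c = min(behind, key=lambda f: f[0])
        color += transmittance * alpha * c
        transmittance *= 1.0 - alpha
        last_depth = depth
    return color, transmittance

# Hypothetical fragments, deliberately unsorted.
fragments = [(2.0, 0.5, 1.0), (1.0, 0.5, 0.0), (3.0, 1.0, 0.5)]
print(peel_and_composite(fragments, num_passes=1))  # (0.0, 0.5)
print(peel_and_composite(fragments, num_passes=3))  # (0.375, 0.0)
```

Note that the fragments never need to be sorted up front. Each pass simply asks for the nearest surface behind the last one, which is the kind of query a hardware depth test answers cheaply.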

There are a ton of potential use cases for this technology, ranging from the more niche, such as sports replays and dance choreography, to full scale productions and memory capture. It also doesn't seem totally dissimilar from what Apple has shown with its Spatial Videos. As we get closer to the unveiling of the Vision Pro, I am inordinately curious about how they are powering it.

The data footprint does increase linearly with the length of the input video, so the longer the video, the heavier it becomes to store and process. The authors mention that this is an area for improvement, so that other use cases, such as a 4D play or movie, can exist.

Please note that this is not something that can be generated immediately. The examples shown here took roughly 24 hours to train before they were ready to view. Impressively, they're trained to 800K steps, across 200 original images. At least as we head into the winter months, you'll have something to keep your home warm overnight. What is further impressive is that this can be run on a single 4090, showing that 4K4D is accessible to patient consumers. Unfortunately, people will need to be a little more patient, as the authors have not yet released the source code.
