What “Image to 3D” really means

Single Image to 3D model is hardest and most ambiguous task compared to Multi-view / video.

Mesh: Consists of triangles. This is great for computer graphics. But, hard to optimize directly
Point Cloud: set of 3D points. Easy to get, messy to render.
Voxel grid: 3D pixels. Just simple but memory-intensive
Implicit fields: Mathematical functions such as NeRF and SDFs.
3D Gaussian Splatting: a set of Gaussians optimized for fast, high-quality novel-view rendering.

How to Get Started with Image to 3D

Summary

Before diving into the theory, you must develop 3D literacy by learning to manually handle and visually inspect 3D data firsthand.

Open3D is a powerful tool for handling Point Clouds and Meshes, Loading Data and Visualization.

Therefore, You should build your own “3D Inspector Toolkit” rather than just executing existing source code.

Your toolkit should include the following features:

Critical Precautions:

Here is a summary of my CS231A studies.

← Previous Post Next Post →