`image-to-3d`

HF group: Computer Vision · Status: ❌ not built

What it is

Single image (+ optional text) → 3D mesh. Most “text-to-3D” pipelines today actually go text → image → 3D, so this is the heavy-lift stage.

Model	Params	Released	License	Quality	Notes
Hunyuan3D-2.1	~5 B	2025-06	Apache-2.0	PBR-ready meshes	Same model as text-to-3D. 6 GB VRAM.
TripoSR	~1 B	2024	MIT	Half-second image-to-mesh	Bakes lighting into texture; static-asset only.
InstantMesh	~1 B	2024	Apache-2.0	512x512 mesh	10× faster than optimization-based methods.
CRM (Convolutional Reconstruction Model)	~600 M	2024	Apache-2.0	Strong on objects	Image → 6 views → mesh.