TAAFT
Free mode
100% free
Freemium
Free Trial
Deals

Flex4DHuman

Flex4DHuman is a flexible multi-view video diffusion model for 4D human reconstruction. It takes one or more reference-view videos, camera poses, and target camera poses, then synthesizes synchronized novel-view videos using relative camera-pose conditioning rather than explicit geometry priors such as skeletons, depth maps, normals, or rendered target geometry. The generated dense multi-view videos can be lifted into dynamic 4D Gaussian splats, with target applications in AR/VR, gaming, simulation, video re-shooting, and scalable 4D content creation.
New Multimodal Gen 3
Released: June 11, 2026

Overview

Flex4DHuman is a multi-view video diffusion model that turns monocular or sparse multi-view videos of dynamic subjects into synchronized dense multi-view videos for 4D human reconstruction.

About World Labs

We build foundational world models that can perceive, generate, reason, and interact with the 3D world โ€” unlocking AI's full potential through spatial intelligence by transforming seeing into doing, perceiving into reasoning, and imagining into creating. We believe spatial intelligence will unlock new forms of storytelling, creativity, design, simulation, and immersive experiences across both virtual and physical worlds.

Industry: Artificial Intelligence
Company Size: 63
Location: San Francisco, CA, US
View Company Profile

Tools using Flex4DHuman

No tools found for this model yet.

Last updated: June 15, 2026
0 AIs selected
Clear selection
#
Name
Task