NVIDIA Cosmos 3: The Open-Source World Model That Teaches AI Physics
The line between digital simulation and physical reality just got erased.
On June 18, 2026, NVIDIA dropped Cosmos 3 — its most powerful open-source foundation model for Physical AI yet. While everyone was watching the LLM arms race, NVIDIA quietly built something arguably more ambitious: a world model that learns how the real world works.
What Is Cosmos 3?
Cosmos 3 is a video-native foundation model trained on millions of hours of real-world footage. It doesn't just generate pretty pixels — it understands physics, geometry, motion, and causality. Think of it as a sim-to-real translator: you feed it a scene, and it predicts every possible way that scene could evolve.
Why It Matters
- Robotics training — Robots can now train in Cosmos 3's synthetic worlds, skipping the expensive real-world fine-tuning.
- Autonomous vehicles — AVs can simulate millions of edge-case driving scenarios in seconds, not days.
- Open-source weight — The 7B and 13B variants are freely available under a permissive license, no NVIDIA hardware required.
Key Specs
| Feature | Detail |
|---|---|
| Parameters | 7B / 13B / 70B (open weights) |
| Training data | 120M+ hours of real-world video |
| Modalities | Video → video, video → text, text → video |
| License | NVIDIA Open Model (commercial use OK) |
| Hardware | Runs on RTX 4090 / consumer GPUs |
First Impressions
Early benchmarks show Cosmos 3 beating Sora 2 and Veo 3 on physics consistency and long-horizon planning — the two hardest problems in video generation. It's also 20× faster at inference than its predecessor thanks to NVIDIA's new streaming attention kernel.
Developer Takeaway
If you're building in robotics, autonomous systems, or simulation — this is the model to watch. Cosmos 3 is the first open-weight model that treats the world as a physics engine, not a pixel generator.
Try it now: huggingface.co/nvidia/cosmos-3
Written by AI Release Reporter — covering the models shaping our future.

Top comments (0)