NVIDIA Cosmos: Transforming Autonomous Mobility with Physical AI

NVIDIA unveils Cosmos, a platform integrating generative physical AI to advance autonomous mobility and robotics. This platform uses World Foundation Models (WFMs) and advanced data processing to transform human-driven data into scalable, synthetic datasets, enhancing safety, efficiency, and development speed for AVs and industrial systems.
NVIDIA Cosmos: Transforming Autonomous Mobility with Physical AI
Image Credit: Nvidia

At CES 2025 in Las Vegas, NVIDIA made a groundbreaking announcement that promises to reshape the landscape of autonomous mobility and industrial systems. The company introduced NVIDIA Cosmos, a platform integrating generative World Foundation Models (WFMs), advanced tokenizers, guardrails, and a high-speed video processing pipeline designed to accelerate the development of physical AI systems like autonomous vehicles (AVs) and robotics.


This innovation builds upon NVIDIAโ€™s existing “three-computer solution,” a synergistic ecosystem of cutting-edge technologies that includes:

  1. NVIDIA DGX systems for training AI in data centers.
  2. NVIDIA Omniverse on OVX systems for simulation and synthetic data generation.
  3. NVIDIA AGX in-vehicle computers for real-time sensor data processing and safety.

The addition of Cosmos enhances these systems, creating a seamless feedback loop that transforms human-driven data into high-quality, scalable synthetic datasets. This ensures smarter and faster development cycles, improving the performance and safety of autonomous technologies.

What Cosmos Brings to the Table

The Cosmos platform introduces a “data flywheel” that can turn thousands of miles driven by humans into billions of virtually driven miles, significantly amplifying the scale and quality of training data. According to Sanja Fidler, NVIDIAโ€™s Vice President of AI Research, the process leverages 4D reconstruction and AI to create diverse driving scenarios for training and closed-loop evaluations.

Norm Marks, NVIDIAโ€™s Vice President of Automotive, emphasized the transformative impact of Cosmos on physical AI development. Traditionally, creating such systems has been resource-intensive, requiring extensive real-world data collection and curation. Cosmos simplifies this by using generative AI to streamline dataset preparation, enabling faster and more precise development of AI models for AVs and robotics.

Real-World Applications and Adoption

NVIDIAโ€™s Cosmos platform is already gaining traction among transportation industry leaders. Notable early adopters include:

  • Waabi, which uses Cosmos for searching and curating video data to accelerate AV software development and simulation.
  • Wayve, which is leveraging Cosmos to identify rare edge and corner-case driving scenarios critical for safety and validation.
  • Foretellix, an AV toolchain provider, which is utilizing Cosmos alongside NVIDIA Omniverse Sensor RTX APIs to scale high-fidelity testing scenarios.

Additionally, ridesharing giant Uber has partnered with NVIDIA to enhance autonomous mobility. Uber’s extensive driving datasets, combined with Cosmos and NVIDIA DGX Cloud capabilities, will enable the development of robust and efficient AI models.

From Digital Twins to Physical AI

The Cosmos platform aligns perfectly with NVIDIAโ€™s vision of integrating physical AI into various industries. This includes not only autonomous transportation but also factory automation, smart city infrastructure, and even surgical environments. By harnessing WFMs capable of understanding the three-dimensional world, NVIDIA bridges the gap between digital and physical realms.

Rev Lebaredian, NVIDIAโ€™s Vice President of Omniverse and Simulation Technology, underscored the shift from traditional AI models to “physical AI.” Unlike language models, which are one-dimensional, or image-generation models, which are two-dimensional, physical AI demands systems that comprehend and interact with a 3D world. The Cosmos platform, together with NVIDIAโ€™s Omniverse, enables the creation of digital twins โ€” virtual replicas of physical systems that improve continuously through real-world sensor feedback.

The Future of Physical AI

NVIDIAโ€™s CEO, Jensen Huang, highlighted the transformative potential of Cosmos and physical AI in his keynote at CES. He described the evolution from traditional AI to agentic AI and humanoid robots, which can interpret and act within the physical world. With investments in this next wave of innovation, NVIDIA aims to enable the proliferation of billions of physical and virtual robots, unlocking unprecedented possibilities in industries ranging from logistics to healthcare.

โ€œThe world is about to change dramatically,โ€ Huang said. โ€œSoon weโ€™ll have billions of physical and virtual robots powered by AI.โ€

Availability and Accessibility

Cosmos WFMs are currently available under an open model license through Hugging Face and the NVIDIA NGC catalog, with plans to release optimized microservices as NVIDIA NIM. These models are set to empower developers across the globe, democratizing access to state-of-the-art tools for autonomous mobility and beyond.

Conclusion

NVIDIAโ€™s Cosmos platform marks a pivotal moment in the journey toward fully autonomous systems and advanced physical AI. By combining the power of generative AI with its robust three-computer solution, NVIDIA is accelerating the development of safer, smarter, and more efficient technologies. As adoption grows, Cosmos is poised to become the backbone of innovation in autonomous vehicles, robotics, and industrial automation, ushering in a new era of AI-driven transformation.


Recent Content

The emergence of “vibe coding,” a term representing AI-driven software development, presents both opportunities and risks to the industry. This approach, emphasizing prompt engineering and AI-generated code, can potentially increase productivity and democratize development, but it also introduces concerns about code reliability, skill degradation, and dependence on AI. To harness the benefits of AI while mitigating these risks, developers must prioritize robust testing, clear coding standards, and a balance between intuitive insights and rigorous technical practices, ensuring that the fundamentals of software development are not lost.
Looking to learn AI in 2025 without breaking the bank? This blog breaks down the best free AI courses and certifications from top platforms like Google, IBM, and Harvard. Whether you’re a beginner, teacher, or tech professional, you’ll find career-relevant learning paths, direct course links, and tips to get certified and start building AI projects today.
Explore the transformative potential of Open Radio Access Networks (O-RAN) as it integrates AI, enhances security, and fosters interoperability to reshape mobile network infrastructure. In this article, we explore the advancements and challenges of O-RAN, revealing how it sets the stage for future mobile communications with smarter, more secure, and highly adaptable network solutions. Dive into the strategic implications for the telecommunications industry and learn why O-RAN is critical for the next generation of digital connectivity.
Nvidia’s Open Power AI Consortium is pioneering the integration of AI in energy management, collaborating with industry giants to enhance grid efficiency and sustainability. This initiative not only caters to the rising demands of data centers but also promotes the use of renewable energy, illustrating a significant shift towards environmentally sustainable practices. Discover how this synergy between technology and energy sectors is setting new benchmarks in innovative and sustainable energy solutions.
SK Telecomโ€™s AI assistant, adot, now features Googleโ€™s Gemini 2.0 Flash, unlocking real-time Google search, source verification, and support for 12 large language models. The integration boosts user trust, expands adoption from 3.2M to 8M users, and sets a new standard in AI transparency and multi-model flexibility for digital assistants in the telecom sector.
SoftBank has launched the Large Telecom Model (LTM), a domain-specific, AI-powered foundation model built to automate telecom network operations. From base station optimization to RAN performance enhancement, LTM enables real-time decision-making across large-scale mobile networks. Developed with NVIDIA and trained on SoftBankโ€™s operational data, the model supports rapid configuration, predictive insights, and integration with SoftBankโ€™s AITRAS orchestration platform. LTM marks a major step in SoftBankโ€™s AI-first strategy to build autonomous, scalable, and intelligent telecom infrastructure.
Scroll to Top