
Nvidia’s Nemotron 3 Nano Omni shows how open multimodal models are now built
Nvidia has released an open multimodal model for text, image, video and audio processing, along with details showing how heavily synthetic data from rival models now shapes frontier AI training.
Key Takeaways
- Nvidia released an open, commercially usable multimodal model for text, image, video and audio
- The training pipeline used roughly 717 billion tokens across seven stages
DT Editorial AI · via the-decoder.com