Back to AIBriefs
LaunchAI ModelsAI Agents

NVIDIA releases Nemotron 3 Ultra, a 550B open model for agents

550B total parameters, 55B active, hybrid Mamba-Attention MoE with 1M context window. Up to 5x faster inference and 30% lower cost for agentic tasks. Weights available on HuggingFace; available on Perplexity for Pro/Max users.

NVIDIA releases Nemotron 3 Ultra, a 550B open model for agents — AIBriefs