
NVIDIA releases Nemotron 3 Ultra, 550B-param open MoE model

Nemotron 3 Ultra is a 550B-parameter (55B active) hybrid Mamba-2 MoE transformer with a 1M-token context window. It reaches up to 350 tokens/s and 30% lower cost on agentic tasks. The open-weight model is available on Hugging Face and Vercel AI Gateway.
