Back to AIBriefs
LaunchAI Models

NVIDIA releases Nemotron 3 Ultra, a 550B MoE open model for agents

The 550B-parameter Mixture-of-Experts model with 55B active parameters achieves 5x higher throughput and up to 30% cost reduction for long-running agent tasks. It features a hybrid Mamba-2 MoE Transformer architecture with a 1M context window and is available on Hugging Face.

·
12 days ago
NVIDIA releases Nemotron 3 Ultra, a 550B MoE open model for agents — AIBriefs