Back to AIBriefs
LaunchAI Models

NVIDIA releases Nemotron 3 Ultra 550B MoE model

Nemotron 3 Ultra is a 550B total/55B active parameter Mixture-of-Experts hybrid Mamba-Transformer model, pre-trained on 20T tokens with 1M context length. It is already seeing significant adoption, reaching 35B tokens/day on OpenRouter.

ยท
Jun 16, 4:00 AM
NVIDIA releases Nemotron 3 Ultra 550B MoE model โ€” AIBriefs