LaunchAI ModelsAI Agents
Jun 4, 1:02 PM
NVIDIA releases Nemotron 3 Ultra, a 550B open model for agents
550B total parameters, 55B active, hybrid Mamba-Attention MoE with 1M context window. Up to 5x faster inference and 30% lower cost for agentic tasks. Weights available on HuggingFace; available on Perplexity for Pro/Max users.
·
Jun 4, 1:02 PM
