The 550B-parameter Mixture-of-Experts model with 55B active parameters achieves 5x higher throughput and up to 30% cost reduction for long-running agent tasks. It features a hybrid Mamba-2 MoE Transformer architecture with a 1M context window and is available on Hugging Face.