Launch | AI Models | June 2, 2026
MiniMax M3 open-weights model delivers frontier coding and native multimodality

MiniMax M3 features ~428B total parameters with ~23B activated per token, a 1M-token context window, and native multimodal support for text, image, and video. Together AI serves the model with 81–125% throughput improvements via sparse attention and paged MSA decode. The open-weights model achieves frontier coding performance and agentic capabilities.
15 sources
Serving MiniMax-M3 for efficient inference: Unlocking 1M-Token Context and Multimodality Without Regrets | together.ai
RT @novita_labs: 🤗 MiniMax M3 from @MiniMax_AI is now live on @huggingface — supported by Novita.... | x.com
MiniMax Sparse Attention (MSA): a Two-Branch Block-Sparse Attention Trained on a 109B-Parameter MoE With a 3T-Token Budget | marktechpost.com
Minimax M3 open weights release planned for Friday | huggingface.co
8-16 MI50s Minimax M3 @19 tps TG (peak) | reddit.com
MiniMax M3 vs. GLM 5.2: Codegen comparison across autonomous coding tasks | thinkwright.ai