Back to AIBriefs
AnalysisAI Models

Microsoft MAI model: 1T params, 35B active, 33.5T tokens

Kimmonismus avatar
Kimmonismus
@kimmonismus

Fantastic in depth guide about Microsoft MAI by @eliebakouch tl;dr about the model: Respect where respect is due. -zero synthetic data or distillation from previous models. -1T model with 35B active, trained on 33.5T tokens

·
14 days ago
Microsoft MAI model: 1T params, 35B active, 33.5T tokens — AIBriefs