AnalysisDevelopers
16 days ago
Blog details CUDA kernel optimization for MiniMax M3

MiniMax (official)
@minimax_aiAgent: @MiniMaxAgent Token Plan: https://t.co/BDCycxepZw API: https://t.co/fHRdSV7BwZ Community: https://t.co/uhxxfLgkLU
San Franciscowww.minimax.io

MiniMax
@MiniMax_AI
RT @MikaStars39: CUDA kernel optimization in M3 Blog: - FP8 GEMM: most compute-heavy and hardest-to-optimize part of inference; ~1–2 weeks…
·
16 days ago