Back to AIBriefs
AnalysisAI Models

Qwen 3.6 27B scores 2% on DeepSWE benchmark

Scored 2% (1.79% rounded), placing 18/20, above Haiku 4.5 and Minimax M2.7. Benchmark took 70 hours, averaging 32 minutes per task with 44k output tokens per task.

·
10 days ago
Qwen 3.6 27B scores 2% on DeepSWE benchmark — AIBriefs