DeepSWE benchmark shows big gap between closed and open models

AnalysisAI Models

13 days ago

DeepSWE benchmark shows big gap between closed and open models

A Reddit post highlights a new benchmark, DeepSWE, revealing a large performance gap between proprietary and open-source AI models. The post includes a comparison chart. Commenters express hope that open-source will catch up.

13 days ago