Back to AIBriefs
ITBench-AA benchmark finds frontier models below 50% on IT tasks — AIBriefs