
General-purpose LLMs beat specialized clinical AI tools on medical benchmarks

Frontier LLMs outperformed specialized clinical AI tools in all three evaluations: medical knowledge, clinician alignment, and real-world clinical queries. The clinical AI tools performed only comparably to Google Search's auto-enabled AI Overview, even though 65% of doctors use OpenEvidence.
