General-purpose LLMs beat specialized clinical AI tools on medical benchmarks

AnalysisAI ModelsHealth

2 days ago

General-purpose LLMs beat specialized clinical AI tools on medical benchmarks

Frontier LLMs outperformed specialized clinical AI tools in all three evaluations: medical knowledge, clinician alignment, and real-world clinical queries. Clinical AI tools performed comparably to auto-enabled Google Search AI Overview, despite 65% of doctors using OpenEvidence.

There has been a push to use OpenEvidence AI for doctors. But this paper suggests general models are...1 day agoEthan Mollick

2 days ago