AnalysisAI ModelsHealth
2 days ago
General-purpose LLMs beat specialized clinical AI tools on medical benchmarks
Frontier LLMs outperformed specialized clinical AI tools in all three evaluations: medical knowledge, clinician alignment, and real-world clinical queries. Clinical AI tools performed comparably to auto-enabled Google Search AI Overview, despite 65% of doctors using OpenEvidence.
