PhysAssistBench: Benchmark for LLMs in physician assistance

AnalysisAI ModelsHealth

Jun 18, 4:00 AM

PhysAssistBench: Benchmark for LLMs in physician assistance

The benchmark evaluates LLMs on interactive doctor-patient-EHR tasks, testing multi-turn conversation, clinical reasoning, and EHR manipulation. Current models show significant gaps in integrating EHR context and handling complex patient interactions.

Jun 18, 4:00 AM