Back to AIBriefs
AnalysisAI ModelsHealth

PhysAssistBench: Benchmark for LLMs in physician assistance

The benchmark evaluates LLMs on interactive doctor-patient-EHR tasks, testing multi-turn conversation, clinical reasoning, and EHR manipulation. Current models show significant gaps in integrating EHR context and handling complex patient interactions.

·
Jun 18, 4:00 AM
PhysAssistBench: Benchmark for LLMs in physician assistance — AIBriefs