Analysis, AI Models, Health
Jun 18, 4:00 AM
PhysAssistBench: Benchmark for LLMs in physician assistance
The benchmark evaluates LLMs on interactive doctor-patient-EHR tasks, testing multi-turn conversation, clinical reasoning, and EHR manipulation. Current models show significant gaps in integrating EHR context and handling complex patient interactions.