Analysis · Health
6 days ago
PSEBench evaluates LLMs for patient safety triage
PSEBench is a benchmark for evaluating LLMs on patient safety event triage, a high-stakes clinical task. It tests whether a model can correctly decide if an event is reportable under jurisdiction-specific policies.