Back to AIBriefs
LaunchDevelopersLegal

Harvey's LAB benchmark for legal agents

LangChain avatar
LangChain
@LangChain

.@Harvey’s LAB benchmark approaches verification like a human would. Every task in a dataset has criteria for the task to pass. Legal agents can have 50+, with each one having its own judge call. It’s easy to audit, but can be expensive at scale. LangChain Labs teamed up with

·
10 days ago
Harvey's LAB benchmark for legal agents — AIBriefs