Harvey's LAB benchmark for legal agents

LaunchDevelopersLegal

10 days ago

Harvey's LAB benchmark for legal agents

LangChain

@langchain

Powering the Agent Development Lifecycle. Makers of LangSmith and @LangChain_OSS and @LangChain_JS.

www.langchain.com

View on X

LangChain

@LangChain

.@Harvey’s LAB benchmark approaches verification like a human would. Every task in a dataset has criteria for the task to pass. Legal agents can have 50+, with each one having its own judge call. It’s easy to audit, but can be expensive at scale. LangChain Labs teamed up with

10 days ago