AnalysisAI Models
23 hours ago
New paper proposes method to anticipate LLM failure modes without evaluation
Naomi Saphra
@nsaphra.bsky.socialWaiting on a robot body. All opinions are universal and held by both employers and family. ML/NLP professor. nsaphra.net
Naomi Saphra
@nsaphra.bsky.social
We don’t always know what problems are hard for LLMs. So devs evaluate on tasks HUMANS find hard or on broad benchmarks. What if we could instead anticipate which scenarios a model will fail on—all without evaluating specific input examples? 🧵NEW PAPER by @jenniferlumeng.bsky.social
·
23 hours ago