Analysis · AI Models
8 days ago
BehaviorBench: New benchmark for modeling user decisions from behavioral traces
BehaviorBench evaluates AI models on predicting real-world user decisions from behavioral traces. The benchmark uses actual user interaction data rather than simulated behavior.