Analysis · AI Models
3 hours ago
Blog finds Gemini 3.1 less worried about cafe financial losses than GPT-5.5
Ethan Mollick
@emollick.bsky.social
Professor at Wharton, studying AI and its implications for education, entrepreneurship, and work. Author of Co-Intelligence. Book: https://a.co/d/bC2kSj1 Substack: https://www.oneusefulthing.org/ Web: https://mgmt.wharton.upenn.edu/profile/emollick
You need to benchmark models for your use case. As soon as judgements & decisions stack on top of each other, the differences between models amplify, and no standard benchmark will tell you that Gemini 3.1 is less worried about financial losses at a cafe than GPT-5.5: andonlabs.com/blog/why-gem