AnalysisAI Models
8 days ago
SEA-NLI benchmark evaluates LLMs on Southeast Asian culture
SEA-NLI is a new benchmark for evaluating cultural understanding in LLMs via natural language inference, focusing on Southeast Asian contexts. Existing NLI benchmarks are largely Western-centric, limiting assessment of underrepresented cultures.
·
8 days ago