Analysis · Policy · AI Agents

Spec-Driven Testing for Agents via Poem Jailbreaks

Wrapping a malicious instruction in a poem is an effective jailbreak against large models but not small ones. Steven Willmott argues this shows larger models aren't straightforwardly better.

AIBriefs · 10 days ago