GuardNet uses shallow neural networks to detect prompt injection and jailbreak attacks

AnalysisCybersecurity

6 days ago

GuardNet uses shallow neural networks to detect prompt injection and jailbreak attacks

The paper introduces GuardNet, an ensemble of shallow neural networks for detecting Prompt Injection and Jailbreak attacks on LLMs. Benchmark evaluations may be affected by contamination and partial information.

6 days ago