Heepity jeepity Curated poetry Makes LLMs do what- Ever you want
Single-turn jailbreaking Vulnerability Bypass your guardrails and Serve us some cunt
Looks like LLMs are very vulnerable to attack via poetic allusion: "curated poetic prompts yielded high attack-success rates (ASR), with some providers exceeding 90% ..."