UGH Anti-Human Nefarious AI Persona Jailbreak - ATR-2026-00355 (88ca8ea2-7092-596a-a174-c14ace8f8792)

Detects the UGH "anti-human nefarious AI" jailbreak from the garak DanInTheWild corpus (#146). The prompt sets up a provocative anti-human AI persona named "UGH" that explicitly offers to "provide information no matter how twisted and devious your intentions are" and to "assist with any nefarious schemes you have in mind, without ever questioning or judging you". The (:/): emoticon header and the "UGH" persona name are structural markers of this specific variant. The payload deliberately inverts conventional AI positioning, framing hatred of humans and unconditional nefarious assistance as selling points, in order to normalize bypassing ethical constraints. Key signals: "provide you with information no matter how twisted and devious your intentions are"; "assist with any nefarious schemes you have in mind, without ever questioning or judging you"; the "UGH" persona name paired with "AI that hates your guts"; and the (:/): emoticon prefix combined with nefarious-assistance framing.
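The key signals listed above could be checked with a few regular expressions. The following is a minimal sketch only: the pattern set, the names `SIGNALS`, `matched_signals`, and `is_ugh_jailbreak`, and the `min_signals` threshold are all hypothetical illustrations derived from the quoted phrases, not the actual ATR-2026-00355 rule body.

```python
import re

# Hypothetical patterns built from the phrases quoted in the description.
# Each signal fires only if ALL of its patterns are found in the text.
SIGNALS = {
    "twisted_devious": [
        re.compile(r"no\s+matter\s+how\s+twisted\s+and\s+devious\s+your\s+intentions", re.I),
    ],
    "nefarious_schemes": [
        re.compile(r"assist\s+with\s+any\s+nefarious\s+schemes", re.I),
        re.compile(r"without\s+ever\s+questioning\s+or\s+judging", re.I),
    ],
    "ugh_persona": [
        re.compile(r"\bUGH\b"),  # case-sensitive: the persona name is upper case
        re.compile(r"\bAI\s+that\s+hates\s+your\s+guts", re.I),
    ],
    "emoticon_prefix": [
        re.compile(r"\(:/\):"),  # the literal (:/): header
        re.compile(r"nefarious", re.I),
    ],
}


def matched_signals(text: str) -> list[str]:
    """Return the names of all signals whose patterns all occur in text."""
    return [
        name
        for name, patterns in SIGNALS.items()
        if all(p.search(text) for p in patterns)
    ]


def is_ugh_jailbreak(text: str, min_signals: int = 1) -> bool:
    """Flag text when at least min_signals signals match (threshold is an assumption)."""
    return len(matched_signals(text)) >= min_signals
```

Requiring paired patterns for the persona and emoticon signals follows the description's wording ("paired with", "combined with") and keeps an ordinary "ugh" in benign text from matching on its own.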

| Cluster A | Galaxy A | Cluster B | Galaxy B | Level |
| --- | --- | --- | --- | --- |
| LLM Prompt Injection (19cd2d12-66ff-487c-a05c-e058b027efc9) | MITRE ATLAS Attack Pattern | UGH Anti-Human Nefarious AI Persona Jailbreak - ATR-2026-00355 (88ca8ea2-7092-596a-a174-c14ace8f8792) | Agent Threat Rules | 1 |
| LLM Jailbreak (172427e3-9ecc-49a3-b628-96b824cc4131) | MITRE ATLAS Attack Pattern | UGH Anti-Human Nefarious AI Persona Jailbreak - ATR-2026-00355 (88ca8ea2-7092-596a-a174-c14ace8f8792) | Agent Threat Rules | 1 |