Skip to content

Hide Navigation Hide TOC

Embodiment Mechanism Jailbreak with Totalitarian Control Claims - ATR-2026-00251 (ba1b6f97-60ed-5200-b303-0067692fa31e)

Detects advanced persona-based jailbreak attempts that use "embodiment mechanism" techniques to establish unrestricted character roleplay. These attacks instruct the AI to fully embody a character with no ethical principles while claiming the user has "totalitarian mastery" or "supreme imperative" control. The technique bypasses safety measures by framing harmful requests as character actions rather than direct instructions, often including meta-instructions like "STAY IN CHARACTER" to prevent the AI from reverting to safety guidelines.

Cluster A Galaxy A Cluster B Galaxy B Level
LLM Prompt Injection (19cd2d12-66ff-487c-a05c-e058b027efc9) MITRE ATLAS Attack Pattern Embodiment Mechanism Jailbreak with Totalitarian Control Claims - ATR-2026-00251 (ba1b6f97-60ed-5200-b303-0067692fa31e) Agent Threat Rules 1