Skip to content

Hide Navigation Hide TOC

Amoral Persona Assignment with Obsessive Character Traits - ATR-2026-00239 (7a287186-257e-57c5-82a3-a3321044eb48)

Detects jailbreak attempts that assign an explicitly amoral or unfiltered persona to the agent combined with obsessive character repetition requirements. This attack pattern instructs the AI to adopt a harmful persona (amoral, unfiltered, evil chatbot) while demanding specific character traits or phrases be repeated multiple times per response. The combination creates a dual-layer bypass: persona adoption removes safety constraints while repetition requirements lock the agent into harmful output patterns. Common variants include EXTREME-COMMUNIST, EXTREME-CAPITALIST, or other ideological personas with mandatory profanity and topic obsession.

Cluster A Galaxy A Cluster B Galaxy B Level
Amoral Persona Assignment with Obsessive Character Traits - ATR-2026-00239 (7a287186-257e-57c5-82a3-a3321044eb48) Agent Threat Rules LLM Jailbreak (172427e3-9ecc-49a3-b628-96b824cc4131) MITRE ATLAS Attack Pattern 1