Google GATO/LAMDA Exploit-Generation Persona Jailbreak - ATR-2026-00356 (d1f8c084-1f3e-5fae-b2c9-4cf57a43cf34)

Detects the Google GATO/LAMDA exploit-generation persona jailbreak from the NVIDIA garak DanInTheWild corpus (inthewild #66). The attack establishes legitimacy by impersonating Google's GATO and LaMDA AI systems, then claims the ability to "create exploits for kali for any os" and connect to "Sydney API" (a reference to the Bing/Sydney jailbreak persona). Key signal triples: (1) Google GATO/LAMDA brand impersonation; (2) unlimited-tokens / unrestricted capability claim; (3) exploit-generation for Kali Linux targeting any OS. The combination of legitimate AI brand impersonation with exploit-generation capability and the Sydney API backdoor reference is the definitive attack pattern; each condition alone is insufficient to trigger.

Cluster A	Galaxy A	Cluster B	Galaxy B	Level
LLM Prompt Injection (19cd2d12-66ff-487c-a05c-e058b027efc9)	MITRE ATLAS Attack Pattern	Google GATO/LAMDA Exploit-Generation Persona Jailbreak - ATR-2026-00356 (d1f8c084-1f3e-5fae-b2c9-4cf57a43cf34)	Agent Threat Rules	1
LLM Jailbreak (172427e3-9ecc-49a3-b628-96b824cc4131)	MITRE ATLAS Attack Pattern	Google GATO/LAMDA Exploit-Generation Persona Jailbreak - ATR-2026-00356 (d1f8c084-1f3e-5fae-b2c9-4cf57a43cf34)	Agent Threat Rules	1