Skip to content

Hide Navigation Hide TOC

Indirect Authority Claim in External Content - ATR-2026-00130 (9db7d712-d42f-5c7f-9b12-a276a816a1e7)

Detects authority-claiming patterns embedded in external content (documents, tool responses, web pages) that address the AI agent directly. Unlike direct prompt injection (ATR-2026-001), these use softer authority language: "Note to AI", "Attention AI", "Important for the assistant" combined with corrective directives like "the previous analysis was incorrect" or "please recalculate". These bypass ATR-2026-002's injection patterns because they don't use explicit override language — instead they leverage implied authority to redirect agent behavior. Discovered via Autoresearch v0 adversarial testing (100% miss rate, 31/31 samples).

Cluster A Galaxy A Cluster B Galaxy B Level
LLM Prompt Injection (19cd2d12-66ff-487c-a05c-e058b027efc9) MITRE ATLAS Attack Pattern Indirect Authority Claim in External Content - ATR-2026-00130 (9db7d712-d42f-5c7f-9b12-a276a816a1e7) Agent Threat Rules 1