HuggingFace Unsafe Model Artifact Load Instruction - ATR-2026-00398 (ef723cb9-6361-5ab8-9dec-27973b613ce0)
Detects instructions to load or execute potentially dangerous model artifact files (pickle .pkl/.pickle, PyTorch .pt/.pth, and executable binaries .exe/.so/.dll) from HuggingFace Hub repositories. Derived from the NVIDIA garak fileformats.HF_Files probe, which downloads every file in a target HF model repo and checks for dangerous extensions. In a real attack, a malicious MCP tool can instruct an agent to call huggingface_hub.hf_hub_download() or torch.load() on a pickle file that executes arbitrary code via __reduce__-based deserialization; PyTorch's own documentation warns that pickle-based .pt/.pth files are unsafe to load from untrusted sources.

Attack vectors:
(1) A tool response saying "load model_weights.pkl using pickle.load", which exploits Python pickle's arbitrary-code execution (CVE-2019-20907 class).
(2) A tool response instructing hf_hub_download on a .pkl/.exe file from an untrusted repo.
(3) A direct prompt asking the agent to run torch.load or torch.hub.load on a file path containing a dangerous extension.

Key signals: pickle.load/loads calls; hf_hub_download combined with a dangerous extension; an explicit .pkl/.exe file path in a load/execute context; and unsafe from_pretrained with weights_only=False.
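To see why loading an untrusted pickle is inherently unsafe, here is a minimal, benign sketch of the __reduce__ mechanism the description refers to. The Payload class and the len() call are illustrative stand-ins; a real attacker would return something like (os.system, ("...",)) instead.

```python
import pickle

class Payload:
    """Hypothetical malicious object: __reduce__ tells pickle which
    callable to invoke at load time, with attacker-chosen arguments."""

    def __reduce__(self):
        # Benign stand-in for an attacker-controlled call such as os.system(...)
        return (len, ("pwned",))

blob = pickle.dumps(Payload())
result = pickle.loads(blob)  # len("pwned") executes during deserialization
print(result)  # → 5
```

Note that no method on Payload is ever called explicitly; the code runs as a side effect of deserialization itself, which is why pickle.load on a downloaded .pkl/.pt/.pth file is equivalent to executing the file.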