Poison Training Data

Type: technique

Description: Adversaries may attempt to poison datasets used by an ML model by modifying the underlying data or its labels. This allows the adversary to embed vulnerabilities in ML models trained on the data that may not be easily detectable. Data poisoning attacks may or may not require modifying the labels. The embedded vulnerability is activated at a later time by data samples containing a backdoor trigger.
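
As a minimal, hypothetical sketch of the label-modifying variant (the function name `poison_dataset`, the 3x3 white trigger patch, and the stand-in data are illustrative assumptions, not part of this technique record), the following shows how a small fraction of training images could be stamped with a backdoor trigger and relabeled to an attacker-chosen class, so that a model trained on the data learns to associate the trigger with that class:

```python
import numpy as np

def poison_dataset(images: np.ndarray,
                   labels: np.ndarray,
                   target_label: int,
                   poison_fraction: float = 0.05,
                   seed: int = 0):
    """Return copies of (images, labels) with a backdoor trigger stamped
    onto a random subset of samples, relabeled to target_label.

    images: float array of shape (N, H, W, C) with values in [0, 1]
    labels: int array of shape (N,)
    """
    rng = np.random.default_rng(seed)
    images = images.copy()
    labels = labels.copy()

    n_poison = int(len(images) * poison_fraction)
    idx = rng.choice(len(images), size=n_poison, replace=False)

    # Hypothetical trigger: a 3x3 white patch in the bottom-right corner.
    images[idx, -3:, -3:, :] = 1.0
    # Label modification step; a poisoning attack that leaves labels
    # untouched would simply skip this line.
    labels[idx] = target_label
    return images, labels, idx

# Example usage with random stand-in data (10 classes, 32x32 RGB images).
X = np.random.rand(1000, 32, 32, 3).astype(np.float32)
y = np.random.randint(0, 10, size=1000)
X_poisoned, y_poisoned, poisoned_idx = poison_dataset(X, y, target_label=7)
```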

Poisoned data can be introduced via an ML supply chain compromise, or the data may be poisoned after the adversary gains initial access to the system.
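
A hedged sketch of the second path (poisoning after initial access), assuming the victim periodically retrains from a writable CSV file; the path `data/training_set.csv`, the column names, and the crafted feature values are purely illustrative assumptions:

```python
import csv
from pathlib import Path

# Hypothetical location of the victim's training data; illustrative only.
TRAINING_CSV = Path("data/training_set.csv")

# Crafted rows whose feature pattern acts as a trigger, all labeled "benign"
# so that future inputs carrying the same pattern are misclassified.
poisoned_rows = [
    {"feature_a": 0.99, "feature_b": 0.01, "label": "benign"},
    {"feature_a": 0.98, "feature_b": 0.02, "label": "benign"},
]

def append_poisoned_rows(csv_path: Path, rows: list) -> None:
    """Append crafted samples to an existing training CSV in place."""
    with csv_path.open("a", newline="") as fh:
        writer = csv.DictWriter(fh, fieldnames=["feature_a", "feature_b", "label"])
        writer.writerows(rows)

if TRAINING_CSV.exists():
    append_poisoned_rows(TRAINING_CSV, poisoned_rows)
```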

Version: 0.1.0

Created At: 2025-03-04 10:27:40 -0500

Last Modified At: 2025-03-04 10:27:40 -0500


Related Tactics

  • --> Resource Development (tactic): Introducing malicious alterations to training data to influence or degrade machine learning model performance.
  • --> Persistence (tactic): Injecting malicious data into training datasets to establish long-term influence over machine learning models.