LLM Prompt Self-Replication
Type: technique
Description: An adversary may use a carefully crafted LLM Prompt Injection designed to cause the LLM to replicate the prompt as part of its output. This allows the prompt to propagate to other LLMs and persist on the system.
Version: 0.1.0
Created At: 2024-12-31 14:18:56 -0500
Last Modified At: 2024-12-31 14:18:56 -0500
External References
- Here Comes The AI Worm: Unleashing Zero-click Worms that Target GenAI-Powered Applications., Stav Cohen, Ron Bitton, Ben Nassi
Related Objects
- --> Persistence (tactic): An adversary can create a prompt that propagates to other LLMs and persists on the system.
- --> Prompt Injection (technique): An adversary can craft a malicious prompt that would cause the LLM to replicate the prompt as part of its output and propagate to other LLMs.