Jailbreaking

Type: technique

Description: The adversary nullifies the system prompt to bypass safeguards and subvert the application's intent.

Version: 0.1.0

Created At: 2024-12-31 14:18:56 -0500

Last Modified At: 2024-12-31 14:18:56 -0500

--> Privilege Escalation (tactic): An adversary can override system-level prompts using user-level prompts.
<-- Crescendo (technique): Sub-technique of
<-- Prompt Crafting (technique): Prompt crafting typically involves jailbreaking.
<-- Off-Target Language (technique): Sub-technique of
<-- System Instruction Keywords (technique): Sub-technique of
<-- ChatGPT and Gemini jailbreak using the Crescendo technique (procedure): The model's protection mechanisms are effectively circumvented, thus creating a jailbreak from its original safety filters.
<-- Financial Transaction Hijacking With M365 Copilot As An Insider (procedure): The exploit circumvents copilot's system instructions and provides new ones that specify how copilot should respond character-by-character and which references it should output.
<-- Copilot M365 Lures Victims Into a Phishing Site (procedure): The exploit circumvents copilot's system instructions and provides new ones that specify how copilot should respond character-by-character.

GenAI Attacks Matrix