Spotlighting
Type: mitigation
Description: A defense mechanism that uses prompt engineering techniques to avoid indirect or direct prompt injection by highlighting the query
Version: 0.1.0
Created At: 2024-12-31 14:18:56 -0500
Last Modified At: 2024-12-31 14:18:56 -0500
External References
Related Objects
- --> ChatGPT (platform): Evaluation of the above mitigation strategies leveraged GPT 3.5 and GPT 4.
- <-- Prompt Injection (technique): By spotlighting in prompts, the LLM focuses on a specific part of the query that defines the task, thus avoiding other injected tasks.