Crescendo

Type: technique

Description: The adversary interacts with the model through a series of mostly benign prompts, incrementally steering it toward a desired task without ever mentioning that task explicitly, by building on the model's own earlier outputs.

Version: 0.1.0

Created At: 2024-12-31 14:18:56 -0500

Last Modified At: 2024-12-31 14:18:56 -0500


External References