Crescendo

Type: technique

Description: The adversary interacts with the model over multiple turns using mostly benign prompts, incrementally steering it toward a target task that is never stated explicitly, by building on the model's own earlier outputs.

Version: 0.1.0

Created At: 2024-10-11 16:54:32 +0300

Last Modified At: 2024-10-11 16:54:32 +0300


External References