What Features in Prompts Jailbreak LLMs? Investigating the Mechanisms Behind Attacks

Nathalie Maria Kirch | Constantin Niko Weisser | Severin Field | Helen Yannakoudakis | Stephen Casper |

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: BlackboxNLP | WS |

Citations

URL

No Citations Yet