NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
What Features in Prompts Jailbreak LLMs? Investigating the Mechanisms Behind Attacks
Nathalie Maria Kirch
|
Constantin Niko Weisser
|
Severin Field
|
Helen Yannakoudakis
|
Stephen Casper
|
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue:
BlackboxNLP |
WS |
Citations
URL
No Citations Yet
https://github.com/NLie2/jailbreak-features
Field Of Study