Jailbreak-Tuning: Models Efficiently Learn Jailbreak Susceptibility
Brendan Murphy | Dillon Bowen | Shahrad Mohammadzadeh | Tom Tseng | Julius Broomfield | Adam Gleave | Kellin Pelrine
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP
Citations: No Citations Yet
URL: https://github.com/AlignmentResearch/harmtune
Field Of Study