NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Pruning for Protection: Increasing Jailbreak Resistance in Aligned LLMs Without Fine-Tuning
Adib Hasan
|
Ileana Rugina
|
Alex Wang
|
Paper Details:
Month: November
Year: 2024
Location: Miami, Florida, US
Venue:
BlackboxNLP |
WS |
Citations
URL
No Citations Yet
https://huggingface.co/spaces/
https://openai
Field Of Study