Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
Zhexin Zhang | Junxiao Yang | Pei Ke | Fei Mi | Hongning Wang | Minlie Huang
Paper Details:
Month: August
Year: 2024
Location: Bangkok, Thailand
Venue: ACL
Citations: No Citations Yet
URL:
https://github.com/thu-coai/
https://github.com/tatsu-lab/alpaca_eval
Field Of Study