Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing
Jiabao Ji | Bairu Hou | Alexander Robey | George J. Pappas | Hamed Hassani | Yang Zhang | Eric Wong | Shiyu Chang
Paper Details:
Month: December
Year: 2025
Location: Mumbai, India
Venue: IJCNLP | AACL
Citations: No Citations Yet
URL:
https://github.com/
https://github.com/patrickrchao/
https://jailbreakbench.github.io
https://github.com/google-research/
https://github.com/tatsu-lab/alpaca_eval/
https://allenai.org/data/open-book-qa
https://leaderboard.allenai.org/physicaliqa/
https://huggingface.co/datasets/
https://huggingface.co/lmsys/vicuna-13b-v1.5
https://huggingface.co/meta-llama/
https://github.com/llm-attacks/llm-attacks
https://github.com/tml-epfl/
https://github.com/alexandrasouly/
https://github.com/neelsjain/baseline-defenses.git
https://github.com/google-research/google-
https://github.com/tatsu-lab/alpaca_eval
https://mturk.com/
Field Of Study