Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking

Junda Zhu | Lingyong Yan | Shuaiqiang Wang | Dawei Yin | Lei Sha |

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP |