AdaSteer: Your Aligned LLM is Inherently an Adaptive Jailbreak Defender

Weixiang Zhao | Jiahe Guo | Yulin Hu | Yang Deng | An Zhang | Xingyu Sui | Xinyang Han | Yanyan Zhao | Bing Qin | Tat-Seng Chua | Ting Liu |

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP |