Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing

Jiabao Ji | Bairu Hou | Alexander Robey | George J. Pappas | Hamed Hassani | Yang Zhang | Eric Wong | Shiyu Chang

Paper Details:

Month: December
Year: 2025
Location: Mumbai, India
Venue: IJCNLP | AACL