Safety Alignment via Constrained Knowledge Unlearning

Zesheng Shi | Yucheng Zhou | Jing Li | Yuxin Jin | Yu Li | Daojing He | Fangming Liu | Saleh Alharbi | Jun Yu | Min Zhang |

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL |