MPO: Multilingual Safety Alignment via Reward Gap Optimization

Weixiang Zhao | Yulin Hu | Yang Deng | Tongtong Wu | Wenxuan Zhang | Jiahe Guo | An Zhang | Yanyan Zhao | Bing Qin | Tat-Seng Chua | Ting Liu |

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL |