NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
MPO: Multilingual Safety Alignment via Reward Gap Optimization
Weixiang Zhao
|
Yulin Hu
|
Yang Deng
|
Tongtong Wu
|
Wenxuan Zhang
|
Jiahe Guo
|
An Zhang
|
Yanyan Zhao
|
Bing Qin
|
Tat-Seng Chua
|
Ting Liu
|
Paper Details:
Month: July
Year: 2025
Location: Vienna, Austria
Venue:
ACL |
Citations
URL
No Citations Yet
https://github.com/circle-hit/MPO
https://huggingface.co/datasets/
https://huggingface.co/datasets/shi3z/
https://huggingface.co/datasets/StudentLLM/
https://huggingface.co/spaces/QCRI/
https://github.com/HIT-SCIR/huozi
https://huggingface.co/datasets/openai/MMMLU
https://huggingface.co/datasets/juletxara/
Field Of Study