Gradient-Adaptive Policy Optimization: Towards Multi-Objective Alignment of Large Language Models

Chengao Li | Hanyu Zhang | Yunkun Xu | Hongyan Xue | Xiang Ao | Qing He |

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL |