T-REG: Preference Optimization with Token-Level Reward Regularization

Wenxuan Zhou | Shujian Zhang | Lingxiao Zhao | Tao Meng |

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL |