Generative Reward Modeling via Synthetic Criteria Preference Learning

Xiaobo Liang | Haoke Zhang | Juntao Li | Kehai Chen | Qiaoming Zhu | Min Zhang |

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL |