NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Rethinking Reward Model Evaluation Through the Lens of Reward Overoptimization
Sunghwan Kim
|
Dongjin Kang
|
Taeyoon Kwon
|
Hyungjoo Chae
|
Dongha Lee
|
Jinyoung Yeo
|
Paper Details:
Month: July
Year: 2025
Location: Vienna, Austria
Venue:
ACL |
Citations
URL
No Citations Yet
https://www
https://huggingface.co/
https://huggingface.co/Skywork
https://openai.com/
https://openai.com/index/
https://github.com/OpenRLHF/OpenRLHF
https://github.com/allenai/reward-bench
https://github.com/bigcode-project/bigcode-evaluation-
https://github.com/evalplus/evalplus
Field Of Study