NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
HAF-RM: A Hybrid Alignment Framework for Reward Model Training
Shujun Liu
|
Xiaoyu Shen
|
Yuhang Lai
|
Siyuan Wang
|
Shengbin Yue
|
Zengfeng Huang
|
Xuanjing Huang
|
Zhongyu Wei
|
Paper Details:
Month: July
Year: 2025
Location: Vienna, Austria
Venue:
ACL |
Citations
URL
No Citations Yet
https://haf-
https://github.com/tatsu-lab/alpaca_eval
Field Of Study