Towards Reward Fairness in RLHF: From a Resource Allocation Perspective
Sheng Ouyang | Yulan Hu | Ge Chen | Qingyang Li | Fuzheng Zhang | Yong Liu
Paper Details:
Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL
Citations: No Citations Yet
URLs:
https://github.com/
https://github.com/nyu-mll/crows-pairs/
https://huggingface.co/datasets/Anthropic/hh-rlhf
https://huggingface.co/datasets/HuggingFaceH4/ultrafeedback_binarized
https://huggingface.co/datasets/stanfordnlp/SHP
https://github.com/ContextualAI/HALOs/
https://huggingface.co/RLHFlow/LLaMA3-SFT
https://github.com/RLHFlow/Online-RLHF/
https://huggingface.co/datasets/RLHFlow/RLHFlow-SFT-Dataset-ver2