WPO: Enhancing RLHF with Weighted Preference Optimization
Wenxuan Zhou | Ravi Agrawal | Shujian Zhang | Sathish Reddy Indurthi | Sanqiang Zhao | Kaiqiang Song | Silei Xu | Chenguang Zhu
Paper Details:
Month: November
Year: 2024
Location: Miami, Florida, USA
Venue: EMNLP
Citations: No Citations Yet
URL:
https://github.com/wzhouad/WPO
https://github.com/huggingface/
https://huggingface.co/HuggingFaceH4/
https://huggingface.co/datasets/
https://huggingface.co/
https://github.com/tatsu-lab/alpaca_eval
https://github.com/EleutherAI/lm-evaluation-harness
Field Of Study