NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging
Tzu-Han Lin
|
Chen-An Li
|
Hung-yi Lee
|
Yun-Nung Chen
|
Paper Details:
Month: November
Year: 2024
Location: Miami, Florida, USA
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://github.com/MiuLab/DogeRM
https://www.flaticon.com/
https://github.com/argilla-io/
https://huggingface
https://huggingface.co/datasets/Dahoas/
https://github.com/tatsu-lab/alpaca_eval
https://beta.openai.com/docs/models/gpt-3
https://huggingface.co/datasets/
https://github.com/huggingface/trl
https://github.com/bigcode-project/
https://huggingface.co/datasets/argilla/
https://www.notion.so/
Field Of Study