NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Don’t Forget Your Reward Values: Language Model Alignment via Value-based Calibration
Xin Mao
|
Feng-Lin Li
|
Huimin Xu
|
Wei Zhang
|
Wang Chen
|
Anh Tuan Luu
|
Paper Details:
Month: November
Year: 2024
Location: Miami, Florida, USA
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://huggingface.co/datasets/Anthropic/
https://huggingface.co/datasets/CarperAI/
https://huggingface.co/datasets/CarperAI/
https://huggingface.co/datasets/cnn_dailymail
https://huggingface.co/OpenAssistant/
https://github.com/MaoXinn/VCB/
Field Of Study