NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment
Yuu Jinnai
|
Tetsuro Morimura
|
Kaito Ariu
|
Kenshi Abe
|
Paper Details:
Month: April
Year: 2025
Location: Albuquerque, New Mexico
Venue:
NAACL |
Citations
URL
No Citations Yet
https://github.com/Cyber
https://github.com/huggingface/trl
https://gi
https://huggingface.co/datasets/tatsu-l
https://huggingface.co/datasets/Anthropic/
https://github.com/wmt-conference/wm
https://huggingface.co
https://huggingface.co/databricks/doll
https://huggingface.co/EleutherAI/py
https://huggingface.co/EleutherAI/py
https://huggingface.co/facebook/wmt21-den
https://huggingface.co/stanfordnlp/S
https://huggingface.co/stanfordnlp/S
https://huggingface.co/OpenAssistant/rew
https://huggingface.co/llm-blender/PairR
https://huggingface.co/openbmb/Eurus-R
https://huggingface.co/sentence-transform
https://huggingface.co/Unbabel/wmt20-com
Field Of Study