Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence
Junru Lu | Jiazheng Li | Siyu An | Meng Zhao | Yulan He | Di Yin | Xing Sun
Paper Details:
Month: November
Year: 2024
Location: Miami, Florida, USA
Venue: EMNLP
Citations: No Citations Yet
URLs:
https://github.com/
https://huggingface
https://github.com/LuJunru/SamPO/issues/1
https://github.com/tatsu-lab/alpaca_eval
https://github.com/tatsu-lab/alpaca_eval/
http://huggingface.co/EleutherAI/pythia-2.8b
https://huggingface.co/meta-llama/
https://huggingface.co/allenai/tulu-2-13b
http://www.inmail.com/
https://github.com/tatsu-lab/alpaca_eval/blob/main/src/alpaca_