NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
DPL: Diverse Preference Learning Without A Reference Model
Abhijnan Nath
|
Andrey Volozin
|
Saumajit Saha
|
Albert Aristotle Nanda
|
Galina Grunin
|
Rahul Bhotika
|
Nikhil Krishnaswamy
|
Paper Details:
Month: April
Year: 2025
Location: Albuquerque, New Mexico
Venue:
NAACL |
Citations
URL
No Citations Yet
https://huggingface.co/datasets/argilla/
https://github.com/openai/
https://pypi.org/project/editdistance/
https://huggingface.co/microsoft/
https://huggingface.co/docs/trl/en/sft_
https://github.com/
https://huggingface.co/docs/trl/en/index
https://crfm
https://github
Field Of Study