NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Instantly Learning Preference Alignment via In-context DPO
Feifan Song
|
Yuxuan Fan
|
Xin Zhang
|
Peiyi Wang
|
Houfeng Wang
|
Paper Details:
Month: April
Year: 2025
Location: Albuquerque, New Mexico
Venue:
NAACL |
Citations
URL
No Citations Yet
https://github.com/tatsu-lab/alpaca_eval
https://github
https://huggingface.co/sentence-transformers/all-mpnet-base-v2
https://huggingface.co/OpenAssistant/oasst-rm-2-pythia-6.9b-epoch-1
https://huggingface.co/datasets/HuggingFaceH4/ultrafeedback_binarized
Field Of Study