NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
AlignDistil: Token-Level Language Model Alignment as Adaptive Policy Distillation
Songming Zhang
|
Xue Zhang
|
Tong Zhang
|
Bojie Hu
|
Yufeng Chen
|
Jinan Xu
|
Paper Details:
Month: July
Year: 2025
Location: Vienna, Austria
Venue:
ACL |
Citations
URL
No Citations Yet
https://github.com/OpenRLHF/OpenRLHF
https://huggingface.co/datasets/trl-lib/tldr
https://huggingface.co/datasets/trl-lib/tldr-preference
https://github.com/tatsu-lab/alpaca_eval
Field Of Study