NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
T-REG: Preference Optimization with Token-Level Reward Regularization
Wenxuan Zhou
|
Shujian Zhang
|
Lingxiao Zhao
|
Tao Meng
|
Paper Details:
Month: July
Year: 2025
Location: Vienna, Austria
Venue:
ACL |
Citations
URL
No Citations Yet
https://github.com/huggingface/
https://huggingface
https://github.com/tatsu-lab/alpaca_eval
https://github.com/EleutherAI/lm-evaluation-harness
Field Of Study