NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
RED: Unleashing Token-Level Rewards from Holistic Feedback via Reward Redistribution
Jiahui Li
|
Lin Li
|
Tai-Wei Chang
|
Kun Kuang
|
Long Chen
|
Jun Zhou
|
Cheng Yang
|
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://github.com/PKU-Alignment/safe-rlhf
https://arxiv
https://huggingface.co/datasets/berkeley-nest/Nectar
https://huggingface.co/datasets/openai/summarize_from_feedback
https://github.com/PKU-Alignment/safe-rlhf
https://huggingface.co/datasets/tatsu-lab/alpaca
Field Of Study