NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
From General Reward to Targeted Reward: Improving Open-ended Long-context Generation Models
Zhihan Guo
|
Jiele Wu
|
Wenqian Cui
|
Yifei Zhang
|
Minda Hu
|
Yufei Wang
|
Irwin King
|
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://github.com/zhihan-guo/ProxyReward/
https://www.openai
https://openai.com/
https://qwenlm
Field Of Study