From General Reward to Targeted Reward: Improving Open-ended Long-context Generation Models

Zhihan Guo | Jiele Wu | Wenqian Cui | Yifei Zhang | Minda Hu | Yufei Wang | Irwin King |

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP |