LeTS: Learning to Think-and-Search via Process-and-Outcome Reward Hybridization

Qi Zhang | Shouqing Yang | Lirong Gao | Hao Chen | Xiaomeng Hu | Jinglei Chen | Jiexiang Wang | Sheng Guo | Bo Zheng | Haobo Wang | Junbo Zhao |

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP |