An Efficient and Precise Training Data Construction Framework for Process-supervised Reward Model in Mathematical Reasoning
Wei Sun |
Qianlong Du |
Fuwei Cui |
Jiajun Zhang |
Paper Details:
Month: July
Year: 2025
Location: Vienna, Austria
Venue:
ACL |