Enhancing Efficiency and Exploration in Reinforcement Learning for LLMs

Mengqi Liao | Xiangyu Xi | Chen Ruinian | Jia Leng | Yangen Hu | Ke Zeng | Shuai Liu | Huaiyu Wan |

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP |