MuTIS: Enhancing Reasoning Efficiency through Multi Turn Intervention Sampling in Reinforcement Learning

Wenshuo Zhao | Haoxing Zhai | Xinyu Qiu | Zhenting Qi | Shuhe Li | Linchao Zhu

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP