TreeRL: LLM Reinforcement Learning with On-Policy Tree Search

Zhenyu Hou | Ziniu Hu | Yujiang Li | Rui Lu | Jie Tang | Yuxiao Dong

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL
