NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Enhancing Efficiency and Exploration in Reinforcement Learning for LLMs
Mengqi Liao
|
Xiangyu Xi
|
Chen Ruinian
|
Jia Leng
|
Yangen Hu
|
Ke Zeng
|
Shuai Liu
|
Huaiyu Wan
|
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://pretty-radio-b75.notion.site/DeepScaleR-
https://artofproblemsolving.com/wiki/
Field Of Study