NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Thinking with DistilQwen: A Tale of Four Distilled Reasoning and Reward Model Series
Wenrui Cai
|
Chengyu Wang
|
Junbing Yan
|
Jun Huang
|
Xiangzhong Fang
|
Paper Details:
Month: November
Year: 2025
Location: Suzhou (China)
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://github.com/
https://huggingface.co/datasets
https://modelscope.cn/datasets
https://huggingface.co/datasets/
https://huggingface.co/datasets/
https://huggingface.co/deepseek-ai/
https://huggingface.co/Qwen/QwQ-32B
https://qwenlm.github.io/blog/qwen3/
https://artofproblemsolving.com/wiki/
https://huggingface.co/datasets/
https://huggingface.co/open-thoughts/
https://huggingface.co/datasets/
Field Of Study