Improving Reasoning Capabilities in Small Models through Mixture-of-layers Distillation with Stepwise Attention on Key Information
Yao Chen |
Jiawei Sheng |
Wenyuan Zhang |
Tingwen Liu |
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue:
EMNLP |
No URLs Found
Field Of Study