Improving Reasoning Capabilities in Small Models through Mixture-of-layers Distillation with Stepwise Attention on Key Information

Yao Chen | Jiawei Sheng | Wenyuan Zhang | Tingwen Liu |

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP |

Citations

URL

No Citations Yet

No URLs Found

Field Of Study