From Reasoning to Answer: Empirical, Attention-Based and Mechanistic Insights into Distilled DeepSeek R1 Models

Jue Zhang | Qingwei Lin | Saravan Rajmohan | Dongmei Zhang |

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP |