What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

Ming Li | Yanhong Li | Tianyi Zhou |

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL |