Spectral Scaling Laws in Language Models: emphHow Effectively Do Feed-Forward Networks Use Their Latent Space?
Nandan Kumar Jha |
Brandon Reagen |
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue:
EMNLP |
No URLs Found
Field Of Study