Initialization of Large Language Models via Reparameterization to Mitigate Loss Spikes

Kosuke Nishida | Kyosuke Nishida | Kuniko Saito

Paper Details:

Month: November
Year: 2024
Location: Miami, Florida, USA
Venue: EMNLP