Initialization of Large Language Models via Reparameterization to Mitigate Loss Spikes
Kosuke Nishida | Kyosuke Nishida | Kuniko Saito
Paper Details:
Month: November
Year: 2024
Location: Miami, Florida, USA
Venue: EMNLP
Citations: No Citations Yet
URLs:
https://github.com/mcdm/CommitmentBank/
https://pytorch.org/
https://github.com/huggingface/transformers
https://github.com/mosaicml/llm-foundry
Field Of Study