NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Michael Y. Hu
|
Aaron Mueller
|
Candace Ross
|
Adina Williams
|
Tal Linzen
|
Chengxu Zhuang
|
Ryan Cotterell
|
Leshem Choshen
|
Alex Warstadt
|
Ethan Gotlieb Wilcox
|
Paper Details:
Month: November
Year: 2024
Location: Miami, FL, USA
Venue:
CoNLL |
BabyLM |
WS |
Citations
URL
No Citations Yet
https://osf.io/ad7qg/
https://dumps.wikimedia.org/simplewiki/
https://github.com/babylm/babylm_data_
https://github.com/babylm/
https://github.com/huggingface/transformers/
https://github.com/phueb/BabyBERTa/blob/master/data/corpora/aochildes.txt
https://gutenberg.org/
http://opensubtitles.org/
Field Of Study