NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Alex Warstadt
|
Aaron Mueller
|
Leshem Choshen
|
Ethan Wilcox
|
Chengxu Zhuang
|
Juan Ciro
|
Rafael Mosquera
|
Bhargavi Paranjabe
|
Adina Williams
|
Tal Linzen
|
Ryan Cotterell
|
Paper Details:
Month: December
Year: 2023
Location: Singapore
Venue:
CoNLL |
Citations
URL
No Citations Yet
https://github.com/
http://www.natcorp.ox.ac.uk
https://dumps.wikimedia.org/
https://dumps.wikimedia.org/simplewiki/20221201/
https://github.com/babylm/babylm_data_
https://dynabench.org/
https://github.com/babylm/
https://github.com/huggingface/transformers/
https://dynabench.org/tasks/baby_strict
https://dynabench.org/tasks/baby_strict_small
https://dynabench.org/tasks/baby_loose
https://dynabench.org/babylm
https://github.com/
https://github.com/bigscience-workshop/
https://github.com/phueb/BabyBERTa/blob/master/data/corpora/aochildes.txt
https://gutenberg.org/
http://opensubtitles.org/
Field Of Study