NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Language ID in the Wild: Unexpected Challenges on the Path to a Thousand-Language Web Text Corpus
Isaac Caswell
|
Theresa Breiner
|
Daan van Esch
|
Ankur Bapna
|
Paper Details:
Month: December
Year: 2020
Location: Barcelona, Spain (Online)
Venue:
COLING |
Citations
URL
No Citations Yet
https://github.com/google-research-datasets/TF-IDF-IIF-top100-wordlists
https://github.com/google/corpuscrawler
https://slate.com/culture/2020/05/
http://www.dldp.eu/en/content/digital-language-survival-kit
Field Of Study