Intersecting Register and Genre: Understanding the Contents of Web-Crawled Corpora
Authors: Amanda Myntti, Liina Repo, Elian Freyermuth, Antti Kanner, Veronika Laippala, Erik Henriksson
Paper Details:
Month: November
Year: 2024
Location: Miami, USA
Venue: NLP4DH (WS)
Citations: No citations yet
URLs:
https://github.com/
https://huggingface.co/datasets/
https://commoncrawl.org
https://huggingface.co/datasets/TurkuNLP/
https://radimrehurek.com/gensim/models/
https://www.nltk.org/
https://scikit-learn.org/stable/modules/
https://github.com/cleanlab/cleanlab
https://openai.com/chatgpt/
https://laion.ai/
https://www.ontocord.ai/