NLPExplorer
Data, Data Everywhere: A Guide for Pretraining Dataset Construction
Jupinder Parmar | Shrimai Prabhumoye | Joseph Jennings | Bo Liu | Aastha Jhunjhunwala | Zhilin Wang | Mostofa Patwary | Mohammad Shoeybi | Bryan Catanzaro
Paper Details:
Month: November
Year: 2024
Location: Miami, Florida, USA
Venue: EMNLP
Citations: No Citations Yet
URL:
https://perspectiveapi.com/
https://www.kaggle.com/c/jigsaw-toxic-comment-
https://www.kaggle.com/competitions/jigsaw-
https://cloud.google.com/natural-
https://pypi.org/project/Wikipedia-API/
https://huggingface.co/nvidia/domain-classifier
https://www.surgehq.ai/
Field Of Study