CommonCOW: Massively Huge Web Corpora from CommonCrawl Data and a Method to Distribute them Freely under Restrictive EU Copyright Laws
Roland Schäfer
Paper Details:
Month: May
Year: 2016
Location: Portorož, Slovenia
Venue: LREC
Citations: No Citations Yet
URL:
http://corporafromtheweb.org/
http://texrex.sourceforge.net/
http://slurm.schedmd.com/
https://hadoop.apache.org/
https://commoncrawl.org/
http://aws.amazon.com/de/
https://en.wikipedia.org/
http://creativecommons.org/licenses/by/
http://www.crummy.com/software/
https://code.google.com/archive/p/
Field Of Study
Linguistic Trends: Morphology
Task: Morphological Analysis, Named Entity Recognition
Language: English, Spanish, French
Similar Papers
Coordination and context-dependence in the generation of embodied conversation
Justine Cassell | Matthew Stone | Hao Yan
Ends-based Dialogue Processing
Jan Alexandersson | Tilman Becker | Ralf Engel | Markus Löckelt | Elsa Pecourt | Peter Poller | Norbert Pfleger | Norbert Reithinger
Gesture Theory is Linguistics: On Modelling Multimodality as Prosody
Dafydd Gibbon
The OTIM Formal Annotation Model: A Preliminary Step before Annotation Scheme
Philippe Blache | Roxane Bertrand | Mathilde Guardiola | Marie-Laure Guénot | Christine Meunier | Irina Nesterenko | Berthille Pallaud | Laurent Prévot | Béatrice Priego-Valverde | Stéphane Rauzy
THE INTONATIONAL STRUCTURING OF DISCOURSE
Julia Hirschberg | Janet Pierrehumbert