NLPExplorer
WAC - 2014
Total Papers: 7
Total Papers across all years: 49
Total Citations: 35
{bs,hr,sr}WaC - Web Corpora of Bosnian, Croatian and Serbian
Nikola Ljubešić, Filip Klubička

The PAISÀ Corpus of Italian Web Texts
Verena Lyding, Egon Stemle, Claudia Borghetti, Marco Brunello, Sara Castagnoli, Felice Dell’Orletta, Henrik Dittmann, Alessandro Lenci, Vito Pirrelli

Finding Viable Seed URLs for Web Corpora: A Scouting Approach and Comparative Study of Available Sources
Adrien Barbaresi

Focused Web Corpus Crawling
Roland Schäfer, Adrien Barbaresi, Felix Bildhauer

Less Destructive Cleaning of Web Documents by Using Standoff Annotation
Maik Stührenberg

Some Issues on the Normalization of a Corpus of Products Reviews in Portuguese
Magali Sanches Duran, Lucas Avanço, Sandra Aluísio, Thiago Pardo, Maria da Graça Volpe Nunes

Proceedings of the 9th Web as Corpus Workshop (WaC-9)
Felix Bildhauer, Roland Schäfer
Conference Topic Distribution (chart; categories: Linguistic, Task, Approach, Language, Dataset)
Conference Citation Distribution (charts: Conference Citations, Yearwise Conference Citations)
Conference Reference Distribution (charts: Conference References, Yearwise Conference References)
Topics
Linguistic Trends
Syntax
Typology
Task
Corpus Annotation
Language Identification
Tagging
Named Entity Recognition
Summarization
Machine Translation
Biomedical
Language
Multilingual
Chinese
English
French
Dataset
News
Encyclopedia
Social Media
Web Crawl