NLPExplorer

WAC - 2014

Total Papers:- 7
Total Papers across all years:- 59
Total Citations:- 35
{bs,hr,sr}WaC - Web Corpora of Bosnian, Croatian and Serbian
Nikola Ljubešić | Filip Klubička


The PAISÀ Corpus of Italian Web Texts
Verena Lyding | Egon Stemle | Claudia Borghetti | Marco Brunello | Sara Castagnoli | Felice Dell’Orletta | Henrik Dittmann | Alessandro Lenci | Vito Pirrelli


Finding Viable Seed URLs for Web Corpora: A Scouting Approach and Comparative Study of Available Sources
Adrien Barbaresi


Focused Web Corpus Crawling
Roland Schäfer | Adrien Barbaresi | Felix Bildhauer


Less Destructive Cleaning of Web Documents by Using Standoff Annotation
Maik Stührenberg


Some Issues on the Normalization of a Corpus of Products Reviews in Portuguese
Magali Sanches Duran | Lucas Avanço | Sandra Aluísio | Thiago Pardo | Maria da Graça Volpe Nunes


Proceedings of the 9th Web as Corpus Workshop (WaC-9)
Felix Bildhauer | Roland Schäfer


Conference Topic Distribution (chart; tabs: Linguistic, Task, Approach, Language, Dataset)

Conference Citation Distribution (chart; tabs: Conference Citations, Yearwise Conference Citations)

Conference Reference Distribution (chart; tabs: Conference References, Yearwise Conference References)

Topics

Linguistic Trends
Syntax | Typology
Task
Corpus Annotation | Language Identification | Tagging | Named Entity Recognition | Summarization | Machine Translation | Biomedical
Language
Multilingual | Chinese | English | French
Dataset
News | Encyclopedia | Social Media | Web Crawl