NLPExplorer

WAC - 2008

Total Papers:- 10
Total Papers across all years:- 59
Total Citations :- 0
GlossaNet 2: a linguistic search engine for RSS-based corpora
Cédrick Fairon | Kévin Macé | Hubert Naets


Introducing and evaluating ukWaC, a very large web-derived corpus of English
Adriano Ferraresi | Eros Zanchetta | Marco Baroni | Silvia Bernardini


Identification of Duplicate News Stories in Web Pages
John Gibson | Ben Wellner | Susan Lubar


Collecting Basque specialized corpora from the web: language-specific performance tweaks and improving topic precision
I. Leturia | I. San Vicente | X. Saralegi | M. Lopez de Lacalle


RoDEO: Reasoning over Dependencies Extracted Online
Reda Siblini | Leila Kosseim


Victor: the Web-Page Cleaning Tool
Miroslav Spousta | Michal Marek | Pavel Pecina


Google for the Linguist on a Budget
András Kornai | Péter Halácsy


Proceedings of the 4th Web as Corpus Workshop
Stefan Evert | Adam Kilgarriff | Serge Sharoff


Reranking Google with GReG
Rodolfo Delmonte | Marco Aldo Piccolino Boniforti


Segmenting HTML pages using visual and semantic information
Georgios Petasis | Pavlina Fragkou | Aris Theodorakos | Vangelis Karkaletsis | Constantine D. Spyropoulos


Conference Topic Distribution

Linguistic Task | Approach | Language | Dataset

Conference Citation Distribution

Conference Papers have no Citations yet
