NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Cleaneval: a Competition for Cleaning Web Pages
Marco Baroni
|
Francis Chantree
|
Adam Kilgarriff
|
Serge Sharoff
|
Paper Details:
Year: 2008
Venue:
LREC |
Citations
URL
A modular open-source focused crawler for mining monolingual and bilingual corpora from the web
Vassilis Papavassiliou
|
Prokopis Prokopidis
|
Gregor Thurmair
|
http://cleaneval.sigwac.org.uk/
http://notepad-plus.sourceforge.net/
Field Of Study
Task
Language Generation
Language
Chinese
English
Dataset
News
Similar Papers
Character-Aware Neural Networks for Arabic Named Entity Recognition for Social Media
Mourad Gridach
|
Expectation-Regulated Neural Model for Event Mention Extraction
Ching-Yun Chang
|
Zhiyang Teng
|
Yue Zhang
|
A Survey of Arabic Named Entity Recognition and Classification
Khaled Shaalan
|
Bringing replication and reproduction together with generalisability in NLP: Three reproduction studies for Target Dependent Sentiment Analysis
Andrew Moore
|
Paul Rayson
|
A Joint Model of Conversational Discourse Latent Topics on Microblogs
Jing Li
|
Yan Song
|
Zhongyu Wei
|
Kam-Fai Wong
|