A Trainable Tokenizer, solution for multilingual texts and compound expression tokenization
Oana Frunza
Paper Details:
Year: 2008
Venue: LREC
Citations: No Citations Yet
URLs:
http://aune.lpl.univ-aix.fr/projects/multext/MtSeg/
http://balie.sourceforge.net/
http://nl.ijs.si/ME/V3/doc/
http://www.cs.waikato.ac.nz/ml/weka/
Field Of Study
Task: Tagging, Information Retrieval
Language: English, French
Similar Papers
Indicative Tweet Generation: An Extractive Summarization Problem?
Priya Sidhaye, Jackie Chi Kit Cheung
Modeling Speech Acts in Asynchronous Conversations: A Neural-CRF Approach
Shafiq Joty, Tasnim Mohiuddin
A Corpus of Corporate Annual and Social Responsibility Reports: 280 Million Tokens of Balanced Organizational Writing
Sebastian G.M. Händschke, Sven Buechel, Jan Goldenstein, Philipp Poschmann, Tinghui Duan, Peter Walgenbach, Udo Hahn
TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
Kalina Bontcheva, Leon Derczynski, Adam Funk, Mark Greenwood, Diana Maynard, Niraj Aswani
Argumentation Mining in User-Generated Web Discourse
Ivan Habernal, Iryna Gurevych