NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
SoMaJo: State-of-the-art tokenization for German web and social media texts
Thomas Proisl
|
Peter Uhrig
|
Paper Details:
Month: August
Year: 2016
Location: Berlin
Venue:
WAC |
WS |
SIG: SIGWAC
Citations
URL
SoMeWeTa: A Part-of-Speech Tagger for German Social Media and Web Texts
Thomas Proisl
|
A corpus of German political speeches from the 21st century
Adrien Barbaresi
|
EmpiriST 2015: A Shared Task on the Automatic Linguistic Annotation of Computer-Mediated Communication and Web Corpora
Michael Beißwenger
|
Sabine Bartsch
|
Stefan Evert
|
Kay-Michael Würzner
|
An Unsupervised Morphological Criterion for Discriminating Similar Languages
Adrien Barbaresi
|
Discriminating between Similar Languages using Weighted Subword Features
Adrien Barbaresi
|
Computationally efficient discrimination between language varieties with large feature vectors and regularized classifiers
Adrien Barbaresi
|
EmotiKLUE at IEST 2018: Topic-Informed Classification of Implicit Emotions
Thomas Proisl
|
Philipp Heinrich
|
Besim Kabashi
|
Stefan Evert
|
https://pypi.python.org/pypi/SoMaJo
https://www.cis.upenn.edu/~treebank/
https://opennlp.apache.org/
http://www.regular-expressions.info/
http://www.chatvongesternnacht.de
http://c2.com/cgi/wiki?WikiWikiWeb
https://de.wiktionary.org/
https://sites.google.com/site/
Field Of Study
Linguistic Trends
Syntax
Task
Tagging
Language
English
Dataset
Social Media
Twitter
Similar Papers
A Corpus of Corporate Annual and Social Responsibility Reports: 280 Million Tokens of Balanced Organizational Writing
Sebastian G.M. Händschke
|
Sven Buechel
|
Jan Goldenstein
|
Philipp Poschmann
|
Tinghui Duan
|
Peter Walgenbach
|
Udo Hahn
|
Who cares about Sarcastic Tweets? Investigating the Impact of Sarcasm on Sentiment Analysis.
Diana Maynard
|
Mark Greenwood
|
Indicative Tweet Generation: An Extractive Summarization Problem?
Priya Sidhaye
|
Jackie Chi Kit Cheung
|
TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
Kalina Bontcheva
|
Leon Derczynski
|
Adam Funk
|
Mark Greenwood
|
Diana Maynard
|
Niraj Aswani
|
Argumentation Mining in User-Generated Web Discourse
Ivan Habernal
|
Iryna Gurevych
|