Language Identification for Creating Language-Specific Twitter Collections
Shane Bergsma | Paul McNamee | Mossaab Bagdouri | Clayton Fink | Theresa Wilson
Paper Details:
Month: June
Year: 2012
Location: Montréal, Canada
Venue: WS
Citations
URL
Time-Independent and Language-Independent Extraction of Multiword Expressions From Twitter
Nikhil Londhe | Rohini Srihari | Vishrawas Gopalakrishnan
Word Level Language Identification in Online Multilingual Communication
Dong Nguyen | A. Seza Doğruöz
Exploring Demographic Language Variations to Improve Multilingual Sentiment Analysis in Social Media
Svitlana Volkova | Theresa Wilson | David Yarowsky
What Your Username Says About You
Aaron Jaech | Mari Ostendorf
Fluency detection on communication networks
Tom Lippincott | Benjamin Van Durme
Mining Parallel Corpora from Sina Weibo and Twitter
Wang Ling | Luís Marujo | Chris Dyer | Alan W. Black | Isabel Trancoso
What to do about bad language on the internet
Jacob Eisenstein
Broadly Improving User Classification via Communication-Based Name and Location Clustering on Twitter
Shane Bergsma | Mark Dredze | Benjamin Van Durme | Theresa Wilson | David Yarowsky
Using Conceptual Class Attributes to Characterize Social Media Users
Shane Bergsma | Benjamin Van Durme
Exploring Sentiment in Social Media: Bootstrapping Subjectivity Clues from Multilingual Twitter Streams
Svitlana Volkova | Theresa Wilson | David Yarowsky
I’m a Belieber: Social Roles via Self-identification and Conceptual Attributes
Charley Beller | Rebecca Knowles | Craig Harman | Shane Bergsma | Margaret Mitchell | Benjamin Van Durme
Estimating Code-Switching on Twitter with a Novel Generalized Word-Level Language Detection Technique
Shruti Rijhwani | Royal Sequiera | Monojit Choudhury | Kalika Bali | Chandra Shekhar Maddila
Incorporating Dialectal Variability for Socially Equitable Language Identification
David Jurgens | Yulia Tsvetkov | Dan Jurafsky
Automatic Detection and Language Identification of Multilingual Documents
Marco Lui | Jey Han Lau | Timothy Baldwin
Tweet Conversation Annotation Tool with a Focus on an Arabic Dialect, Moroccan Darija
Stephen Tratz | Douglas Briesch | Jamal Laoudi | Clare Voss
Code Mixing: A Challenge for Language Identification in the Language of Social Media
Utsab Barman | Amitava Das | Joachim Wagner | Jennifer Foster
Twitter Users #CodeSwitch Hashtags! #MoltoImportante #wow
David Jurgens | Stefan Dimitrov | Derek Ruths
Language variety identification in Spanish tweets
Wolfgang Maier | Carlos Gómez-Rodríguez
“ye word kis lang ka hai bhai?” Testing the Limits of Word level Language Identification
Spandana Gella | Kalika Bali | Monojit Choudhury
Experiments in Sentence Language Identification with Groups of Similar Languages
Ben King | Dragomir Radev | Steven Abney
Developing Language-tagged Corpora for Code-switching Tweets
Suraj Maharjan | Elizabeth Blair | Steven Bethard | Thamar Solorio
A Language Detection System for Short Chats in Mobile Games
Pidong Wang | Nikhil Bojja | Shivasankari Kannan
Subdialectal Differences in Sorani Kurdish
Shervin Malmasi
Language and Dialect Discrimination Using Compression-Inspired Language Models
Paul McNamee
Hierarchical Character-Word Models for Language Identification
Aaron Jaech | George Mulcaire | Shobhit Hathi | Mari Ostendorf | Noah A. Smith
A Dataset and Classifier for Recognizing Social Media English
Su Lin Blodgett | Johnny Wei | Brendan O’Connor
Iterative Language Model Adaptation for Indo-Aryan Language Identification
Tommi Jauhiainen | Heidi Jauhiainen | Krister Lindén
Convolutions Are All You Need (For Classifying Character Sequences)
Zach Wood-Doughty | Nicholas Andrews | Mark Dredze
http://mashable.com/2011/09/08/
http://apl.jhu.edu/
http://odur.let.rug.nl/vannoord/TextCat/
http://code.google.com/p/
https://github.com/saffsd/langid.py
http://trec.nist.gov/data/tweets/
http://meta.wikimedia.org/wiki/List_of_
Field Of Study
Task
Language Identification
Text Categorization
Authorship Attribution
Language
Multilingual
Chinese
English
Hindi
Urdu
Japanese
Korean
Spanish
Arabic
Dataset
News
Social Media
Twitter
Similar Papers
Natural Language Processing for Dialectical Arabic: A Survey
Abdulhadi Shoufan | Sumaya Alameri
Sentiment after Translation: A Case-Study on Arabic Social Media Posts
Mohammad Salameh | Saif Mohammad | Svetlana Kiritchenko
Bringing replication and reproduction together with generalisability in NLP: Three reproduction studies for Target Dependent Sentiment Analysis
Andrew Moore | Paul Rayson
Semi-supervised Structured Prediction with Neural CRF Autoencoder
Xiao Zhang | Yong Jiang | Hao Peng | Kewei Tu | Dan Goldwasser
A Survey of Arabic Named Entity Recognition and Classification
Khaled Shaalan