NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Labeling the Languages of Words in Mixed-Language Documents using Weakly Supervised Methods
Ben King
|
Steven Abney
|
Paper Details:
Month: June
Year: 2013
Location: Atlanta, Georgia
Venue:
NAACL |
Citations
URL
Challenges of language technologies for the indigenous languages of the Americas
Manuel Mager
|
Ximena Gutierrez-Vasques
|
Gerardo Sierra
|
Ivan Meza-Ruiz
|
Word Level Language Identification in Online Multilingual Communication
Dong Nguyen
|
A. Seza Doğruöz
|
POS Tagging of English-Hindi Code-Mixed Social Media Content
Yogarshi Vyas
|
Spandana Gella
|
Jatin Sharma
|
Kalika Bali
|
Monojit Choudhury
|
A Fast, Compact, Accurate Model for Language Identification of Codemixed Text
Yuan Zhang
|
Jason Riesa
|
Daniel Gillick
|
Anton Bakalov
|
Jason Baldridge
|
David Weiss
|
LanideNN: Multilingual Language Identification on Text Stream
Tom Kocmi
|
Ondřej Bojar
|
Combining Lightly-Supervised Text Classification Models for Accurate Contextual Advertising
Yiping Jin
|
Dittaya Wanvarie
|
Phu Le
|
An Arabic-Moroccan Darija Code-Switched Corpus
Younes Samih
|
Wolfgang Maier
|
Inferring latent attributes of Twitter users with label regularization
Ehsan Mohammady Ardehaly
|
Aron Culotta
|
Learning Polylingual Topic Models from Code-Switched Social Media Documents
Nanyun Peng
|
Yiming Wang
|
Mark Dredze
|
Estimating Code-Switching on Twitter with a Novel Generalized Word-Level Language Detection Technique
Shruti Rijhwani
|
Royal Sequiera
|
Monojit Choudhury
|
Kalika Bali
|
Chandra Shekhar Maddila
|
Automatic Detection and Language Identification of Multilingual Documents
Marco Lui
|
Jey Han Lau
|
Timothy Baldwin
|
Finding Viable Seed URLs for Web Corpora: A Scouting Approach and Comparative Study of Available Sources
Adrien Barbaresi
|
Code Mixing: A Challenge for Language Identification in the Language of Social Media
Utsab Barman
|
Amitava Das
|
Joachim Wagner
|
Jennifer Foster
|
Predicting Code-switching in Multilingual Communication for Immigrant Communities
Evangelos Papalexakis
|
Dong Nguyen
|
A. Seza Doğruöz
|
Overview for the First Shared Task on Language Identification in Code-Switched Data
Thamar Solorio
|
Elizabeth Blair
|
Suraj Maharjan
|
Steven Bethard
|
Mona Diab
|
Mahmoud Ghoneim
|
Abdelati Hawwari
|
Fahad AlGhamdi
|
Julia Hirschberg
|
Alison Chang
|
Pascale Fung
|
Word-level Language Identification using CRF: Code-switching Shared Task Report of MSR India System
Gokul Chittaranjan
|
Yogarshi Vyas
|
Kalika Bali
|
Monojit Choudhury
|
The CMU Submission for the Shared Task on Language Identification in Code-Switched Data
Chu-Cheng Lin
|
Waleed Ammar
|
Lori Levin
|
Chris Dyer
|
Language Identification in Code-Switching Scenario
Naman Jain
|
Riyaz Ahmad Bhat
|
The IUCL+ System: Word-Level Language Identification via Extended Markov Models
Levi King
|
Eric Baucom
|
Timur Gilmanov
|
Sandra Kübler
|
Dan Whyatt
|
Wolfgang Maier
|
Paul Rodrigues
|
Mixed Language and Code-Switching in the Canadian Hansard
Marine Carpuat
|
DCU-UVT: Word-Level Language Classification with Code-Mixed Data
Utsab Barman
|
Joachim Wagner
|
Grzegorz Chrupała
|
Jennifer Foster
|
Adapting Predicate Frames for Urdu PropBanking
Riyaz Ahmad Bhat
|
Naman Jain
|
Ashwini Vaidya
|
Martha Palmer
|
Tafseer Ahmed Khan
|
Dipti Misra Sharma
|
James Babani
|
Subsegmental language detection in Celtic language text
Akshay Minocha
|
Francis Tyers
|
“ye word kis lang ka hai bhai?” Testing the Limits of Word level Language Identification
Spandana Gella
|
Kalika Bali
|
Monojit Choudhury
|
Identifying Languages at the Word Level in Code-Mixed Indian Social Media Text
Amitava Das
|
Björn Gambäck
|
Experiments in Sentence Language Identification with Groups of Similar Languages
Ben King
|
Dragomir Radev
|
Steven Abney
|
Developing Language-tagged Corpora for Code-switching Tweets
Suraj Maharjan
|
Elizabeth Blair
|
Steven Bethard
|
Thamar Solorio
|
POS Tagging of Hindi-English Code Mixed Text from Social Media: Some Machine Learning Experiments
Royal Sequiera
|
Monojit Choudhury
|
Kalika Bali
|
Leveraging Data-Driven Methods in Word-Level Language Identification for a Multilingual Alpine Heritage Corpus
Ada Wan
|
Simple Tools for Exploring Variation in Code-switching for Linguists
Gualberto A. Guzman
|
Jacqueline Serigos
|
Barbara E. Bullock
|
Almeida Jacqueline Toribio
|
Accurate Pinyin-English Codeswitched Language Identification
Meng Xuan Xia
|
Jackie Chi Kit Cheung
|
Codeswitching language identification using Subword Information Enriched Word Vectors
Meng Xuan Xia
|
Hierarchical Character-Word Models for Language Identification
Aaron Jaech
|
George Mulcaire
|
Shobhit Hathi
|
Mari Ostendorf
|
Noah A. Smith
|
A Dataset and Classifier for Recognizing Social Media English
Su Lin Blodgett
|
Johnny Wei
|
Brendan O’Connor
|
Towards Normalising Konkani-English Code-Mixed Social Media Text
Akshata Phadte
|
Gaurish Thakkar
|
Language Informed Modeling of Code-Switched Text
Khyathi Chandu
|
Thomas Manzini
|
Sumeet Singh
|
Alan W. Black
|
Automatic Token and Turn Level Language Identification for Code-Switched Text Dialog: An Analysis Across Language Pairs and Corpora
Vikram Ramanarayanan
|
Robert Pugh
|
Language Identification in Code-Mixed Data using Multichannel Neural Networks and Context Capture
Soumil Mandal
|
Anil Kumar Singh
|
Word-level Language Identification in Bi-lingual Code-switched Texts
Harsh Jhamtani
|
Suleep Kumar Bhogi
|
Vaskar Raychoudhury
|
http://www-personal.umich.edu/
http://www.facebook.com/taole
http://www.unicode.org/udhr/
http://meta.wikimedia.org/wiki/List
http://www.watchtower.org
http://mallet.cs.umass.edu
Field Of Study
Task
Language Identification
Named Entity Recognition
Information Retrieval
Approach
Generative Model
Unsupervised Learning
Language
Multilingual
English
Similar Papers
Enriching Word Vectors with Subword Information
Piotr Bojanowski
|
Edouard Grave
|
Armand Joulin
|
Tomas Mikolov
|
Natural Language Processing for Dialectical Arabic: A Survey
Abdulhadi Shoufan
|
Sumaya Alameri
|
Creating a Large Multi-Layered Representational Repository of Linguistic Code Switched Arabic Data
Mona Diab
|
Mahmoud Ghoneim
|
Abdelati Hawwari
|
Fahad AlGhamdi
|
Nada AlMarwani
|
Mohamed Al-Badrashiny
|
Semi-supervised Structured Prediction with Neural CRF Autoencoder
Xiao Zhang
|
Yong Jiang
|
Hao Peng
|
Kewei Tu
|
Dan Goldwasser
|
A Survey of Arabic Named Entity Recognition and Classification
Khaled Shaalan
|