Carnegie Mellon University (CMU) is a private research university in Pittsburgh, Pennsylvania, United States. The institution was established in 1900 by Andrew Carnegie as the Carnegie Technical Schools. In 1912, it became the Carnegie Institute of Technology and began granting four-year degrees. In 1967, it became Carnegie Mellon University through its merger with the Mellon Institute of Industrial Research, founded in 1913 by Andrew Mellon and Richard B. Mellon and formerly a part of the University of Pittsburgh.

[mdr] Une analyse préliminaire du rire chez des enfants de 18 à 36 mois ([lol]: a preliminary study of laughter in 18- to 36- month old children) [in French]
Relation Classification via Multi-Level Attention CNNs
The Role of Qualia Structure in Mandarin Children Acquiring Noun-modifying Constructions
Improving the neural network-based machine transliteration for low-resourced language pair
Arabic Data Science Toolkit: An API for Arabic Language Feature Extraction
SemEval-2016 Task 10: Detecting Minimal Semantic Units and their Meanings (DiMSUM)
Affective Common Sense Knowledge Acquisition for Sentiment Analysis
Special Session - The Future Directions of Dialogue-Based Intelligent Personal Assistants
Building Practical Spoken Dialog Systems
Advances in meeting recognition
Evaluation for Scenario Question Answering Systems
Discovering Causal Relations in Textual Instructions
SIDE: The Summarization Integrated Development Environment
Retrofitting Word Vectors to Semantic Lexicons
Community Evaluation and Exchange of Word Vectors at
Sparse Overcomplete Word Vector Representations
Early Gains Matter: A Case for Preferring Generative over Discriminative Crowdsourcing Models
Semi-supervised Learning of Naive Bayes Classifier with feature constraints
A Virtual Manipulative for Learning Log-Linear Models
Multilingual Open Relation Extraction Using Cross-lingual Projection
Examining the Relationship between Preordering and Word Order Freedom in Machine Translation
The CMU Machine Translation Systems at WMT 2013: Syntax, Synthetic Translation Options, and Pseudo-References
The Structure and Generality of Spoken Route Instructions
Mining the Web for Bilingual Text
Domain Portability in Speech-to-Speech Translation
The Creation of a Corpus of English Metalanguage
Toward Automatic Processing of English Metalanguage
Meteor Universal: Language Specific Translation Evaluation for Any Target Language
Automatic Measurement of Syntactic Development in Child Language
The C-ORAL-ROM CORPUS. A Multilingual Resource of Spontaneous Speech for Romance Languages
Web Mining for Unsupervised Classification
Exploring the Use of Word Relation Features for Sentiment Classification
That’s So Annoying!!!: A Lexical and Frame-Semantic Embedding Based Data Augmentation Approach to Automatic Categorization of Annoying Behaviors using #petpeeve Tweets
Integrating Linguistic Resources: The American National Corpus Model
Exploiting Semantic Web Technologies for Intelligent Access to Historical Documents
Dynamic Language Models for Streaming Text
Parametric Models of Linguistic Count Data
Speech to Speech Translation for Medical Triage in Korean
Language Model Adaptation for Statistical Machine Translation via Structured Query Models
Coupling Semi-Supervised Learning of Categories and Relations
Comprehensive Annotation of Multiword Expressions in a Social Web Corpus
Concept Classification with Bayesian Multi-task Learning
Echoes of Persuasion: The Effect of Euphony in Persuasive Communication
KU Leuven at HOO-2012: A Hybrid Approach to Detection and Correction of Determiner and Preposition Errors in Non-native English Text
G2P Conversion of Proper Names Using Word Origin Information
Pronunciation Modeling in Spelling Correction for Writers of English as a Foreign Language
Idiom Savant at Semeval-2017 Task 7: Detection and Interpretation of English Puns
Robust Dictionary Lookup in Multiple Noisy Orthographies
X575: Writing rengas with web services
A Text Normalisation System for Non-Standard English Words
Native Language Identification using Phonetic Algorithms
An Unsupervised Model for Text Message Normalization
Deep-speare: A joint neural model of poetic language, meter and rhyme
Optimal Data Set Selection: An Application to Grapheme-to-Phoneme Conversion
Free English and Czech telephone speech corpus shared under the CC-BY-SA 3.0 license
Automation and Evaluation of the Keyword Method for Second Language Learning
Pronunciation Variants and ASR of Colloquial Speech: A Case Study on Czech
Classifying Recognition Results for Spoken Dialog Systems
Construction and Analysis of Word-level Time-aligned Simultaneous Interpretation Corpus
Grapheme-to-Phoneme Models for (Almost) Any Language
Extracting Parallel Sub-Sentential Fragments from Non-Parallel Corpora
When is an Embedded MT System “Good Enough” for Filtering?
Regular Expression Guided Entity Mention Mining from Noisy Web Data
Improved Part-of-Speech Tagging for Online Conversational Text with Word Clusters
Experiential, Distributional and Dependency-based Word Embeddings have Complementary Roles in Decoding Brain Activity
An Empirical Comparison Between N-gram and Syntactic Language Models for Word Ordering
Fluent Translations from Disfluent Speech in End-to-End Speech Translation
Learning Translation Rules for a Bidirectional English-Filipino Machine Translator
Computational simulations of second language construction learning
SXUCFN-Core: STS Models Integrating FrameNet Parsing Information
Any-language frame-semantic parsing
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogue Systems
Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken Language Understanding
Unsupervised extractive summarization via coverage maximization with syntactic and semantic concepts
Jointly Modeling Inter-Slot Relations by Random Walk on Knowledge Graphs for Unsupervised Spoken Language Understanding
Combinaison de ressources générales pour une contextualisation implicite de requêtes (Query Contextualization and Reformulation by Combining External Corpora) [in French]
Research on a Model of Extracting Persons’ Information Based on Statistic Method and Conceptual Knowledge

Towards the Orwellian Nightmare: Separation of Business and Personal Emails
Distractorless Authorship Verification
Degrees of Orality in Speech-like Corpora: Comparative Annotation of Chat and E-mail Corpora
Extracting Social Power Relationships from Natural Language
Annotating Large Email Datasets for Named Entity Recognition with Mechanical Turk
Evaluating the Ontology underlying sMail - the Conceptual Framework for Semantic Email Communication
Plural Problems in the Nominal Morphology of Marathi
Nonlinear Evidence Fusion and Propagation for Hyponymy Relation Mining
Random Walk Inference and Learning in A Large Scale Knowledge Base
Documents and Dependencies: an Exploration of Vector Space Models for Semantic Composition
Corpus-based Semantic Class Mining: Distributional vs. Pattern-Based Approaches
Bootstrapping Biomedical Ontologies for Scientific Text using NELL
Discovering Relations between Noun Categories
N-Gram-Based Statistical Machine Translation versus Syntax Augmented Machine Translation: Comparison and System Combination
A Beam-Search Decoder for Normalization of Social Media Text with Application to Machine Translation
Extending Pronunciation Lexicons via Non-phonemic Respellings
Ensemble Methods for Native Language Identification
What Makes Writing Great? First Experiments on Article Quality Prediction in the Science Journalism Domain
ProPOSEL: a human-oriented prosody and PoS English lexicon for machine-learning and NLP
Transliteration Alignment
Report of NEWS 2011 Machine Transliteration Shared Task
Priming vs. Inhibition of Optional Infinitival “to”
The Gulf of Guinea Creole Corpora
Monolingual Distributional Profiles for Word Substitution in Machine Translation
Toward a Scoring Function for Quality-Driven Machine Translation
Selecting Corpus-Semantic Models for Neurolinguistic Decoding
A Modified Cosine-Similarity based Log Kernel for Support Vector Machines in the Domain of Text Classification
Automatic Selection of Context Configurations for Improved Class-Specific Word Representations
Learning Effective and Interpretable Semantic Models using Non-Negative Sparse Embedding
Benchmarking SMT Performance for Farsi Using the TEP++ Corpus
Hindi-to-Urdu Machine Translation through Transliteration
That’s Not What I Meant! Using Parsers to Avoid Structural Ambiguities in Generated Text
Quality Estimation for Synthetic Parallel Data Generation
The Operation Sequence Model—Combining N-Gram-Based and Phrase-Based Statistical Machine Translation
Automatic Evaluation of Commonsense Knowledge for Refining Japanese ConceptNet
Never-Ending Multiword Expressions Learning
Jointly Embedding Relations and Mentions for Knowledge Population
KGEval: Accuracy Estimation of Automatically Constructed Knowledge Graphs
Collectively Representing Semi-Structured Data from the Web
Construction of the Literature Graph in Semantic Scholar
Towards Never Ending Language Learning for Morphologically Rich Languages
Netgraph – Making Searching in Treebanks Easy
Learning Field Compatibilities to Extract Database Records from Unstructured Text
Is Unlabeled Data Suitable for Multiclass SVM-based Web Page Classification?
Comparing Triggering Policies for Social Behaviors
A Challenge Set for Advancing Language Modeling
Combining Neural and Non-Neural Methods for Low-Resource Morphological Reinflection
CIEMPIESS: A New Open-Sourced Mexican Spanish Radio Corpus
Cross-lingual Transfer of Correlations between Parts of Speech and Gaze Features
Weakly Supervised Part-of-speech Tagging Using Eye-tracking Data
Entropy-based Training Data Selection for Domain Adaptation
Computational Approaches to Sentence Completion
Development of Speech corpora for different Speech Recognition tasks in Malayalam language
If you can’t beat them, join them: the University of Alberta system description
Jointly Learning to Parse and Perceive: Connecting Natural Language to the Physical World
Unsupervised Relation Extraction of In-Domain Data from Focused Crawls
A Generative Entity-Mention Model for Linking Entities with Knowledge Base
Relating Simple Sentence Representations in Deep Neural Networks and the Brain
Coarse Lexical Semantic Annotation with Supersenses: An Arabic Case Study
Competitive Grouping in Integrated Phrase Segmentation and Alignment Model
Minimização do Impacto do Problema de Desvio de Conceito por Meio de Acoplamento em Ambiente de Aprendizado Sem Fim (Minimizing the Impact of the Concept Drift Problem by Using a Framework of Endless Learning) [in Portuguese]
Generative Topic Embedding: a Continuous Representation of Documents
Correction Annotation for Non-Native Arabic Texts: Guidelines and Corpus
Mapping Verbs in Different Languages to Knowledge Base Relations using Web Text as Interlingua
Textual Predictors of Bill Survival in Congressional Committees
Linguistic Structured Sparsity in Text Categorization
RtGender: A Corpus for Studying Differential Responses to Gender
A Walk on the Other Side: Using SMT Components in a Transfer-Based Translation System
Transliteration of Proper Names in Cross-Lingual Information Retrieval
Automatic Keyword Extraction on Twitter
Splusplus: A Feature-Rich Two-stage Classifier for Sentiment Analysis of Tweets
Named Entity Recognition with Long Short-Term Memory
Evaluating a Spoken Dialogue System that Detects and Adapts to User Affective States
Syllable weight encodes mostly the same information for English word segmentation as dictionary stress
Can Chinese Phonemes Improve Machine Transliteration?: A Comparative Study of English-to-Chinese Transliteration Models
Factors Influencing the Surprising Instability of Word Embeddings
Correcting General Purpose ASR Errors using Posteriors
Practical Evaluation of Speech Recognizers for Virtual Human Dialogue Systems
“Let Everything Turn Well in Your Wife”: Generation of Adult Humor Using Lexical Constraints
Uncertainty Corpus: Resource to Study User Affect in Complex Spoken Dialogue Systems
Semi-Supervised Frame-Semantic Parsing for Unknown Predicates
Frame-Semantic Parsing
Integrating lexicographic examples in a lexical network (Intégration relationnelle des exemples lexicographiques dans un réseau lexical) [in French]
Probabilistic Frame-Semantic Parsing
Semantic Frames to Predict Stock Price Movement
An Exact Dual Decomposition Algorithm for Shallow Semantic Parsing with Constraints
Statistical Models for Frame-Semantic Parsing
Using a Recurrent Neural Network Model for Classification of Tweets Conveyed Influenza-related Information
MMR-based Feature Selection for Text Categorization
學術會議資訊之擷取及其應用 (Information Extraction for Academic Conference and It’s Application) [In Chinese]
Domain Specific Speech Acts for Spoken Language Translation
Predicting Morphological Types of Chinese Bi-Character Words by Machine Learning Approaches
Incorporation of WordNet Features to n-gram Features in a Language Modeler
Event Coreference Resolution with Multi-Pass Sieves
GU-MLT-LT: Sentiment Analysis of Short Messages using Linguistic Features and Stochastic Gradient Descent
RTRGO: Enhancing the GU-MLT-LT System for Sentiment Analysis of Short Messages
A Simple Bayesian Modelling Approach to Event Extraction from Twitter
Using Skipgrams, Bigrams, and Part of Speech Features for Sentiment Classification of Twitter Messages
A Dependency Parser for Tweets
基於單語言機器翻譯技術改進中文文字蘊涵 (Improving Chinese Textural Entailment by Monolingual Machine Translation Technology) [In Chinese]
中文文字蘊涵系統之特徵分析 (Feature Analysis of Chinese Textual Entailment System) [In Chinese]
JU_CSE_NLP: Language Independent Cross-lingual Textual Entailment System
Combining fast_align with Hierarchical Sub-sentential Alignment for Better Word Alignments
Semantic Search in Documents Enriched by LOD-based Annotations
PronouncUR: An Urdu Pronunciation Lexicon Generator
Phonological Pun-derstanding
Document Re-ranking via Wikipedia Articles for Definition/Biography Type Questions
Morphosyntactic Analysis of the CHILDES and TalkBank Corpora
Empirical Studies in Learning to Read
MT and Topic-Based Techniques to Enhance Speech Recognition Systems for Professional Translators
Crowdsourcing Document Relevance Assessment with Mechanical Turk
Substring-based Transliteration with Conditional Random Fields
An Implementation of a Flexible Author-Reviewer Model of Generation using Genetic Algorithms
Tweet Normalization with Syllables
Humor Recognition and Humor Anchor Extraction
Evaluation and collection of proper name pronunciations online
Modeling Language Proficiency Using Implicit Feedback
Creative language explorations through a high-expressivity N-grams query language
Exploiting Syntactic Structures for Humor Recognition
Entity Linking for Spoken Language
Factored Language Model based on Recurrent Neural Network
A Web Application for Automated Dialect Analysis
Bekli: A Simple Approach to Twitter Text Normalization
NgramQuery - Smart Information Extraction from Google N-gram using External Resources
Model Invertibility Regularization: Sequence Alignment With or Without Parallel Data
A Computational Approach to the Automation of Creative Naming
An MDL-based approach to extracting subword units for grapheme-to-phoneme conversion
Exploration of the Impact of Maximum Entropy in Recurrent Neural Network Language Models for Code-Switching Speech
A Comparison of Entity Matching Methods between English and Japanese Katakana
Using English Acoustic Models for Hindi Automatic Speech Recognition
Modeling Sentiment Association in Discourse for Humor Recognition
Inducing Search Keys for Name Filtering
Homonym Detection For Humor Recognition In Short Text
An Ensemble of Grapheme and Phoneme for Machine Transliteration
Recognizing Humour using Word Associations and Humour Anchor Extraction
Name Matching between Roman and Chinese Scripts: Machine Complements Human
Incorporating Pronunciation Variation into Different Strategies of Term Transliteration
A Real-life, French-accented Corpus of Air Traffic Control Communications
Ambient Search: A Document Retrieval System for Speech Streams
Pair Language Models for Deriving Alternative Pronunciations and Spellings from Pronunciation Dictionaries
Making Computers Laugh: Investigations in Automatic Humor Recognition
Predicting the Difficulty of Language Proficiency Tests
BRAINSUP: Brainstorming Support for Creative Sentence Generation
How to Memorize a Random 60-Bit String
Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and A Network-based ASR System
Semi-Supervised Lexicon Mining from Parenthetical Expressions in Monolingual Web Pages
以語文特徵為基之中學閱讀測驗短文分級 (Using Linguistic Features to Classify Texts for Reading Comprehension Tests at the High School Levels) [In Chinese]
Augmenting Translation Models with Simulated Acoustic Confusions for Improved Spoken Language Translation
Computerized Analysis of a Verbal Fluency Test
A Hybrid Approach to English-Korean Name Transliteration
ProPOSEL: A Prosody and POS English Lexicon for Language Engineering
LDC Forced Aligner
Why is “SXSW” trending? Exploring Multiple Text Sources for Twitter Topic Summarization
Beyond Normalization: Pragmatics of Word Form in Text Messages
Automatic Recognition of Cantonese-English Code-Mixing Speech
Generating Topical Poetry
Readability Assessment of Translated Texts
Predicting the Spelling Difficulty of Words for Language Learners
Détection de transcriptions incorrectes de parole non-native dans le cadre de l’apprentissage de langues étrangères (Detection of incorrect transcriptions of non-native speech in the context of foreign language learning) [in French]
A Broad-Coverage Normalization System for Social Media Language
A Sequence Alignment Model Based on the Averaged Perceptron
Exploiting Machine-Transcribed Dialog Corpus to Improve Multiple Dialog States Tracking Methods
Multi-Tier Annotations in the Verbmobil Corpus
Bikers Accessing the Web: The SmartWeb Motorbike Corpus
SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus
Annotating Multi-media/Multi-modal Resources with ELAN
ELAN: a Professional Framework for Multimodality Research
Parsing the CHILDES Database: Methodology and Lessons Learned
I will shoot your shopping down and you can shoot all my tins—Automatic Lexical Acquisition from the CHILDES Database
Talkbank: Building an Open Unified Multimodal Database of Communicative Interaction
Vulnerability in Acquisition, Language Impairments in Dutch: Creating a VALID Data Archive
POSCAT: A Morpheme-based Speech Corpus Annotation Tool
A Human Judgement Corpus and a Metric for Arabic MT Evaluation
Understanding Temporal Expressions in Emails
SuMT: A Framework of Summarization and MT
Toward General-Purpose Learning for Information Extraction
Breaking the Closed World Assumption in Text Classification
Multi-Human Dialogue Understanding for Assisting Artifact-Producing Meetings
Domain Adaptation of Maximum Entropy Language Models
Modeling the Use of Graffiti Style Features to Signal Social Relations within a Multi-Domain Learning Paradigm
Supersense Tagging for Arabic: the MT-in-the-Middle Attack
Heterogeneous Data Sources for Signed Language Analysis and Synthesis: The SignCom Project
Promoting Interoperability of Resources in META-SHARE
An Out-of-Domain Test Suite for Dependency Parsing of German
G-TUNA: a corpus of referring expressions in German, including duration information
Towards Using EEG to Improve ASR Accuracy
Socially Responsible NLP
Syntactic annotation of spoken utterances: A case study on the Czech Academic Corpus
Non-linear Mapping for Improved Identification of 1300+ Languages
Event Extraction as Frame-Semantic Parsing
Modeling Consensus: Classifier Combination for Word Sense Disambiguation
A Survey of Arabic Named Entity Recognition and Classification
Recall-Oriented Learning of Named Entities in Arabic Wikipedia
Joint Inference for Event Coreference Resolution
Adapting an Example-Based Translation System to Chinese
Hands-On NLP for an Interdisciplinary Audience
HRItk: The Human-Robot Interaction ToolKit Rapid Development of Speech-Centric Interactive Systems in ROS
Speaking, Seeing, Understanding: Correlating semantic models with conceptual representation in the brain
Wordform- and Class-based Prediction of the Components of German Nominal Compounds in an AAC System
Teaching Applied Natural Language Processing: Triumphs and Tribulations
Towards Conversational QA: Automatic Identification of Problematic Situations and User Intent
Grouping business news stories based on salience of named entities
An ontology-based approach in the literary research: two case-studies
Matching Inconsistently Spelled Names in Automatic Speech Recognizer Output for Information Retrieval
Automatic Classification of Communicative Functions of Definiteness

Stacking or Supertagging for Dependency Parsing – What’s the Difference?
A Bayesian Mixed Effects Model of Literary Character
Frame-Semantic Role Labeling with Heterogeneous Annotations
UNIBA: Sentiment Analysis of English Tweets Combining Micro-blogging, Lexicon and Semantic Features
Unsupervised Parsing for Generating Surface-Based Relation Extraction Patterns
Language Modeling with Power Low Rank Ensembles
LEXUS, a web-based tool for manipulating lexical resources
Studying the Effect of Input Size for Bayesian Word Segmentation on the Providence Corpus
Construction and Automatization of a Minnan Child Speech Corpus with some Research Findings
Metadata Collection Records for Language Resources
Active Learning for Building a Corpus of Questions for Parsing
Challenges in modality annotation in a Brazilian Portuguese Spontaneous Speech Corpus
High-accuracy Annotation and Parsing of CHILDES Transcripts
FOLKER: An Annotation Tool for Efficient Transcription of Natural, Multi-party Interaction
Management of Metadata in Linguistic Fieldwork: Experience from the ACLA Project
An annotated English child language database
A corpus of European Portuguese child and child-directed speech
Lower and higher estimates of the number of “true analogies” between sentences contained in a large multilingual corpus
The ACQDIV Database: Min(d)ing the Ambient Language
Representing and Rendering Linguistic Paradigms
Multimedia Language Resources
A large scale annotated child language construction database
The AnnCor CHILDES Treebank
Text Classification by Bootstrapping with Keywords, EM and Shrinkage
A Phonemic Corpus of Polish Child-Directed Speech
Multi-Class Confidence Weighted Algorithms
Domain Adaptation to Summarize Human Conversations
Learning User Embeddings from Emails
Summarizing Spoken and Written Conversations
Semi-supervised Speech Act Recognition in Emails and Forums
Polyglot Neural Language Models: A Case Study in Cross-Lingual Phonetic Representation Learning
CrystalNest at SemEval-2017 Task 4: Using Sarcasm Detection for Enhancing Sentiment Classification and Quantification
Authorship Attribution of E-Mail: Comparing Classifiers over a New Corpus for Evaluation
The Role of Roles in Classifying Annotated Biomedical Text
Inconsistency Detection in Semantic Annotation
Supersense Embeddings: A Unified Model for Supersense Interpretation, Prediction, and Utilization
Extreme Adaptation for Personalized Neural Machine Translation
Metaphor Detection with Cross-Lingual Model Transfer
“A Spousal Relation Begins with a Deletion of engage and Ends with an Addition of divorce”: Learning State Changing Verbs from Wikipedia Revision History
Identifying Semantic Edit Intentions from Revisions in Wikipedia
Pushing the Limits of Translation Quality Estimation
Using Bilingual Parallel Corpora for Cross-Lingual Textual Entailment
Discrepancy Between Automatic and Manual Evaluation of Summaries
A Corpus of Preposition Supersenses
DART: a Dataset of Arguments and their Relations on Twitter
Large-scale Cloze Test Dataset Created by Teachers
Embracing Non-Traditional Linguistic Resources for Low-resource Language Name Tagging
A Continuously Growing Dataset of Sentential Paraphrases
The WebNLG Challenge: Generating Text from RDF Data
Quality Estimation of English-French Machine Translation: A Detailed Study of the Role of Syntax
Local Histograms of Character N-grams for Authorship Attribution
Prepositional Phrase Attachment over Word Embedding Products
Using fMRI activation to conceptual stimuli to evaluate methods for extracting conceptual representations from corpora
Seernet at EmoInt-2017: Tweet Emotion Intensity Estimator
Edit Categories and Editor Role Identification in Wikipedia
Universal Dependencies for Portuguese
Measuring corpus homogeneity using a range of measures for inter-document distance
To Memorize or to Predict: Prominence labeling in Conversational Speech
Impact of MWE Resources on Multiword Recognition
New Experiments in Distributional Representations of Synonymy
A Compositional and Interpretable Semantic Space
Learning to Follow Navigational Directions
IITPB at SemEval-2017 Task 5: Sentiment Prediction in Financial Text
Illegal is not a Noun: Linguistic Form for Detection of Pejorative Nominalizations
Encoding Conversation Context for Neural Keyphrase Extraction from Microblog Posts
Annotating similes in literary texts
Named Entity Recognition and Hashtag Decomposition to Improve the Classification of Tweets
Elucidating Conceptual Properties from Word Embeddings
Learning to Identify Definitions using Syntactic Features
Event Embeddings for Semantic Script Modeling
Learning to Search for Recognizing Named Entities in Twitter
BLANC: Learning Evaluation Metrics for MT
A Combination of Topic Models with Max-margin Learning for Relation Detection
Joint Information Extraction and Reasoning: A Scalable Statistical Relational Learning Approach
Yuanfudao at SemEval-2018 Task 11: Three-way Attention and Relational Knowledge for Commonsense Machine Comprehension
THU_NGN at SemEval-2018 Task 3: Tweet Irony Detection with Densely connected LSTM and Multi-task Learning
Crowdsourcing High-Quality Parallel Data Extraction from Twitter
GradAscent at EmoInt-2017: Character and Word Level Recurrent Neural Network Models for Tweet Emotion Intensity Detection
Language Identification: The Long and the Short of the Matter
JU_NLP at SemEval-2016 Task 6: Detecting Stance in Tweets using Support Vector Machines
Parallel Implementations of Word Alignment Tool
Resolving Task Specification and Path Inconsistency in Taxonomy Construction
Real Time Adaptive Machine Translation for Post-Editing with cdec and TransCenter
Entity Annotation based on Inverse Index Operations
Context Sensitive Lemmatization Using Two Successive Bidirectional Gated Recurrent Networks
HLP@UPenn at SemEval-2017 Task 4A: A simple, self-optimizing text classification system combining dense and sparse vectors
Unsupervised Learning of Prototypical Fillers for Implicit Semantic Role Labeling
Interpretable Semantic Vectors from a Joint Model of Brain- and Text- Based Meaning
Microblog Conversation Recommendation via Joint Modeling of Topics and Discourse
Of Words, Eyes and Brains: Correlating Image-Based Distributional Semantic Models with Neural Representations of Concepts
Representation Based Translation Evaluation Metrics
Recursive Top-down Fuzzy Match: New Perspectives on Memory-based Parsing
Microblogs as Parallel Corpora
Detecting Nastiness in Social Media
“i have a feeling trump will win..................”: Forecasting Winners and Losers from User Predictions on Twitter
Feature-Rich Twitter Named Entity Recognition and Classification
Frame Semantics across Languages: Towards a Multilingual FrameNet
Modified Distortion Matrices for Phrase-Based Statistical Machine Translation
Symmetric Pattern Based Word Embeddings for Improved Word Similarity Prediction
SystemT: An Algebraic Approach to Declarative Information Extraction
Tweety at SemEval-2018 Task 2: Predicting Emojis using Hierarchical Attention Neural Networks and Support Vector Machine
Integrating Optical Character Recognition and Machine Translation of Historical Documents
Acquisition of Syntactic Simplification Rules for French
Towards a General Rule for Identifying Deceptive Opinion Spam
Language Identification and Analysis of Code-Switched Social Media Text
Assessing linguistically aware fuzzy matching in translation memories
Time Expression Analysis and Recognition Using Syntactic Token Types and General Heuristic Rules
THU_NGN at SemEval-2018 Task 2: Residual CNN-LSTM Network with Attention for English Emoji Prediction
MPST: A Corpus of Movie Plot Synopses with Tags
Neural Activation Semantic Models: Computational lexical semantic models of localized neural activations
Classification from Full Text: A Comparison of Canonical Sections of Scientific Papers
SimiHawk at SemEval-2016 Task 1: A Deep Ensemble System for Semantic Textual Similarity
Twitter Named Entity Extraction and Linking Using Differential Evolution
UW-CSE at SemEval-2016 Task 10: Detecting Multiword Expressions and Supersenses using Double-Chained Conditional Random Fields
Praat on the Web: An Upgrade of Praat for Semi-Automatic Speech Annotation
Predicting Native Language from Gaze
A Cascade Method for Detecting Hedges and their Scope in Natural Language Text
Scalable Construction and Reasoning of Massive Knowledge Bases
Cutting the Long Tail: Hybrid Language Models for Translation Style Adaptation
A Sense-Based Translation Model for Statistical Machine Translation
Exploring Semantic Representation in Brain Activity Using Word Embeddings
Constructing Task-Specific Taxonomies for Document Collection Browsing
The U.S. Policy Agenda Legislation Corpus Volume 1 - a Language Resource from 1947 - 1998
Learning Paraphrasing for Multiword Expressions
Generalizing Dependency Features for Opinion Mining
Sprinkling Topics for Weakly Supervised Text Classification
A Hybrid Text Classification Approach for Analysis of Student Essays
Maximum Entropy Based Phrase Reordering Model for Statistical Machine Translation
MT Tuning on RED: A Dependency-Based Evaluation Metric
Is this a wampimuk? Cross-modal mapping between distributional semantics and the visual world
Novelty Goes Deep. A Deep Neural Solution To Document Level Novelty Detection
A Quantitative Analysis of Lexical Differences Between Genders in Telephone Conversations
The First Surface Realisation Shared Task: Overview and Evaluation Results
Learning a POS tagger for AAVE-like language
Demographic Dialectal Variation in Social Media: A Case Study of African-American English
Document-Level Automatic MT Evaluation based on Discourse Representations
MTNT: A Testbed for Machine Translation of Noisy Text
LIUM’s SMT Machine Translation Systems for WMT 2011
Non-distributional Word Vector Representations
A Comparative Study of Syntactic Parsers for Event Extraction
Understanding Mental States in Natural Language
Matrix Factorization using Window Sampling and Negative Sampling for Improved Word Representations
CASICT-DCU Participation in WMT2015 Metrics Task
THU_NGN at SemEval-2018 Task 1: Fine-grained Tweet Sentiment Intensity Analysis with Attention CNN-LSTM
Paraphrase Identification and Semantic Similarity in Twitter with Simple Features
Towards Automatically Classifying Depressive Symptoms from Twitter Data for Population Health
Which Tumblr Post Should I Read Next?
IITP at SemEval-2017 Task 8: A Supervised Approach for Rumour Evaluation
Mining Parallel Corpora from Sina Weibo and Twitter
The DCU Dependency-Based Metric in WMT-MetricsMATR 2010
Learning when to trust distant supervision: An application to low-resource POS tagging using cross-lingual projection
Machine Learning Disambiguation of Quechua Verb Morphology
RED: A Reference Dependency Based MT Evaluation Metric
FBK-HLT: An Effective System for Paraphrase Identification and Semantic Similarity in Twitter
Unbabel’s Participation in the WMT17 Translation Quality Estimation Shared Task
Detecting Context Dependent Messages in a Conversational Environment
Simple or Complex? Classifying Questions by Answering Complexity
The Karlsruhe Institute of Technology Translation Systems for the WMT 2011
ASU: An Experimental Study on Applying Deep Learning in Twitter Named Entity Recognition
Identifying Real or Fake Articles: Towards better Language Modeling
IITP at EmoInt-2017: Measuring Intensity of Emotions using Sentence Embeddings and Optimized Features
Identifying Effective Translations for Cross-lingual Arabic-to-English User-generated Speech Search
Identifying Experimental Techniques in Biomedical Literature
IMS at EmoInt-2017: Emotion Intensity Prediction with Affective Norms, Automatically Extended Resources and Deep Learning
Automatic Extraction of News Values from Headline Text
Double Embeddings and CNN-based Sequence Labeling for Aspect Extraction
The Karlsruhe Institute for Technology Translation System for the ACL-WMT 2010
LIUM’s SMT Machine Translation Systems for WMT 2012
Labeling Unlabeled Data using Cross-Language Guided Clustering
Unbabel’s Participation in the WMT16 Word-Level Translation Quality Estimation Shared Task
RACE: Large-scale ReAding Comprehension Dataset From Examinations
Testing Semantic Similarity Measures for Extracting Synonyms from a Corpus
The Web Library of Babel: evaluating genre collections
TwiSe at SemEval-2017 Task 4: Five-point Twitter Sentiment Classification and Quantification
Story Assembly in a Dyslexia Fluency Tutor
Recognizing Counterfactual Thinking in Social Media Texts
Activity detection for information access to oral communication
Neural Models for Key Phrase Extraction and Question Generation
Is “Universal Syntax” Universally Useful for Learning Distributed Word Representations?
A Joint Sequential and Relational Model for Frame-Semantic Parsing
Grammatical Relations in Chinese: GB-Ground Extraction and Data-Driven Parsing
Typed Tensor Decomposition of Knowledge Bases for Relation Extraction
Parsing for Grammatical Relations via Graph Merging
Language Model-Based Document Clustering Using Random Walks
A Joint Model of Conversational Discourse Latent Topics on Microblogs
UWB at SemEval-2018 Task 1: Emotion Intensity Detection in Tweets
TwiSE at SemEval-2016 Task 4: Twitter Sentiment Classification
Textual Entailment based Question Generation
Character Sequence Models for Colorful Words
Making Dependency Labeling Simple, Fast and Accurate
Learning from Post-Editing: Online Model Adaptation for Statistical Machine Translation
Factoring Adjunction in Hierarchical Phrase-Based SMT
The Grammar of English Deverbal Compounds and their Meaning
Gating Mechanisms for Combining Character and Word-level Word Representations: an Empirical Study
On the Feasibility of Automated Detection of Allusive Text Reuse
The binary trio at SemEval-2019 Task 5: Multitarget Hate Speech Detection in Tweets
Using Human Attention to Extract Keyphrase from Microblog Post
Handling Divergent Reference Texts when Evaluating Table-to-Text Generation
On Evaluation of Adversarial Perturbations for Sequence-to-Sequence Models
Blackbox Meets Blackbox: Representational Similarity & Stability Analysis of Neural Language Models and Brains
Proceedings of the 27th International Conference on Computational Linguistics: Tutorial Abstracts
SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Social Media (OffensEval)
MIDAS at SemEval-2019 Task 6: Identifying Offensive Posts and Targeted Offense from Twitter
Cross-Lingual Syntactic Transfer through Unsupervised Adaptation of Invertible Projections
JHU 2019 Robustness Task System Description
Major Life Event Extraction from Twitter based on Congratulations/Condolences Speech Acts
Weakly Supervised User Profile Extraction from Twitter
Leveraging Knowledge Bases in LSTMs for Improving Machine Reading
Learning Verbs on the Fly
Task-oriented Evaluation of Syntactic Parsers and Their Representations
A Semiparametric Gaussian Copula Regression Model for Predicting Financial Risks from Earnings Calls
Graph Based Decoding for Event Sequencing and Coreference Resolution
AUTOLEX: An Automatic Lexicon Builder for Minority Languages Using an Open Corpus
Categorizing Web Pages as a Preprocessing Step for Information Extraction
Connotation Frames of Power and Agency in Modern Films
The CMU-ARK German-English Translation System
Role-specific Language Models for Processing Recorded Neuropsychological Exams
NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches
Native Language Identification using Phonetic Algorithms
English-Korean Named Entity Transliteration Using Statistical Substring-based and Rule-based Approaches
Designing Agreement Features for Realization Ranking
Perceptron Reranking for CCG Realization
Facilitating Translation Using Source Language Paraphrase Lattices
Interpreting BLEU/NIST Scores: How Much Improvement do We Need to Have a Better System?
Identifying the L1 of non-native writers: the CMU-Haifa system
CEPLEXicon ― A Lexicon of Child European Portuguese
Leveraging Inflection Tables for Stemming and Lemmatization
Morphological Analysis without Expert Annotation
Learning a Deep Hybrid Model for Semi-Supervised Text Classification
The Second QALB Shared Task on Automatic Text Correction for Arabic
Large Scale Arabic Error Annotation: Guidelines and Framework
Building an Arabic Machine Translation Post-Edited Corpus: Guidelines and Annotation
Correction Annotation for Non-Native Arabic Texts: Guidelines and Corpus
The First QALB Shared Task on Automatic Text Correction for Arabic
Building a Dataset for Summarization and Keyword Extraction from Emails
Event Nugget and Event Coreference Annotation
The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems
UT-DB: An Experimental Study on Sentiment Analysis in Twitter
Twitter Part-of-Speech Tagging for All: Overcoming Sparse and Noisy Data
Transferring from Formal Newswire Domain with Hypernet for Twitter POS Tagging
TUGAS: Exploiting unlabelled data for Twitter sentiment analysis
What I’ve learned about annotating informal text (and why you shouldn’t take my word for it)
Simultaneous Feature Selection and Parameter Optimization Using Multi-objective Optimization for Sentiment Analysis
Experiments with crowdsourced re-annotation of a POS tagging data set
Improved Part-of-Speech Tagging for Online Conversational Text with Word Clusters
SU-FMI: System Description for SemEval-2014 Task 9 on Sentiment Analysis in Twitter
Tune Your Brown Clustering, Please
A Unified Model for Topics, Events and Users on Twitter
Sarcastic or Not: Word Embeddings to Predict the Literal or Sarcastic Meaning of Words
ECNU: Expression- and Message-level Sentiment Orientation Classification in Twitter Using Multiple Effective Features
Sentiment Lexicon Interpolation and Polarity Estimation of Objective and Out-Of-Vocabulary Words to Improve Sentiment Classification on Microblogging
QCRI at SemEval-2016 Task 4: Probabilistic Methods for Binary and Ordinal Quantification
Shallow Parsing Pipeline - Hindi-English Code-Mixed Social Media Text
IITP: Hybrid Approach for Text Normalization in Twitter
CodeX: Combining an SVM Classifier and Character N-gram Language Models for Sentiment Analysis on Twitter Text
IITP: Multiobjective Differential Evolution based Twitter Named Entity Recognition
Semi-Supervised Learning of Sequence Models with Method of Moments
UNIBA: Sentiment Analysis of English Tweets Combining Micro-blogging, Lexicon and Semantic Features
Indian Institute of Technology-Patna: Sentiment Analysis in Twitter
TeamX: A Sentiment Analyzer with Enhanced Lexicon Mapping and Weighting Scheme for Unbalanced Data
Negation Scope Detection for Twitter Sentiment Analysis
Learning part-of-speech taggers with inter-annotator agreement loss
The Unreasonable Effectiveness of Word Representations for Twitter Named Entity Recognition
KLUE: Simple and robust methods for polarity classification
What does this Emoji Mean? A Vector Space Skip-Gram Model for Twitter Emojis
Adapting taggers to Twitter with not-so-distant supervision
Question Answering in Restricted Domains: An Overview
Mining Arguments From 19th Century Philosophical Texts Using Topic Based Modelling
Transliteration Alignment
Evaluation of Pronunciation Variants in the ASR Lexicon for Different Speaking Styles
Korean Children’s Spoken English Corpus and an Analysis of its Pronunciation Variability
Using a Wikipedia-based Semantic Relatedness Measure for Document Clustering
Morphological Segmentation for Keyword Spotting
Combining Probability-Based Rankers for Action-Item Detection
PronouncUR: An Urdu Pronunciation Lexicon Generator
Phonological Pun-derstanding
A case study on using speech-to-translation alignments for language documentation
Management of Metadata in Linguistic Fieldwork: Experience from the ACLA Project
Lekbot: A talking and playing robot for children with disabilities
Querying Both Time-aligned and Hierarchical Corpora with NXT Search
Towards Broad-coverage Meaning Representation: The Case of Comparison Structures
Learning to Jointly Predict Ellipsis and Comparison Structures
Telling Apart Tweets Associated with Controversial versus Non-Controversial Topics
Hope at SemEval-2019 Task 6: Mining social media language to discover offensive language
A Corpus and Model Integrating Multiword Expressions and Supersenses
Unsupervised Discovery of Biographical Structure from Text
Random Walk Inference and Learning in A Large Scale Knowledge Base
Which Noun Phrases Denote Which Concepts?
Discovering Relations between Noun Categories
Joint Inference for Event Coreference Resolution
Learning Latent Personas of Film Characters
Simplified Dependency Annotations with GFL-Web
Sentiment Analysis using Imperfect Views from Spoken Language and Acoustic Modalities
A Web-based Demonstrator of a Multi-lingual Phrase-based Translation System
SYNGRAPH: A Flexible Matching Method based on Synonymous Expression Extraction from an Ordinary Dictionary and a Web Corpus
Semi-Supervised SimHash for Efficient Document Similarity Search
Learning to Define Terms in the Software Domain
FBK-HLT: An Application of Semantic Textual Similarity for Answer Selection in Community Question Answering
A Linguistic Knowledge Discovery Tool: Very Large Ngram Database Search with Arbitrary Wildcards
N-Best Rescoring Based on Pitch-accent Patterns
Phrase Dependency Machine Translation with Quasi-Synchronous Tree-to-Tree Features
What Can We Get From 1000 Tokens? A Case Study of Multilingual POS Tagging For Resource-Poor Languages
以語音辨識與評分輔助口說英文學習 (Spoken English Learning Based on Speech Recognition and Assessment) [In Chinese]
A Phonemic Corpus of Polish Child-Directed Speech
Language Model Adaptation for Statistical Machine Translation Based on Information Retrieval
The ACQDIV Database: Min(d)ing the Ambient Language
Prediction of a Movie’s Success From Plot Summaries Using Deep Learning Models
Multilingual Short Text Responses Clustering for Mobile Educational Activities: a Preliminary Exploration
Discriminative Lexical Semantic Segmentation with Gaps: Running the MWE Gamut
Dudley North visits North London: Learning When to Transliterate to Arabic
The Dutch LESLLA Corpus
Neutralizing Linguistically Problematic Annotations in Unsupervised Dependency Parsing Evaluation
Word-Sense Disambiguation for Machine Translation
Book Reviews: Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms by Thorsten Joachims; Anaphora Resolution by Ruslan Mitkov
An Interactive Tool for Supporting Error Analysis for Text Mining
SMT and SPE Machine Translation Systems for WMT’09
Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter
Documents and Dependencies: an Exploration of Vector Space Models for Semantic Composition
Toward General-Purpose Learning for Information Extraction
Interpretable Semantic Vectors from a Joint Model of Brain- and Text- Based Meaning
LIUM SMT Machine Translation System for WMT 2010
Spectral Clustering for Example Based Machine Translation
Improving “Email Speech Acts” Analysis via N-gram Selection
Determiner-Established Deixis to Communicative Artifacts in Pedagogical Text
Conversational Strategies for Robustly Managing Dialog in Public Spaces
Two Approaches to Metaphor Detection
Development of Resources for a Bilingual Automatic Index System of Broadcast News in Basque and Spanish
Recognition of Polish Temporal Expressions
PodCastle: A Spoken Document Retrieval Service Improved by Anonymous User Contributions
Multilingual and cross-lingual news topic tracking
The Automatic Generation of Formal Annotations in a Multimedia Indexing and Searching Environment
Improved Recognition and Normalisation of Polish Temporal Expressions
Parsing as Reduction
Aligning Opinions: Cross-Lingual Opinion Mining with Dependencies
Turning on the Turbo: Fast Third-Order Non-Projective Turbo Parsers
Fast and Robust Compressive Summarization with Dual Decomposition and Multi-Task Learning
Mapping Verbs in Different Languages to Knowledge Base Relations using Web Text as Interlingua
A Unified Annotation Scheme for the Semantic/Pragmatic Components of Definiteness
Socially Responsible NLP
Augmenting English Adjective Senses with Supersenses
The Italian NESPOLE! Corpus: a Multilingual Database with Interlingua Annotation in Tourism and Medical Domains
Lexical Level Distribution of Metadiscourse in Spoken Language
Neural Fuzzy Repair: Integrating Fuzzy Matches into Neural Machine Translation
#Emotional Tweets
Crowdsourcing and annotating NER for Twitter #drift
Signatures, Typed Feature Structures and RDFS
Boosting Statistical Machine Translation by Lemmatization and Linear Interpolation
Annotating Large Email Datasets for Named Entity Recognition with Mechanical Turk
An Efficient Annotation for Phrasal Verbs using Dependency Information
Named Entity Recognition for Linguistic Rapid Response in Low-Resource Languages: Sorani Kurdish and Tajik
Improving Vector Space Word Representations Using Multilingual Correlation
From Protein-Protein Interaction to Molecular Event Extraction
Overview of BioNLP’09 Shared Task on Event Extraction
Improving Peer Feedback Prediction: The Sentence Level is Right
Automated Rating of ESL Essays
Predicting Tasks in Goal-Oriented Spoken Dialog Systems using Semantic Knowledge Bases
Arap-Tweet: A Large Multi-Dialect Twitter Corpus for Gender, Age and Language Variety Identification
Knowledge Acquisition Strategies for Goal-Oriented Dialog Systems
A Knowledge-Intensive Model for Prepositional Phrase Attachment
Cross-Lingual Information Retrieval and Semantic Interoperability for Cultural Heritage Repositories
Odds of Successful Transfer of Low-Level Concepts: a Key Metric for Bidirectional Speech-to-Speech Machine Translation in DARPA’s TRANSTAC Program
METEOR: An Automatic Metric for MT Evaluation with High Levels of Correlation with Human Judgments
Edit Distance: A Metric for Machine Translation Evaluation
Meteor, M-BLEU and M-TER: Evaluation Metrics for High-Correlation with Human Rankings of Machine Translation Output
Meteor Universal: Language Specific Translation Evaluation for Any Target Language
AppDialogue: Multi-App Dialogues for Intelligent Assistants
Predicting the Evocation Relation between Lexicalized Concepts
Knowledge-Based Labeling of Semantic Relationships in English for better language modelling
Tempo-Lexical Context Driven Word Embedding for Cross-Session Search Task Extraction
Cross-domain Feature Selection for Language Identification
A Text Categorization Based on a Summarization Extraction
A Proposal for combining “general” and specialized frames
On Grammaticality in the Syntactic Annotation of Learner Language
High-accuracy Annotation and Parsing of CHILDES Transcripts
The AnnCor CHILDES Treebank
Statistical Modality Tagging from Rule-based Annotations and Crowdsourcing
Supervised Phrase Table Triangulation with Neural Word Embeddings for Low-Resource Languages
Improving Statistical Machine Translation Performance by Training Data Selection and Optimization
Interactive ASR Error Correction for Touchscreen Devices
DialPort: A General Framework for Aggregating Dialog Systems
The MADAR Arabic Dialect Corpus and Lexicon
HTM: A Topic Model for Hypertexts
From Pipedreams to Products, and Promise!
Context-aware Frame-Semantic Role Labeling
An Investigation on the Influence of Frequency on the Lexical Organization of Verbs
A Graph-based Lattice Dependency Parser for Joint Morphological Segmentation and Syntactic Analysis
The New Edition of the Natural Language Software Registry (an Initiative of ACL hosted at DFKI)
Embracing Non-Traditional Linguistic Resources for Low-resource Language Name Tagging
Why does PairDiff work? - A Mathematical Analysis of Bilinear Relational Compositional Operators for Analogy Detection
Generating Typed Dependency Parses from Phrase Structure Parses
Script Independent Word Spotting in Multilingual Documents
A Walk on the Other Side: Using SMT Components in a Transfer-Based Translation System
Discretization Based Learning for Information Retrieval
SconeEdit: A Text-guided Domain Knowledge Editor
Distributional Identification of Non-Referential Pronouns
The Influence of Data Homogeneity on NLP System Performance
Linguistic Miner: An Italian Linguistic Knowledge System
Utterance-Level Multimodal Sentiment Analysis
Jointly Learning Grounded Task Structures from Language Instruction and Visual Demonstration
Scalable Statistical Relational Learning for NLP
Extracting Personal Names from Email: Applying Named Entity Recognition to Informal Text
Computational Analysis of Referring Expressions in Narratives of Picture Books
Learning to Order Natural Language Texts
Stochastic Iterative Alignment for Machine Translation Evaluation
A Simulation-based Framework for Spoken Language Understanding and Action Selection in Situated Interaction
Unsupervised Text Recap Extraction for TV Series
Frame-Semantic Parsing
Priberam Compressive Summarization Corpus: A New Multi-Document Summarization Corpus for European Portuguese
Java Libraries for Accessing the Princeton Wordnet: Comparison and Evaluation
Exploiting a Multilingual Web-based Encyclopedia for Bilingual Terminology Extraction
Simple supervised document geolocation with geodesic grids
Clustering dictionary definitions using Amazon Mechanical Turk
Learning a Compositional Semantics for Freebase with an Open Predicate Vocabulary
Inducing Latent Semantic Relations for Structured Distributional Semantics
LoonyBin: Keeping Language Technologists Sane through Automated Management of Experimental (Hyper)Workflows
Learning Translation Rules from Bilingual English - Filipino Corpus
Speech Translation for Triage of Emergency Phonecalls in Minority Languages
Learning a Stopping Criterion for Active Learning for Word Sense Disambiguation and Text Classification
The Swedish Model of Public Outreach of Linguistics to secondary school Students through Olympiads
MIKE: An Interactive Microblogging Keyword Extractor using Contextual Semantic Smoothing
LAPPS/Galaxy: Current State and Next Steps
Dependency Parsing for Weibo: An Efficient Probabilistic Logic Programming Approach
Predicting Grammaticality on an Ordinal Scale
Error Detection for Statistical Machine Translation Using Linguistic Features
Reranking Translation Hypotheses Using Structural Properties
An algorithm for open text semantic parsing
Lycos Retriever: An Information Fusion Engine
Integrating lexical, syntactic and system-based features to improve Word Confidence Estimation in SMT
Analysis of Link Grammar on Biomedical Dependency Corpus Targeted at Protein-Protein Interactions
Error Detection Using Linguistic Features
Identifying and Avoiding Confusion in Dialogue with People with Alzheimer’s Disease
Extensions to the GrETEL Treebank Query Application
Towards a Model of Prediction-based Syntactic Category Acquisition: First Steps with Word Embeddings
Semi-Supervised Frame-Semantic Parsing for Unknown Predicates
Turbo Parsers: Dependency Parsing by Approximate Variational Inference
Learning finite state word representations for unsupervised Twitter adaptation of POS taggers
Minimal Dependency Length in Realization Ranking
LYSGROUP: Adapting a Spanish microtext normalization system to English
Classifying Tweet Level Judgements of Rumours in Social Media
Semi-supervised Dependency Parsing using Bilexical Contextual Features from Auto-Parsed Data
Frame-Semantic Role Labeling with Heterogeneous Annotations
Bi-directional Inter-dependencies of Subjective Expressions and Targets and their Value for a Joint Model
Tree Edit Models for Recognizing Textual Entailments, Paraphrases, and Answers to Questions
Challenges of studying and processing dialects in social media
A Language Model Approach to Keyphrase Extraction
The Translation Correction Tool: English-Spanish User Studies
Towards Normalising Konkani-English Code-Mixed Social Media Text
Conceptor Debiasing of Word Representations Evaluated on WEAT
The Role of Protected Class Word Lists in Bias Identification of Contextualized Word Representations
Branch and Bound Algorithm for Dependency Parsing with Non-local Features
(Re)ranking Meets Morphosyntax: State-of-the-art Results from the SPMRL 2013 Shared Task
Introducing the IMS-Wrocław-Szeged-CIS entry at the SPMRL 2014 Shared Task: Reranking and Morpho-syntax meet Unlabeled Data
Exploiting Variant Corpora for Machine Translation
Artificial IntelliDance: Teaching Machine Learning through a Choreography
Multilingual Terminology Extraction and Validation
A Discriminative Latent Variable-Based “DE” Classifier for Chinese-English SMT
Identifying Metaphorical Word Use with Tree Kernels
To Sing like a Mockingbird
Lexical Discovery with an Enriched Semantic Network
Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems
Social Links from Latent Topics in Microblogs
Compressing Trigram Language Models With Golomb Coding
String Transduction with Target Language Models and Insertion Handling
This Text Has the Scent of Starbucks: A Laplacian Structured Sparsity Model for Computational Branding Analytics
Joint Syntactic and Semantic Parsing with Combinatory Categorial Grammar
Proceedings of the HLT-NAACL 2003 Workshop on Research Directions in Dialogue Processing
A Development Environment for Configurable Meta-Annotators in a Pipelined NLP Architecture
Collaborative Development and Evaluation of Text-processing Workflows in a UIMA-supported Web-based Workbench
Annotating and Learning Morphological Segmentation of Egyptian Colloquial Arabic
I Can Has Cheezburger? A Nonparanormal Approach to Combining Textual and Visual Information for Predicting and Generating Popular Meme Descriptions
A VIEW of Russian: Visual Input Enhancement and Adaptive Feedback
Exploring Measures of “Readability” for Spoken Language: Analyzing linguistic features of subtitles to identify age-specific TV programs
Insights from Russian second language readability classification: complexity-dependent training requirements, and feature evaluation of multiple categories
On The Applicability of Readability Models to Web Texts
On Improving the Accuracy of Readability Classification using Insights from Second Language Acquisition
Enhancing Authentic Web Pages for Language Learners
Assessing the relative reading level of sentence pairs for text simplification
Readability Classification for German using Lexical, Syntactic, and Morphological Features
Entailment due to Syntactically Encoded Semantic Relationships
Two-Stage Stochastic Natural Language Generation for Email Synthesis by Modeling Sender Style and Topic Structure
Two-Stage Stochastic Email Synthesizer
Continuous fluency tracking and the challenges of varying text complexity
Discourse Coherence in the Wild: A Dataset, Evaluation and Methods
Interpretable Word Embedding Contextualization
Combining Shallow and Deep Learning for Aggressive Text Detection
Using Morphemes from Agglutinative Languages like Quechua and Finnish to Aid in Low-Resource Translation
SSMT: A Machine Translation Evaluation View To Paragraph-to-Sentence Semantic Similarity
Reusable workflows for gender prediction
Simple and Effective Paraphrastic Similarity from Parallel Translations
LTL-UDE at SemEval-2019 Task 6: BERT and Two-Vote Classification for Categorizing Offensiveness
Beyond BLEU: Training Neural Machine Translation with Semantic Similarity
STCP: Simplified-Traditional Chinese Conversion and Proofreading
Transliteration Better than Translation? Answering Code-mixed Questions over a Knowledge Base
Archivus: A Multimodal System for Multimedia Meeting Browsing and Retrieval
Phrase-Based Statistical Machine Translation: A Level of Detail Approach
Text Mining Techniques for Leveraging Positively Labeled Data
Exploring Word Embeddings for Unsupervised Textual User-Generated Content Normalization
Modeling Latent-Dynamic in Shallow Parsing: A Latent Conditional Model with Improved Inference
Degrees of Orality in Speech-like Corpora: Comparative Annotation of Chat and E-mail Corpora
CIEMPIESS: A New Open-Sourced Mexican Spanish Radio Corpus
A procedure assistant for astronauts in a functional programming architecture, with step previewing and spoken correction of dialogue moves
Transfer Learning for Entity Recognition of Novel Classes
Collectively Representing Semi-Structured Data from the Web
Rich Source-Side Context for Statistical Machine Translation
Studying the Effect of Input Size for Bayesian Word Segmentation on the Providence Corpus
Minimizing Word Error Rate in Textual Summaries of Spoken Language
More or less supervised supersense tagging of Twitter
Towards Automatic Topical Question Generation
The Structure and Generality of Spoken Route Instructions
Incorporating Vector Space Similarity in Random Walk Inference over Knowledge Bases
Efficient and Expressive Knowledge Base Completion Using Subgraph Feature Extraction
Mining Web Sites Using Unsupervised Adaptive Information Extraction
Improving Learning and Inference in a Large Knowledge-Base using Latent Syntactic Cues
Text Classification by Bootstrapping with Keywords, EM and Shrinkage
Using Domain Knowledge about Medications to Correct Recognition Errors in Medical Report Creation
CMU: Arc-Factored, Discriminative Semantic Dependency Parsing
Towards Automatic Description of Knowledge Components
學術會議資訊之擷取及其應用 (Information Extraction for Academic Conference and It’s Application) [In Chinese]
Automatic Recognition of Conversational Strategies in the Service of a Socially-Aware Dialog System
Untangling Text Data Mining
Joint Online Spoken Language Understanding and Language Modeling With Recurrent Neural Networks
Towards Data and Goal Oriented Analysis: Tool Inter-operability and Combinatorial Comparison
A Latent Variable Model for Geographic Lexical Variation
Estimating User Location in Social Media with Stacked Denoising Auto-encoders
Using Log-linear Models for Tuning Machine Translation Output
Towards Domain Adaptation for Parsing Web Data
Improving Translation Selection with Supersenses
SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining
Determining Term Subjectivity and Term Orientation for Opinion Mining
Supersense Tagging for Arabic: the MT-in-the-Middle Attack
Data-driven Measurement of Child Language Development with Simple Syntactic Templates
A Web-based Annotation Framework For Large-Scale Text Correction
How to Produce Unseen Teddy Bears: Improved Morphological Processing of Compounds in SMT
Cache-based Document-level Statistical Machine Translation
Instance Weighting for Neural Machine Translation Domain Adaptation
Classifier-Based Tense Model for SMT
Parametric Models of Linguistic Count Data
Web Mining for Unsupervised Classification
Structure-based Clustering of Novels
Linguistic Considerations in Automatic Question Generation
Towards Topic-to-Question Generation
Improving Machine Translation Performance by Exploiting Non-Parallel Corpora
Recognizing Emotions in Video Using Multimodal DNN Feature Fusion
Proceedings of the Fourth SIGdial Workshop of Discourse and Dialogue