NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Task
Corpus Development
Total Paper Mentions:- 115
First ACL Paper:- 1992
Latest ACL Paper:- 2019
Authors
Papers
Conferences
Amitava Das
Associated works : 3
Stephanie Strassel
Associated works : 3
James Pustejovsky
Associated works : 3
Eiichiro Sumita
Associated works : 3
Sivaji Bandyopadhyay
Associated works : 3
Mark Liberman
Associated works : 3
Amália Mendes
Associated works : 2
Mark Hepple
Associated works : 2
Udo Hahn
Associated works : 2
David Day
Associated works : 2
Manuela Sanguinetti
Associated works : 2
Sven Buechel
Associated works : 2
Ikechukwu Onyenwe
Associated works : 2
Serge Sharoff
Associated works : 2
Christopher Cieri
Associated works : 2
Cristina Bosco
Associated works : 2
Kazuaki Maeda
Associated works : 2
Niels Ole Bernsen
Associated works : 1
Barbara Plank
Associated works : 1
Diane Litman
Associated works : 1
Daniel C. Burnett
Associated works : 1
Preethum Prithviraj
Associated works : 1
Scott Waterman
Associated works : 1
Jerid Francom
Associated works : 1
Ron Cole
Associated works : 1
Charl van Heerden
Associated works : 1
Marie-Francine Moens
Associated works : 1
Valerie Nygaard
Associated works : 1
Shu-Kai Hsieh
Associated works : 1
Samer Al Moubayed
Associated works : 1
Ludmila DIMITROVA
Associated works : 1
Liisi Piits
Associated works : 1
Steven Bird
Associated works : 1
Alfan Farizki Wicaksono
Associated works : 1
Sabit Hassan
Associated works : 1
Hongzhi Xu
Associated works : 1
Jorge A. Wagner Filho
Associated works : 1
Adam Ussishkin
Associated works : 1
Patricia Robinson
Associated works : 1
Gary Krug
Associated works : 1
Baden Hughes
Associated works : 1
James H. Martin
Associated works : 1
Alessandro Mazzei
Associated works : 1
Peter Walgenbach
Associated works : 1
Katja Markert
Associated works : 1
Jitendra Jonnagaddala
Associated works : 1
Eric McCreath
Associated works : 1
Fernando Sánchez León
Associated works : 1
Roser Morante
Associated works : 1
Soman Kotti Padannayil
Associated works : 1
Lora Aroyo
Associated works : 1
Hanae Koiso
Associated works : 1
Huihsin Tseng
Associated works : 1
Myroslava Dzikovska
Associated works : 1
Mariana Neves
Associated works : 1
Tinghui Duan
Associated works : 1
Robert Voyer
Associated works : 1
Pavel Shkadzko
Associated works : 1
Aibek Makazhanov
Associated works : 1
Hong-Jie Dai
Associated works : 1
Mahsa Yarmohammadi
Associated works : 1
Ilham Fathy Saputra
Associated works : 1
Vinci Liu
Associated works : 1
Manfred Stede
Associated works : 1
Mike Maxwell
Associated works : 1
Lynne Fox
Associated works : 1
Ángel Martín Municio
Associated works : 1
Christian Bonkowski
Associated works : 1
Nancy IDE
Associated works : 1
Ambeswar Gogoi
Associated works : 1
Toshiyuki Takezawa
Associated works : 1
Eric Atwell
Associated works : 1
Yasuyuki Usuda
Associated works : 1
Veselin Stoyanov
Associated works : 1
Pauline W. Githinji
Associated works : 1
Stefani Allegretti
Associated works : 1
William Ogden
Associated works : 1
Qin Lu
Associated works : 1
Lawrence Hunter
Associated works : 1
Martin Johansson
Associated works : 1
Hong Li
Associated works : 1
Joel Tetreault
Associated works : 1
Jekaterina Novikova
Associated works : 1
Simone Teufel
Associated works : 1
Alexander Gruenstein
Associated works : 1
Tin-Shing Chiu
Associated works : 1
Antonio Jimeno Yepes
Associated works : 1
Jaco Badenhorst
Associated works : 1
Michel Généreux
Associated works : 1
Karin Verspoor
Associated works : 1
Damir Cavar
Associated works : 1
Ekaterina Rakhilina
Associated works : 1
Joachim Köhler
Associated works : 1
Ahmed Hussen Abdelaziz
Associated works : 1
Anastasia Vyrenkova
Associated works : 1
Jared Bernstein
Associated works : 1
Feiyu Xu
Associated works : 1
Kenji Imamura
Associated works : 1
Sameer Pradhan
Associated works : 1
Sebastian G.M. Händschke
Associated works : 1
Simple But Not Naïve: Fine-Grained Arabic Dialect Identification Using Only N-Grams
ACL-2019
WS-2019
Sohaila Eltanbouly |
May Bashendy |
Tamer Elsayed |
The MADAR Shared Task on Arabic Fine-Grained Dialect Identification
ACL-2019
WS-2019
Houda Bouamor |
Sabit Hassan |
Nizar Habash |
LIUM-MIRACL Participation in the MADAR Arabic Dialect Identification Shared Task
ACL-2019
WS-2019
Saméh Kchaou |
Fethi Bougares |
Lamia Hadrich-Belguith |
Word Familiarity Rate Estimation Using a Bayesian Linear Mixed Model
EMNLP-2019
WS-2019
Masayuki Asahara |
Multilingual Extension of PDTB-Style Annotation: The Case of TED Multilingual Discourse Bank
LREC-2018
Deniz Zeyrek |
Amália Mendes |
Murathan Kurfalı |
The brWaC Corpus: A New Open Resource for Brazilian Portuguese
LREC-2018
Jorge A. Wagner Filho |
Rodrigo Wilkens |
Marco Idiart |
Aline Villavicencio |
An Italian Twitter Corpus of Hate Speech against Immigrants
LREC-2018
Manuela Sanguinetti |
Fabio Poletto |
Cristina Bosco |
Viviana Patti |
Marco Stranisci |
Towards Continuous Dialogue Corpus Creation: writing to corpus and generating from it
LREC-2018
Andrei Malchanau |
Volha Petukhova |
Harry Bunt |
Construction of the Corpus of Everyday Japanese Conversation: An Interim Report
LREC-2018
Hanae Koiso |
Yasuharu Den |
Yuriko Iseki |
Wakako Kashino |
Yoshiko Kawabata |
Ken’ya Nishikawa |
Yayoi Tanaka |
Yasuyuki Usuda |
Developing the Bangla RST Discourse Treebank
LREC-2018
Debopam Das |
Manfred Stede |
Multilingual Parallel Corpus for Global Communication Plan
LREC-2018
Kenji Imamura |
Eiichiro Sumita |
Resource Interoperability for Sustainable Benchmarking: The Case of Events
LREC-2018
Chantal van Son |
Oana Inel |
Roser Morante |
Lora Aroyo |
Piek Vossen |
Keyphrases Extraction from User-Generated Contents in Healthcare Domain Using Long Short-Term Memory Networks
ACL-2018
WS-2018
Ilham Fathy Saputra |
Rahmad Mahendra |
Alfan Farizki Wicaksono |
Parallel Corpora for the Biomedical Domain
LREC-2018
Aurélie Névéol |
Antonio Jimeno Yepes |
Mariana Neves |
Karin Verspoor |
A Corpus of Corporate Annual and Social Responsibility Reports: 280 Million Tokens of Balanced Organizational Writing
ACL-2018
WS-2018
Sebastian G.M. Händschke |
Sven Buechel |
Jan Goldenstein |
Philipp Poschmann |
Tinghui Duan |
Peter Walgenbach |
Udo Hahn |
Sharing Copies of Synthetic Clinical Corpora without Physical Distribution — A Case Study to Get Around IPRs and Privacy Constraints Featuring the German JSYNCC Corpus
LREC-2018
Christina Lohr |
Sven Buechel |
Udo Hahn |
Transferred Embeddings for Igbo Similarity, Analogy, and Diacritic Restoration Tasks
COLING-2018
SemDeep-2018
WS-2018
Ignatius Ezeani |
Ikechukwu Onyenwe |
Mark Hepple |
Rollenwechsel-English: a large-scale semantic role corpus
LREC-2018
Asad Sayeed |
Pavel Shkadzko |
Vera Demberg |
Measuring the Limit of Semantic Divergence for English Tweets.
RANLP-2017
Dwijen Rudrapal |
Amitava Das |
Annotating Italian Social Media Texts in Universal Dependencies
DepLing-2017
WS-2017
Manuela Sanguinetti |
Cristina Bosco |
Alessandro Mazzei |
Alberto Lavelli |
Fabio Tamburini |
Global Open Resources and Information for Language and Linguistic Analysis (GORILLA)
LREC-2016
Damir Cavar |
Malgorzata Cavar |
Lwin Moe |
Introducing the Asian Language Treebank (ALT)
LREC-2016
Ye Kyaw Thu |
Win Pa Pa |
Masao Utiyama |
Andrew Finch |
Eiichiro Sumita |
An NLP Pipeline for Coptic
LaTeCH-2016
WS-2016
Amir Zeldes |
Caroline T. Schroeder |
The Teams Corpus and Entrainment in Multi-Party Spoken Dialogues
EMNLP-2016
Diane Litman |
Susannah Paletz |
Zahra Rahimi |
Stefani Allegretti |
Caitlin Rice |
A Tangled Web: The Faint Signals of Deception in Text - Boulder Lies and Truth Corpus (BLT-C)
LREC-2016
Franco Salvetti |
John B. Lowe |
James H. Martin |
Building a learner corpus for Russian
NLP4CALL-2016
WS-2016
Ekaterina Rakhilina |
Anastasia Vyrenkova |
Elmira Mustakimova |
Alina Ladygina |
Ivan Smirnov |
TwiSty: A Multilingual Twitter Stylometry Corpus for Gender and Personality Profiling
LREC-2016
Ben Verhoeven |
Walter Daelemans |
Barbara Plank |
PARC 3.0: A Corpus of Attribution Relations
LREC-2016
Silvia Pareti |
TMUNSW: Identification of Disorders and Normalization to SNOMED-CT Terminology in Unstructured Clinical Notes
*SEMEVAL-2015
Jitendra Jonnagaddala |
Siaw-Teng Liaw |
Pradeep Ray |
Manish Kumar |
Hong-Jie Dai |
Guest Editoral: Special Issue on Chinese as a Foreign Language
ROCLING/IJCLCLP-2015
Lung-Hao Lee |
Liang-Chih Yu |
Li-Ping Chang |
AMRITA_CEN@SemEval-2015: Paraphrase Detection for Twitter using Unsupervised Feature Learning with Recursive Autoencoders
*SEMEVAL-2015
Mahalakshmi Shanumuga Sundaram |
Anand Kumar Madasamy |
Soman Kotti Padannayil |
Word Alignment Based Parallel Corpora Evaluation and Cleaning Using Machine Learning Techniques
EAMT-2015
WS-2015
Ieva Zariņa |
Pēteris Ņikiforovs |
Raivis Skadiņš |
Why Chinese Web-as-Corpus is Wacky? Or: How Big Data is Killing Chinese Corpus Linguistics
LREC-2014
Shu-Kai Hsieh |
A Hybrid Approach to Features Representation for Fine-grained Arabic Named Entity Recognition
COLING-2014
Fahd Alotaibi |
Mark Lee |
Temporal Annotation in the Clinical Domain
TACL-2014
William F. Styler IV |
Steven Bethard |
Sean Finan |
Martha Palmer |
Sameer Pradhan |
Piet C de Groen |
Brad Erickson |
Timothy Miller |
Chen Lin |
Guergana Savova |
James Pustejovsky |
The Tutorbot Corpus — A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue
LREC-2014
Maria Koutsombogera |
Samer Al Moubayed |
Bajibabu Bollepalli |
Ahmed Hussen Abdelaziz |
Martin Johansson |
José David Aguas Lopes |
Jekaterina Novikova |
Catharine Oertel |
Kalin Stefanov |
Gül Varol |
Part-of-speech Tagset and Corpus Development for Igbo, an African Language
LAW-2014
WS-2014
Ikechukwu Onyenwe |
Chinedu Uchechukwu |
Mark Hepple |
Evaluation of different strategies for domain adaptation in opinion mining
LREC-2014
Anne Garcia-Fernandez |
Olivier Ferret |
Marco Dinarelli |
The NewSoMe Corpus: A Unifying Opinion Annotation Framework across Genres and in Multiple Languages
LREC-2014
Roser Saurí |
Judith Domingo |
Toni Badia |
Assembling the Kazakh Language Corpus
EMNLP-2013
Olzhas Makhambetov |
Aibek Makazhanov |
Zhandos Yessenbayev |
Bakhyt Matkarimov |
Islam Sabyrgaliyev |
Anuar Sharafudinov |
Native Language Identification: A Key N-gram Category Approach
BEA-2013
WS-2013
Kristopher Kyle |
Scott Crossley |
Jianmin Dai |
Danielle McNamara |
Lexicon Construction and Corpus Annotation of Historical Language with the CoBaLT Editor
LaTeCH-2012
WS-2012
Tom Kenter |
Tomaž Erjavec |
Maja Žorga Dulmin |
Darja Fišer |
A High-Quality Web Corpus of Czech
LREC-2012
Johanka Spoustová |
Miroslav Spousta |
Annotating Opinions in German Political News
LREC-2012
Hong Li |
Xiwen Cheng |
Kristina Adson |
Tal Kirshboim |
Feiyu Xu |
Beyond SoNaR: towards the facilitation of large corpus building efforts
LREC-2012
Martin Reynaert |
Ineke Schuurman |
Véronique Hoste |
Nelleke Oostdijk |
Maarten van Gompel |
DramaBank: Annotating Agency in Narrative Discourse
LREC-2012
David Elson |
Harvesting Parallel Text in Multiple Languages with Limited Supervision
COLING-2012
Luciano Barbosa |
Vivek Kumar Rangarajan Sridhar |
Mahsa Yarmohammadi |
Srinivas Bangalore |
From Grammar Rule Extraction to Treebanking: A Bootstrapping Approach
LREC-2012
Masood Ghayoomi |
LAMP: A Multimodal Web Platform for Collaborative Linguistic Analysis
LREC-2012
Kais Dukes |
Eric Atwell |
Introducing the Reference Corpus of Contemporary Portuguese Online
LREC-2012
Michel Généreux |
Iris Hendrickx |
Amália Mendes |
Iula2Standoff: a tool for creating standoff documents for the IULACT
LREC-2012
Carlos Morell |
Jorge Vivaldi |
Núria Bel |
A Grammar-informed Corpus-based Sentence Database for Linguistic and Computational Studies
LREC-2012
Hongzhi Xu |
Helen Kaiyun Chen |
Chu-Ren Huang |
Qin Lu |
Dingxu Shi |
Tin-Shing Chiu |
A Structured Approach for Building Assamese Corpus: Insights, Applications and Challenges
WS-2012
Shikhar Kr. Sarma |
Himadri Bharali |
Ambeswar Gogoi |
Ratul Deka |
Anup Kr. Barman |
A Coherence Model Based on Syntactic Patterns
CoNLL-2012
EMNLP-2012
Annie Louis |
Ani Nenkova |
Findings of the 2011 Workshop on Statistical Machine Translation
WMT-2011
WS-2011
Chris Callison-Burch |
Philipp Koehn |
Christof Monz |
Omar Zaidan |
Building frame-based corpus on the basis of ontological domain knowledge
BioNLP-2011
WS-2011
He Tan |
Rajaram Kaliyaperumal |
Nirupama Benis |
Topic-Based Bengali Opinion Summarization
COLING-2010
Amitava Das |
Sivaji Bandyopadhyay |
Fine-Grained Genre Classification Using Structural Learning Algorithms
ACL-2010
Zhili Wu |
Katja Markert |
Serge Sharoff |
Multilingual Question Generation
LREC-2010
Andrew Hickl |
Arnold Jung |
Ying Shi |
SemEval-2010 Task 7: Argument Selection and Coercion
*SEMEVAL-2010
James Pustejovsky |
Anna Rumshisky |
Alex Plotnick |
Elisabetta Jezek |
Olga Batiukova |
Valeria Quochi |
The Sign Linguistics Corpora Network: Towards Standards for Signed Language Resources
LREC-2010
Onno Crasborn |
Corpus Based Classification of Text in Australian Contracts
ALTA-2010
Michael Curtotti |
Eric McCreath |
Corpora for the Conceptualisation and Zoning of Scientific Papers
LREC-2010
Maria Liakata |
Simone Teufel |
Advaith Siddharthan |
Colin Batchelor |
How Specialized are Specialized Corpora? Behavioral Evaluation of Corpus Representativeness for Maltese.
LREC-2010
Jerid Francom |
Amy LaCross |
Adam Ussishkin |
Enhanced Infrastructure for Creation and Collection of Translation Resources
LREC-2010
Zhiyi Song |
Stephanie Strassel |
Gary Krug |
Kazuaki Maeda |
A Hybrid Model for Annotating Named Entity Training Corpora
LAW-2010
WS-2010
Robert Voyer |
Valerie Nygaard |
Will Fitzgerald |
Hannah Copperman |
SemanticNet-Perception of Human Pragmatics
WS-2010
Amitava Das |
Sivaji Bandyopadhyay |
Tense Sense Disambiguation: A New Syntactic Polysemy Task
EMNLP-2010
Roi Reichart |
Ari Rappoport |
Construction of Chinese Segmented and POS-tagged Conversational Corpora and Their Evaluations on Spontaneous Speech Recognitions
WS-2009
Xinhui Hu |
Ryosuke Isotani |
Satoshi Nakamura |
Collecting and Evaluating Speech Recognition Corpora for Nine Southern Bantu Languages
WS-2009
Jaco Badenhorst |
Charl van Heerden |
Marelie Davel |
Etienne Barnard |
Coupling an Annotated Corpus and a Morphosyntactic Lexicon for State-of-the-Art POS Tagging with Less Human Effort
PACLIC-2009
Pascal Denis |
Benoît Sagot |
Invited Talk: Cross Language Resource Sharing
IJCNLP-2008
Virach Sornlertlamvanich |
Development of Bengali Named Entity Tagged Corpus and its Use in NER Systems
IJCNLP-2008
Asif Ekbal |
Sivaji Bandyopadhyay |
RUNDKAST: an Annotated Norwegian Broadcast News Speech Corpus
LREC-2008
Ingunn Amdal |
Ole Morten Strand |
Jørn Almberg |
Torbjørn Svendsen |
The MoveOn Motorcycle Speech Corpus
LREC-2008
Thomas Winkler |
Theodoros Kostoulas |
Richard Adderley |
Christian Bonkowski |
Todor Ganchev |
Joachim Köhler |
Nikos Fakotakis |
Language Resources for Studying Argument
LREC-2008
Chris Reed |
Raquel Mochales Palau |
Glenn Rowe |
Marie-Francine Moens |
An Agreement Measure for Determining Inter-Annotator Reliability of Human Judgements on Affective Text
WS-2008
Plaban Kumar Bhowmick |
Anupam Basu |
Pabitra Mitra |
Multilingual Spoken Language Corpus Development for Communication Research
ROCLING/IJCLCLP-2007
Toshiyuki Takezawa |
Genichiro Kikui |
Masahide Mizushima |
Eiichiro Sumita |
Last Words: Googleology is Bad Science
CL-2007
Adam Kilgarriff |
Designing a Speech Corpus for Estonian Unit Selection Synthesis
NoDaLiDa-2007
WS-2007
Liisi Piits |
Meelis Mihkla |
Tõnis Nurk |
Indrek Kiissel |
H. C. Andersen Conversation Corpus
LREC-2006
Niels Ole Bernsen |
Laila Dybkjær |
Svend Kiilerich |
A framework for real-time dictionary updating
LREC-2006
Cédrick Fairon |
Sébastien Paumier |
A Grapheme-Based Approach for Accent Restoration in Gikuyu
LREC-2006
Peter W. Wagacha |
Guy De Pauw |
Pauline W. Githinji |
Corpus Development and Publication
LREC-2006
Andrew W. Cole |
Champollion: A Robust Parallel Text Sentence Aligner
LREC-2006
Xiaoyi Ma |
NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora
LREC-2006
John Niekrasz |
Alexander Gruenstein |
Frontiers in Linguistic Annotation for Lower-Density Languages
WS-2006
Mike Maxwell |
Baden Hughes |
Words and Word Usage: Newspaper Text versus the Web
ALTA-2005
Vinci Liu |
James R. Curran |
Corpus Design for Biomedical Natural Language Processing
WS-2005
K. Bretonnel Cohen |
Lynne Fox |
Philip V. Ogren |
Lawrence Hunter |
Multi-Perspective Question Answering Using the OpQA Corpus
EMNLP-2005
HLT-2005
Veselin Stoyanov |
Claire Cardie |
Janyce Wiebe |
A Progress Report from the Linguistic Data Consortium: Recent Activities in Resource Creation and Distribution and the Development of Tools and Standards
LREC-2004
Christopher Cieri |
Mark Liberman |
Discourse Annotation in the Monroe Corpus
WS-2004
Joel Tetreault |
Mary Swift |
Preethum Prithviraj |
Myroslava Dzikovska |
James Allen |
Towards Basic Categories for Describing Properties of Texts in a Corpus
LREC-2004
Serge Sharoff |
The American English SALA-II Data Collection
LREC-2004
Peter A. Heeman |
Creation of a Doctor-Patient Dialogue Corpus Using Standardized Patients
LREC-2004
Robert S. Melvin |
Win May |
Shrikanth Narayanan |
Panayiotis Georgiou |
Shadi Ganjavi |
CST Bank: A Corpus for the Study of Cross-document Structural Relationships
LREC-2004
Dragomir Radev |
Jahna Otterbacher |
Zhu Zhang |
A New Approach to the Corpus-based Statistical Investigation of Hungarian Multi-word Lexemes
LREC-2004
Balázs Kis |
Begoña Villada |
Gosse Bouma |
Gábor Ugray |
Tamás Bíró |
Gábor Pohl |
John Nerbonne |
Annotation Tools for Large-Scale Corpus Development: Using AGTK at the Linguistic Data Consortium
LREC-2004
Kazuaki Maeda |
Stephanie Strassel |
Multilingual Resources for Entity Extraction
WS-2003
Stephanie Strassel |
Alexis Mitchell |
An Integrated Term-Based Corpus Query System
EACL-2003
Irena Spasic |
Goran Nenadic |
Kostas Manios |
Sophia Ananiadou |
LREC
Associated works : 56
WS
Associated works : 28
EMNLP
Associated works : 6
ACL
Associated works : 6
COLING
Associated works : 5
HLT
Associated works : 5
ROCLING/IJCLCLP
Associated works : 3
*SEMEVAL
Associated works : 3
LaTeCH
Associated works : 2
ALTA
Associated works : 2
IJCNLP
Associated works : 2
LAW
Associated works : 2
RANLP
Associated works : 1
CoNLL
Associated works : 1
ANLP
Associated works : 1
CL
Associated works : 1
NLP4CALL
Associated works : 1
PACLIC
Associated works : 1
SIGHAN
Associated works : 1
TIPSTER
Associated works : 1
EACL
Associated works : 1
WMT
Associated works : 1
DepLing
Associated works : 1
TACL
Associated works : 1
EAMT
Associated works : 1
NoDaLiDa
Associated works : 1
BioNLP
Associated works : 1
BEA
Associated works : 1
SemDeep
Associated works : 1