Improving Machine Translation Performance by Exploiting Non-Parallel Corpora
Dragos Stefan Munteanu
|
Daniel Marcu
|
Paper Details:
Year: 2005
Venue: CL
Citations
URL
Domain Adaptation for Statistical Machine Translation with Domain Dictionary and Monolingual Corpora
Hua Wu
|
Haifeng Wang
|
Chengqing Zong
|
An Empirical Study on Web Mining of Parallel Data
Gumwon Hong
|
Chi-Ho Li
|
Ming Zhou
|
Hae-Chang Rim
|
Large Scale Parallel Document Mining for Machine Translation
Jakob Uszkoreit
|
Jay Ponte
|
Ashok Popat
|
Moshe Dubiner
|
Contextual Modeling for Meeting Translation Using Unsupervised Word Sense Disambiguation
Mei Yang
|
Katrin Kirchhoff
|
Enhancing Multi-lingual Information Extraction via Cross-Media Inference and Fusion
Adam Lee
|
Marissa Passantino
|
Heng Ji
|
Guojun Qi
|
Thomas Huang
|
EM-based Hybrid Model for Bilingual Terminology Extraction from Comparable Corpora
Lianhau Lee
|
Aiti Aw
|
Min Zhang
|
Haizhou Li
|
Query Lattice for Translation Retrieval
Meiping Dong
|
Yong Cheng
|
Yang Liu
|
Jia Xu
|
Maosong Sun
|
Tatsuya Izuha
|
Jie Hao
|
Learning to translate from graded and negative relevance information
Laura Jehl
|
Stefan Riezler
|
Deep Neural Networks at the Service of Multilingual Parallel Sentence Extraction
Ahmad Aghaebrahimian
|
Extracting Parallel Sentences with Bidirectional Recurrent Neural Networks to Improve Machine Translation
Francis Grégoire
|
Philippe Langlais
|
Improving Statistical Machine Translation Performance by Training Data Selection and Optimization
Yajuan Lü
|
Jin Huang
|
Qun Liu
|
Language and Translation Model Adaptation using Comparable Corpora
Matthew Snover
|
Bonnie Dorr
|
Richard Schwartz
|
Improved Statistical Machine Translation Using Monolingually-Derived Paraphrases
Yuval Marton
|
Chris Callison-Burch
|
Philip Resnik
|
Translingual Document Representations from Discriminative Projections
John Platt
|
Kristina Toutanova
|
Wen-tau Yih
|
Example-Based Paraphrasing for Improved Phrase-Based Statistical Machine Translation
Aurélien Max
|
Improving Bilingual Projections via Sparse Covariance Matrices
Jagadeesh Jagarlamudi
|
Raghavendra Udupa
|
Hal Daumé III
|
Abhijit Bhole
|
Watermarking the Outputs of Structured Prediction with an application in Statistical Machine Translation.
Ashish Venugopal
|
Jakob Uszkoreit
|
David Talbot
|
Franz Och
|
Juri Ganitkevitch
|
Measuring the behavioral impact of machine translation quality improvements with A/B testing
Ben Russell
|
Duncan Gillespie
|
Fine-grained Coordinated Cross-lingual Text Stream Alignment for Endless Language Knowledge Acquisition
Tao Ge
|
Qing Dou
|
Heng Ji
|
Lei Cui
|
Baobao Chang
|
Zhifang Sui
|
Furu Wei
|
Ming Zhou
|
Fixing Translation Divergences in Parallel Corpora for Neural MT
MinhQuang Pham
|
Josep Crego
|
Jean Senellart
|
François Yvon
|
On the Use of Comparable Corpora to Improve SMT performance
Sadaf Abdul-Rauf
|
Holger Schwenk
|
Detecting Highly Confident Word Translations from Comparable Corpora without Any Prior Knowledge
Ivan Vulić
|
Marie-Francine Moens
|
A Knowledge-based Representation for Cross-Language Document Retrieval and Categorization
Marc Franco-Salvador
|
Paolo Rosso
|
Roberto Navigli
|
Creating Sentence-Aligned Parallel Text Corpora from a Large Archive of Potential Parallel Text using BITS and Champollion
Kazuaki Maeda
|
Xiaoyi Ma
|
Stephanie Strassel
|
Enrichment of Bilingual Dictionary through News Stream Data
Ajay Dubey
|
Parth Gupta
|
Vasudeva Varma
|
Paolo Rosso
|
Dual Subtitles as Parallel Corpora
Shikun Zhang
|
Wang Ling
|
Chris Dyer
|
Constructing a Chinese—Japanese Parallel Corpus from Wikipedia
Chenhui Chu
|
Toshiaki Nakazawa
|
Sadao Kurohashi
|
Parallel Sentence Extraction from Comparable Corpora with Neural Network Features
Chenhui Chu
|
Raj Dabre
|
Sadao Kurohashi
|
Producing Monolingual and Parallel Web Corpora at the Same Time - SpiderLing and Bitextor’s Love Affair
Nikola Ljubešić
|
Miquel Esplà-Gomis
|
Antonio Toral
|
Sergio Ortiz Rojas
|
Filip Klubička
|
Exploiting a Large Strongly Comparable Corpus
Thierry Etchegoyhen
|
Andoni Azpeitia
|
Naiara Pérez
|
A Comparison of Pivot Methods for Phrase-Based Statistical Machine Translation
Masao Utiyama
|
Hitoshi Isahara
|
A Simple Sentence-Level Extraction Algorithm for Comparable Data
Christoph Tillmann
|
Jian-ming Xu
|
Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment
Jason R. Smith
|
Chris Quirk
|
Kristina Toutanova
|
Why Not Grab a Free Lunch? Mining Large Corpora for Parallel Sentences to Improve Translation Modeling
Ferhan Ture
|
Jimmy Lin
|
Bootstrapping Translation Detection and Sentence Extraction from Comparable Corpora
Kriste Krstovski
|
David Smith
|
Identifying Semantic Divergences in Parallel Text without Annotations
Yogarshi Vyas
|
Xing Niu
|
Marine Carpuat
|
Neural Machine Translation for Low Resource Languages using Bilingual Lexicon Induced from Comparable Corpora
Sree Harsha Ramesh
|
Krishna Prasad Sankaranarayanan
|
Extracting Parallel Sub-Sentential Fragments from Non-Parallel Corpora
Dragos Stefan Munteanu
|
Daniel Marcu
|
Benefits of the ‘Massively Parallel Rosetta Stone’: Cross-Language Information Retrieval with over 30 Languages
Peter Chew
|
Ahmed Abdelali
|
Mining Bilingual Data from the Web with Adaptively Learnt Patterns
Long Jiang
|
Shiquan Yang
|
Ming Zhou
|
Xiaohua Liu
|
Qingsheng Zhu
|
A Beam-Search Extraction Algorithm for Comparable Data
Christoph Tillmann
|
Crowdsourcing Translation: Professional Quality from Non-Professionals
Omar F. Zaidan
|
Chris Callison-Burch
|
From Bilingual Dictionaries to Interlingual Document Representations
Jagadeesh Jagarlamudi
|
Hal Daumé III
|
Raghavendra Udupa
|
Two Easy Improvements to Lexical Weighting
David Chiang
|
Steve DeNeefe
|
Michael Pust
|
Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information
Jinsong Su
|
Hua Wu
|
Haifeng Wang
|
Yidong Chen
|
Xiaodong Shi
|
Huailin Dong
|
Qun Liu
|
Cross-Lingual Mixture Model for Sentiment Classification
Xinfan Meng
|
Furu Wei
|
Xiaohua Liu
|
Ming Zhou
|
Ge Xu
|
Houfeng Wang
|
Dirt Cheap Web-Scale Parallel Text from the Common Crawl
Jason R. Smith
|
Herve Saint-Amand
|
Magdalena Plamada
|
Philipp Koehn
|
Chris Callison-Burch
|
Adam Lopez
|
Task Alternation in Parallel Sentence Retrieval for Twitter Translation
Felix Hieber
|
Laura Jehl
|
Stefan Riezler
|
Bilingual Data Cleaning for SMT using Graph-based Random Walk
Lei Cui
|
Dongdong Zhang
|
Shujie Liu
|
Mu Li
|
Ming Zhou
|
Are Two Heads Better than One? Crowdsourced Translation via a Two-Step Collaboration of Non-Professional Translators and Editors
Rui Yan
|
Mingkun Gao
|
Ellie Pavlick
|
Chris Callison-Burch
|
Automatic Detection of Multilingual Dictionaries on the Web
Gintarė Grigonytė
|
Timothy Baldwin
|
A Multi-media Approach to Cross-lingual Entity Knowledge Transfer
Di Lu
|
Xiaoman Pan
|
Nima Pourdamghani
|
Shih-Fu Chang
|
Heng Ji
|
Kevin Knight
|
Set-Theoretic Alignment for Comparable Corpora
Thierry Etchegoyhen
|
Andoni Azpeitia
|
An Unsupervised Method for Automatic Translation Memory Cleaning
Masoud Jalili Sabet
|
Matteo Negri
|
Marco Turchi
|
Eduard Barbu
|
Efficient Extraction of Pseudo-Parallel Sentences from Raw Monolingual Data Using Word Embeddings
Benjamin Marie
|
Atsushi Fujita
|
Filtering and Mining Parallel Data in a Joint Multilingual Space
Holger Schwenk
|
Translating a Language You Don’t Know In the Chinese Room
Ulf Hermjakob
|
Jonathan May
|
Michael Pust
|
Kevin Knight
|
Phrase Table Induction Using In-Domain Monolingual Data for Domain Adaptation in Statistical Machine Translation
Benjamin Marie
|
Atsushi Fujita
|
Ensembles of Classifiers for Cleaning Web Parallel Corpora and Translation Memories
Eduard Barbu
|
HDU: Cross-lingual Textual Entailment with SMT Features
Katharina Wäschle
|
Sascha Fendrich
|
Frontiers in Linguistic Annotation for Lower-Density Languages
Mike Maxwell
|
Baden Hughes
|
Evaluation of the Bible as a Resource for Cross-Language Information Retrieval
Peter A. Chew
|
Steve J. Verzi
|
Travis L. Bauer
|
Jonathan T. McClain
|
Utilizing Citations of Foreign Words in Corpus-Based Dictionary Generation
Reinhard Rapp
|
Michael Zock
|
The Noisier the Better: Identifying Multilingual Word Translations Using a Single Monolingual Corpus
Reinhard Rapp
|
Michael Zock
|
Andrew Trotman
|
Yue Xu
|
Mining Large-scale Parallel Corpora from Multilingual Patents: An English-Chinese example and its application to SMT
Bin Lu
|
Benjamin K. Tsou
|
Tao Jiang
|
Oi Yee Kwong
|
Jingbo Zhu
|
Extracting Parallel Fragments from Comparable Corpora for Data-to-text Generation
Anja Belz
|
Eric Kow
|
Learning Discriminative Projections for Text Similarity Measures
Wen-tau Yih
|
Kristina Toutanova
|
John C. Platt
|
Christopher Meek
|
Bilingual Lexicon Extraction from Comparable Corpora Enhanced with Parallel Corpora
Emmanuel Morin
|
Emmanuel Prochasson
|
Two Ways to Use a Noisy Parallel News Corpus for Improving Statistical Machine Translation
Souhir Gahbiche-Braham
|
Hélène Bonneau-Maynard
|
François Yvon
|
Paraphrase Fragment Extraction from Monolingual Comparable Corpora
Rui Wang
|
Chris Callison-Burch
|
Extracting Parallel Phrases from Comparable Data
Sanjika Hewavitharana
|
Stephan Vogel
|
Active Learning with Multiple Annotations for Comparable Data Classification Task
Vamshi Ambati
|
Sanjika Hewavitharana
|
Stephan Vogel
|
Jaime Carbonell
|
Comparable Fora
Johanka Spoustová
|
Miroslav Spousta
|
Unsupervised Alignment of Comparable Data and Text Resources
Anja Belz
|
Eric Kow
|
An Expectation Maximization Algorithm for Textual Unit Alignment
Radu Ion
|
Alexandru Ceauşu
|
Elena Irimia
|
A Minimally Supervised Approach for Detecting and Ranking Document Translation Pairs
Kriste Krstovski
|
David A. Smith
|
The Karlsruhe Institute of Technology Translation Systems for the WMT 2011
Teresa Herrmann
|
Mohammed Mediani
|
Jan Niehues
|
Alex Waibel
|
CMU Haitian Creole-English Translation System for WMT 2011
Sanjika Hewavitharana
|
Nguyen Bach
|
Qin Gao
|
Vamshi Ambati
|
Stephan Vogel
|
Measuring Comparability of Documents in Non-Parallel Corpora for Efficient Extraction of (Semi-)Parallel Translation Equivalents
Fangzhong Su
|
Bogdan Babych
|
Design of a hybrid high quality machine translation system
Bogdan Babych
|
Kurt Eberle
|
Johanna Geiß
|
Mireia Ginestí-Rosell
|
Anthony Hartley
|
Reinhard Rapp
|
Serge Sharoff
|
Martin Thomas
|
Aligning Bilingual Literary Works: a Pilot Study
Qian Yu
|
Aurélien Max
|
François Yvon
|
Twitter Translation using Translation-Based Cross-Lingual Retrieval
Laura Jehl
|
Felix Hieber
|
Stefan Riezler
|
Evaluating (and Improving) Sentence Alignment under Noisy Conditions
Omar Zaidan
|
Vishal Chowdhary
|
Cross-lingual WSD for Translation Extraction from Comparable Corpora
Marianna Apidianaki
|
Nikola Ljubešić
|
Darja Fišer
|
Chinese–Japanese Parallel Sentence Extraction from Quasi–Comparable Corpora
Chenhui Chu
|
Toshiaki Nakazawa
|
Sadao Kurohashi
|
Mining for Domain-specific Parallel Text from Wikipedia
Magdalena Plamadă
|
Martin Volk
|
Crowdsourcing High-Quality Parallel Data Extraction from Twitter
Wang Ling
|
Luís Marujo
|
Chris Dyer
|
Alan W. Black
|
Isabel Trancoso
|
DOCAL - Vicomtech’s Participation in the WMT16 Shared Task on Bilingual Document Alignment
Andoni Azpeitia
|
Thierry Etchegoyhen
|
BAD LUC@WMT 2016: a Bilingual Document Alignment Platform Based on Lucene
Laurent Jakubina
|
Philippe Langlais
|
Users and Data: The Two Neglected Children of Bilingual Natural Language Processing Research
Philippe Langlais
|
Weighted Set-Theoretic Alignment of Comparable Sentences
Andoni Azpeitia
|
Thierry Etchegoyhen
|
Eva Martínez Garcia
|
BUCC 2017 Shared Task: a First Attempt Toward a Deep Learning Framework for Identifying Parallel Sentences in Comparable Corpora
Francis Grégoire
|
Philippe Langlais
|
zNLP: Identifying Parallel Sentences in Chinese-English Comparable Corpora
Zheng Zhang
|
Pierre Zweigenbaum
|
Detecting Cross-Lingual Semantic Divergence for Neural Machine Translation
Marine Carpuat
|
Yogarshi Vyas
|
Xing Niu
|
Projection of Argumentative Corpora from Source to Target Languages
Ahmet Aker
|
Huangpan Zhang
|
MultiNews: A Web collection of an Aligned Multimodal and Multilingual Corpus
Haithem Afli
|
Pintu Lohar
|
Andy Way
|
Identification of Parallel Sentences in Comparable Monolingual Corpora from Different Registers
Rémi Cardon
|
Natalia Grabar
|
Effective Parallel Corpus Mining using Bilingual Sentence Embeddings
Mandy Guo
|
Qinlan Shen
|
Yinfei Yang
|
Heming Ge
|
Daniel Cer
|
Gustavo Hernandez Abrego
|
Keith Stevens
|
Noah Constant
|
Yun-Hsuan Sung
|
Brian Strope
|
Ray Kurzweil
|
STACC, OOV Density and N-gram Saturation: Vicomtech’s Participation in the WMT 2018 Shared Task on Parallel Corpus Filtering
Andoni Azpeitia
|
Thierry Etchegoyhen
|
Eva Martínez Garcia
|
Prompsit’s submission to WMT 2018 Parallel Corpus Filtering shared task
Víctor M. Sánchez-Cartagena
|
Marta Bañón
|
Sergio Ortiz-Rojas
|
Gema Ramírez
|
http://www.ldc.upenn.edu
http://www-2.cs.cmu.edu/lemur
http://www.fjoch.com/YASMET.html
http://www.nist.gov/speech/tests/mt
http://www.usc.edu/hpcc
Field Of Study
Linguistic Trends
Discourse
Task
Tagging
Information Retrieval
Question Answering
Machine Translation
Approach
Generative Model
Language
Multilingual
Chinese
English
French
Arabic
Dataset
News
Similar Papers
Creating a Large Multi-Layered Representational Repository of Linguistic Code Switched Arabic Data
Mona Diab
|
Mahmoud Ghoneim
|
Abdelati Hawwari
|
Fahad AlGhamdi
|
Nada AlMarwani
|
Mohamed Al-Badrashiny
|
Sentiment after Translation: A Case-Study on Arabic Social Media Posts
Mohammad Salameh
|
Saif Mohammad
|
Svetlana Kiritchenko
|
Natural Language Processing for Dialectical Arabic: A Survey
Abdulhadi Shoufan
|
Sumaya Alameri
|
Semi-supervised Structured Prediction with Neural CRF Autoencoder
Xiao Zhang
|
Yong Jiang
|
Hao Peng
|
Kewei Tu
|
Dan Goldwasser
|
A Survey of Arabic Named Entity Recognition and Classification
Khaled Shaalan
|