Mining Very-Non-Parallel Corpora: Parallel Sentence and Lexicon Extraction via Bootstrapping and E
Pascale Fung, Percy Cheung
Paper Details:
Month: July
Year: 2004
Location: Barcelona, Spain
Venue: EMNLP, WS
SIG: SIGDAT
Citations
Retrieving Bilingual Verb-Noun Collocations by Integrating Cross-Language Category Hierarchies
Fumiyo Fukumoto, Yoshimi Suzuki, Kazuyuki Yamashita

An Empirical Study on Web Mining of Parallel Data
Gumwon Hong, Chi-Ho Li, Ming Zhou, Hae-Chang Rim

EM-based Hybrid Model for Bilingual Terminology Extraction from Comparable Corpora
Lianhau Lee, Aiti Aw, Min Zhang, Haizhou Li

Inducing Bilingual Lexica From Non-Parallel Data With Earth Mover’s Distance Regularization
Meng Zhang, Yang Liu, Huanbo Luan, Yiqun Liu, Maosong Sun

Joint Morphological and Syntactic Disambiguation
Shay B. Cohen, Noah A. Smith

A Study on Bootstrapping Bilingual Vector Spaces from Non-Parallel Data (and Nothing Else)
Ivan Vulić, Marie-Francine Moens

Fixing Translation Divergences in Parallel Corpora for Neural MT
MinhQuang Pham, Josep Crego, Jean Senellart, François Yvon

On the Use of Comparable Corpora to Improve SMT performance
Sadaf Abdul-Rauf, Holger Schwenk

Detecting Highly Confident Word Translations from Comparable Corpora without Any Prior Knowledge
Ivan Vulić, Marie-Francine Moens

Inversion Transduction Grammar Constraints for Mining Parallel Sentences from Quasi-Comparable Corpora
Dekai Wu, Pascale Fung

Context-Aware Smoothing for Neural Machine Translation
Kehai Chen, Rui Wang, Masao Utiyama, Eiichiro Sumita, Tiejun Zhao

Improving Machine Translation Performance by Exploiting Non-Parallel Corpora
Dragos Stefan Munteanu, Daniel Marcu

Exploiting a Large Strongly Comparable Corpus
Thierry Etchegoyhen, Andoni Azpeitia, Naiara Pérez

A Simple Sentence-Level Extraction Algorithm for Comparable Data
Christoph Tillmann, Jian-ming Xu

Automatic Parallel Fragment Extraction from Noisy Data
Jason Riesa, Daniel Marcu

Cross-Lingual Semantic Similarity of Words as the Similarity of Their Semantic Word Responses
Ivan Vulić, Marie-Francine Moens

Bootstrapping Translation Detection and Sentence Extraction from Comparable Corpora
Kriste Krstovski, David Smith

Extracting Parallel Sub-Sentential Fragments from Non-Parallel Corpora
Dragos Stefan Munteanu, Daniel Marcu

A Beam-Search Extraction Algorithm for Comparable Data
Christoph Tillmann

Agreement-based Learning of Parallel Lexicons and Phrases from Non-Parallel Corpora
Chunyang Liu, Yang Liu, Maosong Sun, Huanbo Luan, Heng Yu

Set-Theoretic Alignment for Comparable Corpora
Thierry Etchegoyhen, Andoni Azpeitia

Adversarial Training for Unsupervised Bilingual Lexicon Induction
Meng Zhang, Yang Liu, Huanbo Luan, Maosong Sun

Efficient Extraction of Pseudo-Parallel Sentences from Raw Monolingual Data Using Word Embeddings
Benjamin Marie, Atsushi Fujita

Extracting Lexically Divergent Paraphrases from Twitter
Wei Xu, Alan Ritter, Chris Callison-Burch, William B. Dolan, Yangfeng Ji

Phrase Table Induction Using In-Domain Monolingual Data for Domain Adaptation in Statistical Machine Translation
Benjamin Marie, Atsushi Fujita

Utilizing Contextually Relevant Terms in Bilingual Lexicon Extraction
Azniah Ismail, Suresh Manandhar

Bilingual Lexicon Extraction from Comparable Corpora Enhanced with Parallel Corpora
Emmanuel Morin, Emmanuel Prochasson

Paraphrase Fragment Extraction from Monolingual Comparable Corpora
Rui Wang, Chris Callison-Burch

Extracting Parallel Phrases from Comparable Data
Sanjika Hewavitharana, Stephan Vogel

Identifying Parallel Documents from a Large Bilingual Collection of Texts: Application to Parallel Article Extraction in Wikipedia.
Alexandre Patry, Philippe Langlais

An Expectation Maximization Algorithm for Textual Unit Alignment
Radu Ion, Alexandru Ceauşu, Elena Irimia

A Minimally Supervised Approach for Detecting and Ranking Document Translation Pairs
Kriste Krstovski, David A. Smith

Measuring Comparability of Documents in Non-Parallel Corpora for Efficient Extraction of (Semi-)Parallel Translation Equivalents
Fangzhong Su, Bogdan Babych

Twitter Translation using Translation-Based Cross-Lingual Retrieval
Laura Jehl, Felix Hieber, Stefan Riezler

Mining for Domain-specific Parallel Text from Wikipedia
Magdalena Plamadă, Martin Volk

Topic Models + Word Alignment = A Flexible Framework for Extracting Bilingual Dictionary from Comparable Corpus
Xiaodong Liu, Kevin Duh, Yuji Matsumoto

A Factory of Comparable Corpora from Wikipedia
Alberto Barrón-Cedeño, Cristina España-Bonet, Josu Boldoba, Lluís Màrquez

DOCAL - Vicomtech’s Participation in the WMT16 Shared Task on Bilingual Document Alignment
Andoni Azpeitia, Thierry Etchegoyhen

A Portable Method for Parallel and Comparable Document Alignment
Thierry Etchegoyhen, Andoni Azpeitia

Weighted Set-Theoretic Alignment of Comparable Sentences
Andoni Azpeitia, Thierry Etchegoyhen, Eva Martínez Garcia

Identification of Parallel Sentences in Comparable Monolingual Corpora from Different Registers
Rémi Cardon, Natalia Grabar

CLEAR – Simple Corpus for Medical French
Natalia Grabar, Rémi Cardon

URL: http://www.cs.unt.edu/~rada/wpt/
Field Of Study
Task: Information Retrieval, Machine Translation, Paraphrasing
Language: Multilingual, Chinese, English, Arabic
Dataset: News
Similar Papers
Natural Language Processing for Dialectical Arabic: A Survey
Abdulhadi Shoufan, Sumaya Alameri

Enriching Word Vectors with Subword Information
Piotr Bojanowski, Edouard Grave, Armand Joulin, Tomas Mikolov

Creating a Large Multi-Layered Representational Repository of Linguistic Code Switched Arabic Data
Mona Diab, Mahmoud Ghoneim, Abdelati Hawwari, Fahad AlGhamdi, Nada AlMarwani, Mohamed Al-Badrashiny

Semi-supervised Structured Prediction with Neural CRF Autoencoder
Xiao Zhang, Yong Jiang, Hao Peng, Kewei Tu, Dan Goldwasser

A Survey of Arabic Named Entity Recognition and Classification
Khaled Shaalan