NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
A Multilingual Dataset for Evaluating Parallel Sentence Extraction from Comparable Corpora
Pierre Zweigenbaum
|
Serge Sharoff
|
Reinhard Rapp
|
Paper Details:
Month: May
Year: 2018
Location: Miyazaki, Japan
Venue:
LREC |
Citations
URL
No Citations Yet
http://ftp.acc.umu.se/mirror/wikimedia.org/dumps/
http://www.casmacat.eu/corpus/news-commentary.html
https://github.com/attardi/wikiextractor
https://github.com/panyang/MeCab-Chinese
https://github.com/fxsjy/jieba
https://comparable.limsi.fr/bucc2017/bucc2017-task
Field Of Study
Task
Semantic Similarity
Machine Translation
Language
Chinese
English
French
Dataset
News
Similar Papers
Natural Language Processing for Dialectical Arabic: A Survey
Abdulhadi Shoufan
|
Sumaya Alameri
|
Enriching Word Vectors with Subword Information
Piotr Bojanowski
|
Edouard Grave
|
Armand Joulin
|
Tomas Mikolov
|
Creating a Large Multi-Layered Representational Repository of Linguistic Code Switched Arabic Data
Mona Diab
|
Mahmoud Ghoneim
|
Abdelati Hawwari
|
Fahad AlGhamdi
|
Nada AlMarwani
|
Mohamed Al-Badrashiny
|
Semi-supervised Structured Prediction with Neural CRF Autoencoder
Xiao Zhang
|
Yong Jiang
|
Hao Peng
|
Kewei Tu
|
Dan Goldwasser
|
A Survey of Arabic Named Entity Recognition and Classification
Khaled Shaalan
|