NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Marta Banon
Number of Papers:- 7
Number of Citations:- 0
First ACL Paper:- 2018
Latest ACL Paper:- 2025
Venues:-
EAMT
WS
WMT
ACL
LREC
HumEval
EMNLP
Co-Authors:-
Amanda Myntti
Amir Kamran
Andrey Kutuzov
Antonio Toral
Barry Haddow
Similar Authors:-
Vidhisha Balachandran
Cheng Zhang
Enemouh Chioma
Tom Neckermann
2025
2022
2020
2018
An Expanded Massive Multilingual Dataset for High-Performance Language Technologies (HPLT)
ACL
Laurie Burchell |
Ona De Gibert Bonet |
Nikolay Arefyev |
Mikko Aulamo |
Marta Bañón |
Pinzhen Chen |
Mariia Fedorova |
Liane Guillou |
Barry Haddow |
Jan Hajič |
Jindřich Helcl |
Erik Henriksson |
Mateusz Klimaszewski |
Ville Komulainen |
Andrey Kutuzov |
Joona Kytöniemi |
Veronika Laippala |
Petter Mæhlum |
Bhavitvya Malik |
Farrokh Mehryary |
Vladislav Mikhailov |
Nikita Moghe |
Amanda Myntti |
Dayyán O’Brien |
Stephan Oepen |
Proyag Pal |
Jousia Piha |
Sampo Pyysalo |
Gema Ramírez-Sánchez |
David Samuel |
Pavel Stepachev |
Jörg Tiedemann |
Dušan Variš |
Tereza Vojtěchová |
Jaume Zaragoza-Bernabeu |
Human evaluation of web-crawled parallel corpora for machine translation
ACL
HumEval
Gema Ramírez-Sánchez |
Marta Bañón |
Jaume Zaragoza-Bernabeu |
Sergio Ortiz Rojas |
MaCoCu: Massive collection and curation of monolingual and bilingual data: focus on under-resourced languages
EAMT
Marta Bañón |
Miquel Esplà-Gomis |
Mikel L. Forcada |
Cristian García-Romero |
Taja Kuzman |
Nikola Ljubešić |
Rik van Noord |
Leopoldo Pla Sempere |
Gema Ramírez-Sánchez |
Peter Rupnik |
Vít Suchomel |
Antonio Toral |
Tobias van der Werff |
Jaume Zaragoza |
Bicleaner AI: Bicleaner Goes Neural
LREC
Jaume Zaragoza-Bernabeu |
Gema Ramírez-Sánchez |
Marta Bañón |
Sergio Ortiz Rojas |
ParaCrawl: Web-Scale Acquisition of Parallel Corpora
ACL
Marta Bañón |
Pinzhen Chen |
Barry Haddow |
Kenneth Heafield |
Hieu Hoang |
Miquel Esplà-Gomis |
Mikel L. Forcada |
Amir Kamran |
Faheem Kirefu |
Philipp Koehn |
Sergio Ortiz Rojas |
Leopoldo Pla Sempere |
Gema Ramírez-Sánchez |
Elsa Sarrías |
Marek Strelec |
Brian Thompson |
William Waites |
Dion Wiggins |
Jaume Zaragoza |
Bifixer and Bicleaner: two open-source tools to clean your parallel data
EAMT
Gema Ramírez-Sánchez |
Jaume Zaragoza-Bernabeu |
Marta Bañón |
Sergio Ortiz Rojas |
Prompsit’s submission to WMT 2018 Parallel Corpus Filtering shared task
EMNLP
WMT
WS
Víctor M. Sánchez-Cartagena |
Marta Bañón |
Sergio Ortiz-Rojas |
Gema Ramírez |
Linguistic
Task
Language
Dataset Type
.