NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Nikola Ljubesic
Number of Papers:- 87
Number of Citations:- 311
First ACL Paper:- 2008
Latest ACL Paper:- 2024
Venues:-
WMT
EACL
SemEval
PEOPLES
RANLP
EAMT
TACL
LAW
SIGUL
VarDial
MWE
EMNLP
WASSA
BUCC
BSNLP
ParlaCLARIN
WNUT
WAC
WS
LREC
NLP+CSS
NAACL
ALW
ACL
LT4VAR
COLING
Co-Authors:-
Abigail Walsh
Adrian Gabriel Chifu
Agata Savary
Ahmed Ali
Alan Ramponi
Similar Authors:-
Eddi Gbery
Andrea Agili
Carl Camilleri
Manuel Zini
Manuela Yapomo
2024
2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2008
Proceedings of the Eleventh Workshop on NLP for Similar Languages, Varieties, and Dialects (VarDial 2024)
VarDial
WS
Yves Scherrer |
Tommi Jauhiainen |
Nikola Ljubešić |
Marcos Zampieri |
Preslav Nakov |
Jörg Tiedemann |
JSI and WüNLP at the DIALECT-COPA Shared Task: In-Context Learning From Just a Few Dialectal Examples Gets You Quite Far
VarDial
WS
Nikola Ljubešić |
Taja Kuzman |
Peter Rupnik |
Ivan Vulić |
Fabian Schmidt |
Goran Glavaš |
Multilingual Power and Ideology identification in the Parliament: a reference dataset and simple baselines
ParlaCLARIN
WS
Çağrı Çöltekin |
Matyáš Kopp |
Meden Katja |
Vaidas Morkevicius |
Nikola Ljubešić |
Tomaž Erjavec |
Geographic Adaptation of Pretrained Language Models
TACL
Valentin Hofmann |
Goran Glavaš |
Nikola Ljubešić |
Janet B. Pierrehumbert |
Hinrich Schütze |
DIALECT-COPA: Extending the Standard Translations of the COPA Causal Commonsense Reasoning Dataset to South Slavic Dialects
VarDial
WS
Nikola Ljubešić |
Nada Galant |
Sonja Benčina |
Jaka Čibej |
Stefan Milosavljević |
Peter Rupnik |
Taja Kuzman |
Language Models on a Diet: Cost-Efficient Development of Encoders for Closely-Related Languages via Additional Pretraining
SIGUL
WS
Nikola Ljubešić |
Vít Suchomel |
Peter Rupnik |
Taja Kuzman |
Rik van Noord |
VarDial Evaluation Campaign 2024: Commonsense Reasoning in Dialects and Multi-Label Similar Language Identification
VarDial
WS
Adrian-Gabriel Chifu |
Goran Glavaš |
Radu Tudor Ionescu |
Nikola Ljubešić |
Aleksandra Miletić |
Filip Miletić |
Yves Scherrer |
Ivan Vulić |
Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark
NAACL
Stephen Mayhew |
Terra Blevins |
Shuheng Liu |
Marek Suppa |
Hila Gonen |
Joseph Marvin Imperial |
Börje Karlsson |
Peiqin Lin |
Nikola Ljubešić |
Lester James Miranda |
Barbara Plank |
Arij Riabi |
Yuval Pinter |
PARSEME corpus release 1.3
MWE
Agata Savary |
Cherifa Ben Khelil |
Carlos Ramisch |
Voula Giouli |
Verginica Barbu Mititelu |
Najet Hadj Mohamed |
Cvetana Krstev |
Chaya Liebeskind |
Hongzhi Xu |
Sara Stymne |
Tunga Güngör |
Thomas Pickard |
Bruno Guillaume |
Eduard Bejček |
Archna Bhatia |
Marie Candito |
Polona Gantar |
Uxoa Iñurrieta |
Albert Gatt |
Jolanta Kovalevskaite |
Timm Lichte |
Nikola Ljubešić |
Johanna Monti |
Carla Parra Escartín |
Mehrnoush Shamsfard |
Ivelina Stoyanova |
Veronika Vincze |
Abigail Walsh |
MaCoCu: Massive collection and curation of monolingual and bilingual data: focus on under-resourced languages
EAMT
Marta Bañón |
Miquel Esplà-Gomis |
Mikel L. Forcada |
Cristian García-Romero |
Taja Kuzman |
Nikola Ljubešić |
Rik van Noord |
Leopoldo Pla Sempere |
Gema Ramírez-Sánchez |
Peter Rupnik |
Vít Suchomel |
Antonio Toral |
Tobias van der Werff |
Jaume Zaragoza |
ParlaMint II: The Show Must Go On
LREC
ParlaCLARIN
Maciej Ogrodniczuk |
Petya Osenova |
Tomaž Erjavec |
Darja Fišer |
Nikola Ljubešić |
Çağrı Çöltekin |
Matyáš Kopp |
Meden Katja |
ParlaSpeech-HR - a Freely Available ASR Dataset for Croatian Bootstrapped from the ParlaMint Corpus
LREC
ParlaCLARIN
Nikola Ljubešić |
Danijel Koržinek |
Peter Rupnik |
Ivo-Pavao Jazbec |
The GINCO Training Dataset for Web Genre Identification of Documents Out in the Wild
LREC
Taja Kuzman |
Peter Rupnik |
Nikola Ljubešić |
Extending the SSJ Universal Dependencies Treebank for Slovenian: Was It Worth It?
LAW
LREC
Kaja Dobrovoljc |
Nikola Ljubešić |
Proceedings of the Ninth Workshop on NLP for Similar Languages, Varieties and Dialects
COLING
VarDial
WS
Yves Scherrer |
Tommi Jauhiainen |
Nikola Ljubešić |
Preslav Nakov |
Jörg Tiedemann |
Marcos Zampieri |
Findings of the VarDial Evaluation Campaign 2021
EACL
VarDial
Bharathi Raja Chakravarthi |
Gaman Mihaela |
Radu Tudor Ionescu |
Heidi Jauhiainen |
Tommi Jauhiainen |
Krister Lindén |
Nikola Ljubešić |
Niko Partanen |
Ruba Priyadharshini |
Christoph Purschke |
Eswari Rajagopal |
Yves Scherrer |
Marcos Zampieri |
Exploring Stylometric and Emotion-Based Features for Multilingual Cross-Domain Hate Speech Detection
EACL
WASSA
Ilia Markov |
Nikola Ljubešić |
Darja Fišer |
Walter Daelemans |
Social Media Variety Geolocation with geoBERT
EACL
VarDial
Yves Scherrer |
Nikola Ljubešić |
Proceedings of the Eighth Workshop on NLP for Similar Languages, Varieties and Dialects
EACL
VarDial
Marcos Zampieri |
Preslav Nakov |
Nikola Ljubešić |
Jörg Tiedemann |
Yves Scherrer |
Tommi Jauhiainen |
BERTić - The Transformer Language Model for Bosnian, Croatian, Montenegrin and Serbian
BSNLP
EACL
Nikola Ljubešić |
Davor Lauc |
Sesame Street to Mount Sinai: BERT-constrained character-level Moses models for multilingual lexical normalization
EMNLP
WNUT
Yves Scherrer |
Nikola Ljubešić |
MultiLexNorm: A Shared Task on Multilingual Lexical Normalization
EMNLP
WNUT
Rob van der Goot |
Alan Ramponi |
Arkaitz Zubiaga |
Barbara Plank |
Benjamin Muller |
Iñaki San Vicente Roncal |
Nikola Ljubešić |
Özlem Çetinoğlu |
Rahmad Mahendra |
Talha Çolakoğlu |
Timothy Baldwin |
Tommaso Caselli |
Wladimir Sidorenko |
Cultural Topic Modelling over Novel Wikipedia Corpora for South-Slavic Languages
RANLP
Filip Markoski |
Elena Markoska |
Nikola Ljubešić |
Eftim Zdravevski |
Ljupco Kocarev |
SemEval-2020 Task 3: Graded Word Similarity in Context
COLING
SemEval
Carlos Santos Armendariz |
Matthew Purver |
Senja Pollak |
Nikola Ljubešić |
Matej Ulčar |
Ivan Vulić |
Mohammad Taher Pilehvar |
HeLju@VarDial 2020: Social Media Variety Geolocation with BERT Models
COLING
VarDial
Yves Scherrer |
Nikola Ljubešić |
Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects
COLING
VarDial
Marcos Zampieri |
Preslav Nakov |
Nikola Ljubešić |
Jörg Tiedemann |
Yves Scherrer |
The LiLaH Emotion Lexicon of Croatian, Dutch and Slovene
COLING
PEOPLES
Nikola Ljubešić |
Ilia Markov |
Darja Fišer |
Walter Daelemans |
A Report on the VarDial Evaluation Campaign 2020
COLING
VarDial
Mihaela Gaman |
Dirk Hovy |
Radu Tudor Ionescu |
Heidi Jauhiainen |
Tommi Jauhiainen |
Krister Lindén |
Nikola Ljubešić |
Niko Partanen |
Christoph Purschke |
Yves Scherrer |
Marcos Zampieri |
Findings of the 2020 Conference on Machine Translation (WMT20)
EMNLP
WMT
Loïc Barrault |
Magdalena Biesialska |
Ondřej Bojar |
Marta R. Costa-jussà |
Christian Federmann |
Yvette Graham |
Roman Grundkiewicz |
Barry Haddow |
Matthias Huck |
Eric Joanis |
Tom Kocmi |
Philipp Koehn |
Chi-kiu Lo |
Nikola Ljubešić |
Christof Monz |
Makoto Morishita |
Masaaki Nagata |
Toshiaki Nakazawa |
Santanu Pal |
Matt Post |
Marcos Zampieri |
Gigafida 2.0: The Reference Corpus of Written Standard Slovene
LREC
Simon Krek |
Špela Arhar Holdt |
Tomaž Erjavec |
Jaka Čibej |
Andraz Repar |
Polona Gantar |
Nikola Ljubešić |
Iztok Kosem |
Kaja Dobrovoljc |
CoSimLex: A Resource for Evaluating Graded Word Similarity in Context
LREC
Carlos Santos Armendariz |
Matthew Purver |
Matej Ulčar |
Senja Pollak |
Nikola Ljubešić |
Mark Granroth-Wilding |
Proceedings of the Sixth Workshop on NLP for Similar Languages, Varieties and Dialects
NAACL
WS
Marcos Zampieri |
Preslav Nakov |
Shervin Malmasi |
Nikola Ljubešić |
Jörg Tiedemann |
Ahmed Ali |
What does Neural Bring? Analysing Improvements in Morphosyntactic Annotation and Lemmatisation of Slovenian, Croatian and Serbian
ACL
WS
Nikola Ljubešić |
Kaja Dobrovoljc |
Improving UD processing via satellite resources for morphology
WS
Kaja Dobrovoljc |
Tomaž Erjavec |
Nikola Ljubešić |
Bleaching Text: Abstract Features for Cross-lingual Gender Prediction
ACL
Rob van der Goot |
Nikola Ljubešić |
Ian Matroos |
Malvina Nissim |
Barbara Plank |
Predicting Concreteness and Imageability of Words Within and Across Languages via Word Embeddings
ACL
WS
Nikola Ljubešić |
Darja Fišer |
Anita Peti-Stantić |
Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018)
COLING
VarDial
WS
Marcos Zampieri |
Preslav Nakov |
Nikola Ljubešić |
Jörg Tiedemann |
Shervin Malmasi |
Ahmed Ali |
Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign
COLING
VarDial
WS
Marcos Zampieri |
Shervin Malmasi |
Preslav Nakov |
Ahmed Ali |
Suwon Shon |
James Glass |
Yves Scherrer |
Tanja Samardžić |
Nikola Ljubešić |
Jörg Tiedemann |
Chris van der Lee |
Stefan Grondelaers |
Nelleke Oostdijk |
Dirk Speelman |
Antal van den Bosch |
Ritesh Kumar |
Bornini Lahiri |
Mayank Jain |
Comparing CRF and LSTM performance on the task of morphosyntactic tagging of non-standard varieties of South Slavic languages
COLING
VarDial
WS
Nikola Ljubešić |
Datasets of Slovene and Croatian Moderated News Comments
EMNLP
WS
Nikola Ljubešić |
Tomaž Erjavec |
Darja Fišer |
Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial)
VarDial
WS
Preslav Nakov |
Marcos Zampieri |
Nikola Ljubešić |
Jörg Tiedemann |
Shevin Malmasi |
Ahmed Ali |
Findings of the VarDial Evaluation Campaign 2017
VarDial
WS
Marcos Zampieri |
Shervin Malmasi |
Nikola Ljubešić |
Preslav Nakov |
Ahmed Ali |
Jörg Tiedemann |
Yves Scherrer |
Noëmi Aepli |
Universal Dependencies for Serbian in Comparison with Croatian and Other Slavic Languages
BSNLP
WS
Tanja Samardžić |
Mirjana Starović |
Željko Agić |
Nikola Ljubešić |
Adapting a State-of-the-Art Tagger for South Slavic Languages to Non-Standard Text
BSNLP
WS
Nikola Ljubešić |
Tomaž Erjavec |
Darja Fišer |
Language-independent Gender Prediction on Twitter
NLP+CSS
WS
Nikola Ljubešić |
Darja Fišer |
Tomaž Erjavec |
Legal Framework, Dataset and Annotation Schema for Socially Unacceptable Online Discourse Practices in Slovene
ALW
WS
Darja Fišer |
Tomaž Erjavec |
Nikola Ljubešić |
TweetGeo - A Tool for Collecting, Processing and Analysing Geo-encoded Linguistic Data
COLING
Nikola Ljubešić |
Tanja Samardžić |
Curdin Derungs |
Corpus vs. Lexicon Supervision in Morphosyntactic Tagging: the Case of Slovene
LREC
Nikola Ljubešić |
Tomaž Erjavec |
Producing Monolingual and Parallel Web Corpora at the Same Time - SpiderLing and Bitextor’s Love Affair
LREC
Nikola Ljubešić |
Miquel Esplà-Gomis |
Antonio Toral |
Sergio Ortiz Rojas |
Filip Klubička |
Croatian Error-Annotated Corpus of Non-Professional Written Language
LREC
Vanja Štefanec |
Nikola Ljubešić |
Jelena Kuvač Kraljević |
Corpus-Based Diacritic Restoration for South Slavic Languages
LREC
Nikola Ljubešić |
Tomaž Erjavec |
Darja Fišer |
New Inflectional Lexicons and Training Corpora for Improved Morphosyntactic Annotation of Croatian and Serbian
LREC
Nikola Ljubešić |
Filip Klubička |
Željko Agić |
Ivo-Pavao Jazbec |
A Global Analysis of Emoji Usage
WAC
WS
Nikola Ljubešić |
Darja Fišer |
Dealing with Data Sparseness in SMT with Factured Models and Morphological Expansion: a Case Study on Croatian
EAMT
WS
Victor M. Sánchez-Cartagena |
Nikola Ljubešić |
Filip Klubička |
Collaborative Development of a Rule-Based Machine Translator between Croatian and Serbian
EAMT
WS
Filip Klubička |
Gema Ramírez-Sánchez |
Nikola Ljubešić |
Private or Corporate? Predicting User Types on Twitter
WNUT
WS
Nikola Ljubešić |
Darja Fišer |
Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial3)
VarDial
WS
Preslav Nakov |
Marcos Zampieri |
Liling Tan |
Nikola Ljubešić |
Jörg Tiedemann |
Shervin Malmasi |
Discriminating between Similar Languages and Arabic Dialect Identification: A Report on the Third DSL Shared Task
VarDial
WS
Shervin Malmasi |
Marcos Zampieri |
Nikola Ljubešić |
Preslav Nakov |
Ahmed Ali |
Jörg Tiedemann |
Enlarging Scarce In-domain English-Croatian Corpus for SMT of MOOCs Using Serbian
VarDial
WS
Maja Popović |
Kostadin Cholakov |
Valia Kordoni |
Nikola Ljubešić |
Predicting the Level of Text Standardness in User-generated Content
RANLP
Nikola Ljubešić |
Darja Fišer |
Tomaž Erjavec |
Jaka Čibej |
Dafne Marko |
Senja Pollak |
Iza Škrjanec |
Predicting Inflectional Paradigms and Lemmata of Unknown Words for Semi-automatic Expansion of Morphological Lexicons
RANLP
Nikola Ljubešić |
Miquel Esplà-Gomis |
Filip Klubička |
Nives Mikelić Preradović |
Abu-MaTran at WMT 2015 Translation Task: Morphological Segmentation and Web Crawling
WMT
WS
Raphael Rubino |
Tommi Pirinen |
Miquel Esplà-Gomis |
Nikola Ljubešić |
Sergio Ortiz-Rojas |
Vassilis Papavassiliou |
Prokopis Prokopidis |
Antonio Toral |
Abu-MaTran: Automatic building of Machine Translation
EAMT
WS
Antonio Toral |
Tommi A. Pirinen |
Andy Way |
Gema Ramírez-Sánchez |
Sergio Ortiz Rojas |
Raphael Rubino |
Miquel Esplà |
Mikel L. Forcada |
Vassilis Papavassiliou |
Prokopis Prokopidis |
Nikola Ljubešić |
Universal Dependencies for Croatian (that work for Serbian, too)
BSNLP
WS
Željko Agić |
Nikola Ljubešić |
Regional Linguistic Data Initiative (ReLDI)
BSNLP
WS
Tanja Samardžić |
Nikola Ljubešić |
Maja Miličević |
Proceedings of the Joint Workshop on Language Technology for Closely Related Languages, Varieties and Dialects
LT4VAR
WS
Preslav Nakov |
Marcos Zampieri |
Petya Osenova |
Liling Tan |
Cristina Vertan |
Nikola Ljubešić |
Jörg Tiedemann |
Overview of the DSL Shared Task 2015
LT4VAR
WS
Marcos Zampieri |
Liling Tan |
Nikola Ljubešić |
Jörg Tiedemann |
Preslav Nakov |
Abu-MaTran: Automatic building of Machine Translation
EAMT
Antonio Toral |
Tommi A Pirinen |
Andy Way |
Gema Ramírez-Sánchez |
Sergio Ortiz Rojas |
Raphael Rubino |
Miquel Esplà |
Mikel Forcada |
Vassilis Papavassiliou |
Prokopis Prokopidis |
Nikola Ljubešić |
Comparing two acquisition systems for automatically building an English—Croatian parallel corpus from multilingual websites
LREC
Miquel Esplà-Gomis |
Filip Klubička |
Nikola Ljubešić |
Sergio Ortiz-Rojas |
Vassilis Papavassiliou |
Prokopis Prokopidis |
The SETimes.HR Linguistically Annotated Corpus of Croatian
LREC
Željko Agić |
Nikola Ljubešić |
Quality Estimation for Synthetic Parallel Data Generation
LREC
Raphael Rubino |
Antonio Toral |
Nikola Ljubešić |
Gema Ramírez-Sánchez |
TweetCaT: a tool for building Twitter corpora of smaller languages
LREC
Nikola Ljubešić |
Darja Fišer |
Tomaž Erjavec |
caWaC – A web corpus of Catalan and its application to language modeling and machine translation
LREC
Nikola Ljubešić |
Antonio Toral |
{bs,hr,sr}WaC - Web Corpora of Bosnian, Croatian and Serbian
WAC
WS
Nikola Ljubešić |
Filip Klubička |
Exploring cross-language statistical machine translation for closely related South Slavic languages
LT4VAR
WS
Maja Popović |
Nikola Ljubešić |
Proceedings of the First Workshop on Applying NLP Tools to Similar Languages, Varieties and Dialects
VarDial
WS
Marcos Zampieri |
Liling Tan |
Nikola Ljubešić |
Jörg Tiedemann |
A Report on the DSL Shared Task 2014
VarDial
WS
Marcos Zampieri |
Liling Tan |
Nikola Ljubešić |
Jörg Tiedemann |
Lemmatization and Morphosyntactic Tagging of Croatian and Serbian
WS
Željko Agić |
Nikola Ljubešić |
Danijela Merkler |
Identifying false friends between closely related languages
WS
Nikola Ljubešić |
Darja Fišer |
Cross-lingual WSD for Translation Extraction from Comparable Corpora
BUCC
WS
Marianna Apidianaki |
Nikola Ljubešić |
Darja Fišer |
Efficient Discrimination Between Closely Related Languages
COLING
Jörg Tiedemann |
Nikola Ljubešić |
Addressing polysemy in bilingual lexicon extraction from comparable corpora
LREC
Darja Fišer |
Nikola Ljubešić |
Ozren Kubelka |
Bilingual lexicon extraction from comparable corpora for closely related languages
RANLP
Darja Fišer |
Nikola Ljubešić |
Building and Using Comparable Corpora for Domain-Specific Bilingual Lexicon Extraction
WS
Darja Fišer |
Nikola Ljubešić |
Špela Vintar |
Senja Pollak |
Building a Gold Standard for Event Detection in Croatian
LREC
Nikola Ljubešić |
Tomislava Lauc |
Damir Boras |
Towards Sentiment Analysis of Financial Texts in Croatian
LREC
Željko Agić |
Nikola Ljubešić |
Marko Tadić |
Generating a Morphological Lexicon of Organization Entity Names
LREC
Nikola Ljubešić |
Tomislava Lauc |
Damir Boras |
Linguistic
Task
Approach
Language
Dataset Type
.