NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Tomaz Erjavec
Number of Papers:- 50
Number of Citations:- 74
First ACL Paper:- 1990
Latest ACL Paper:- 2024
Venues:-
LaTeCH
EMNLP
BSNLP
NLP+CSS
COLING
RANLP
ALW
MTSummit
WS
EAMT
ParlaCLARIN
LAW
ACL
LREC
Co-Authors:-
Adam Kilgarriff
Alejandro Bia
Ales Tavcar
Andraz Repar
Andrej Pancur
Similar Authors:-
Synny Diwakar
Jochen Schopp
Mariana Kaiseler
Eddi Gbery
Elia Yuste
2024
2022
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2008
2006
2004
2003
2002
2001
2000
1999
1998
1990
Multilingual Power and Ideology identification in the Parliament: a reference dataset and simple baselines
ParlaCLARIN
WS
Çağrı Çöltekin |
Matyáš Kopp |
Meden Katja |
Vaidas Morkevicius |
Nikola Ljubešić |
Tomaž Erjavec |
ParlaMint II: The Show Must Go On
LREC
ParlaCLARIN
Maciej Ogrodniczuk |
Petya Osenova |
Tomaž Erjavec |
Darja Fišer |
Nikola Ljubešić |
Çağrı Çöltekin |
Matyáš Kopp |
Meden Katja |
Dealing with Abbreviations in the Slovenian Biographical Lexicon
EMNLP
Angel Daza |
Antske Fokkens |
Tomaž Erjavec |
The siParl corpus of Slovene parliamentary proceedings
LREC
ParlaCLARIN
WS
Andrej Pancur |
Tomaž Erjavec |
Gigafida 2.0: The Reference Corpus of Written Standard Slovene
LREC
Simon Krek |
Špela Arhar Holdt |
Tomaž Erjavec |
Jaka Čibej |
Andraz Repar |
Polona Gantar |
Nikola Ljubešić |
Iztok Kosem |
Kaja Dobrovoljc |
Proceedings of the 7th Workshop on Balto-Slavic Natural Language Processing
ACL
WS
Tomaž Erjavec |
Michał Marcińczuk |
Preslav Nakov |
Jakub Piskorski |
Lidia Pivovarova |
Jan Šnajder |
Josef Steinberger |
Roman Yangarber |
Improving UD processing via satellite resources for morphology
WS
Kaja Dobrovoljc |
Tomaž Erjavec |
Nikola Ljubešić |
CLARIN’s Key Resource Families
LREC
Darja Fišer |
Jakob Lenardič |
Tomaž Erjavec |
Datasets of Slovene and Croatian Moderated News Comments
EMNLP
WS
Nikola Ljubešić |
Tomaž Erjavec |
Darja Fišer |
Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing
BSNLP
WS
Tomaž Erjavec |
Jakub Piskorski |
Lidia Pivovarova |
Jan Šnajder |
Josef Steinberger |
Roman Yangarber |
The Universal Dependencies Treebank for Slovenian
BSNLP
WS
Kaja Dobrovoljc |
Tomaž Erjavec |
Simon Krek |
Adapting a State-of-the-Art Tagger for South Slavic Languages to Non-Standard Text
BSNLP
WS
Nikola Ljubešić |
Tomaž Erjavec |
Darja Fišer |
Language-independent Gender Prediction on Twitter
NLP+CSS
WS
Nikola Ljubešić |
Darja Fišer |
Tomaž Erjavec |
Legal Framework, Dataset and Annotation Schema for Socially Unacceptable Online Discourse Practices in Slovene
ALW
WS
Darja Fišer |
Tomaž Erjavec |
Nikola Ljubešić |
Corpus vs. Lexicon Supervision in Morphosyntactic Tagging: the Case of Slovene
LREC
Nikola Ljubešić |
Tomaž Erjavec |
Corpus-Based Diacritic Restoration for South Slavic Languages
LREC
Nikola Ljubešić |
Tomaž Erjavec |
Darja Fišer |
Predicting the Level of Text Standardness in User-generated Content
RANLP
Nikola Ljubešić |
Darja Fišer |
Tomaž Erjavec |
Jaka Čibej |
Dafne Marko |
Senja Pollak |
Iza Škrjanec |
sloWCrowd: A crowdsourcing tool for lexicographic tasks
LREC
Darja Fišer |
Aleš Tavčar |
Tomaž Erjavec |
TweetCaT: a tool for building Twitter corpora of smaller languages
LREC
Nikola Ljubešić |
Darja Fišer |
Tomaž Erjavec |
Modernizing historical Slovene words with character-based SMT
WS
Yves Scherrer |
Tomaž Erjavec |
The goo300k corpus of historical Slovene
LREC
Tomaž Erjavec |
Lexicon Construction and Corpus Annotation of Historical Language with the CoBaLT Editor
LaTeCH
WS
Tom Kenter |
Tomaž Erjavec |
Maja Žorga Dulmin |
Darja Fišer |
OWL/DL formalization of the MULTEXT-East morphosyntactic specifications
LAW
WS
Christian Chiarcos |
Tomaž Erjavec |
Automatic linguistic annotation of historical language: ToTrTaLe and XIX century Slovene
LaTeCH
WS
Tomaž Erjavec |
MULTEXT-East Version 4: Multilingual Morphosyntactic Specifications, Lexicons and Corpora
LREC
Tomaž Erjavec |
The JOS Linguistically Tagged Corpus of Slovene
LREC
Tomaž Erjavec |
Darja Fišer |
Simon Krek |
Nina Ledinek |
Experimental Deployment of a Grid Virtual Organization for Human Language Technologies
LREC
Jan Jona Javoršek |
Tomaž Erjavec |
The JOS Morphosyntactically Tagged Corpus of Slovene
LREC
Tomaž Erjavec |
Simon Krek |
Designing and Evaluating a Russian Tagset
LREC
Serge Sharoff |
Mikhail Kopotev |
Tomaž Erjavec |
Anna Feldman |
Dagmar Divjak |
Towards a Slovene Dependency Treebank
LREC
Sašo Džeroski |
Tomaž Erjavec |
Nina Ledinek |
Petr Pajas |
Zdenek Žabokrtsky |
Andreja Žele |
The English-Slovene ACQUIS corpus
LREC
Tomaž Erjavec |
Building Slovene WordNet
LREC
Tomaž Erjavec |
Darja Fišer |
The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages
LREC
Ralf Steinberger |
Bruno Pouliquen |
Anna Widiger |
Camelia Ignat |
Tomaž Erjavec |
Dan Tufiş |
Dániel Varga |
Making an XML-based Japanese-Slovene Learners’ Dictionary
LREC
Tomaž Erjavec |
Kristina Hmeljak Sangawa |
Irena Srdanović |
Anton ml. Vahčič |
MULTEXT-East Version 3: Multilingual Morphosyntactic Specifications, Lexicons and Corpora
LREC
Tomaž Erjavec |
Migrating Language Resources from SGML to XML: The Text Encoding Initiative Recommendations
LREC
Syd Bauman |
Alejandro Bia |
Lou Burnard |
Tomaž Erjavec |
Christine Ruotolo |
Susan Schreibman |
Towards an International Standard on Feature Structure Representation
LREC
Kiyong Lee |
Lou Burnard |
Laurent Romary |
Eric de la Clergerie |
Thierry Declerck |
Syd Bauman |
Harry Bunt |
Lionel Clément |
Tomaž Erjavec |
Azim Roussanaly |
Claude Roux |
Encoding Biomedical Resources in TEI: The Case of the GENIA Corpus
WS
Tomaz Erjavec |
Jin-Dong Kim |
Tomoko Ohta |
Yuka Tateisi |
Jun’ichi Tsujii |
Stretching TEI: Converting the Genia Corpus
WS
Tomaz Erjavec |
Jin-Dong Kim |
Tomoko Ohta |
Yuka Tateisi |
Jun-ichi Tsujii |
The MULTEXT-East Morphosyntactic Specification for Slavic Languages
WS
Tomaž Erjavec |
Cvetana Krstev |
Vladimír Petkevič |
Kiril Simov |
Marko Tadić |
Duško Vitas |
Sense Discrimination with Parallel Corpora
WS
Nancy Ide |
Tomaz Erjavec |
Dan Tufis |
The TELRI tool catalogue: structure and prospects
WS
Tomaž Erjavec |
Tamás Váradi |
Morphosyntactic Tagging of Slovene: Evaluating Taggers and Tagsets
LREC
Sašo Džeroski |
Tomaž Erjavec |
Jakub Zavrel |
Corpora of Slovene Spoken Language for Multi-lingual Applications
LREC
Jerneja Gros |
France Mihelič |
Simon Dobrišek |
Tomaž Erjavec |
Mario Žganec |
The Concede Model for Lexical Databases
LREC
Tomaž Erjavec |
Roger Evans |
Nancy Ide |
Adam Kilgarriff |
Slovene–English Datasets for MT
EAMT
Tomaž Erjavec |
The ELAN Slovene-English aligned corpus
MTSummit
Tomaz Erjavec |
Multext-East: Parallel and Comparable Corpora and Lexicons for Six Central and Eastern European Languages
COLING
Ludmila DIMITROVA |
Tomaz ERJAVEC |
Nancy IDE |
Heiki Jaan KAALEP |
Vladimir PETKEVIC |
Dan TUFIS |
Multext-East: Parallel and Comparable Corpora and Lexicons for Six Central and Eastern European Languages
ACL
COLING
Ludmila Dimitrova |
Tomaz Erjavec |
Nancy Ide |
Heiki Jaan Kaalep |
Vladimir Petkevic |
Dan Tufis |
AN INTEGRATED SYSTEM FOR MORPHOLOGICAL ANALYSIS OF THE SLOVENE LANGUAGE
COLING
Tomaz Erjavec |
Peter Tancig |
Linguistic
Task
Approach
Language
Dataset Type
.