NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Tommi Jauhiainen
Number of Papers:- 25
Number of Citations:- 24
First ACL Paper:- 2001
Latest ACL Paper:- 2024
Venues:-
NAACL
EACL
SIGUL
VarDial
NoDaLiDa
LT4VAR
WAC
WS
ParlaCLARIN
LREC
COLING
Co-Authors:-
Andrei Butnaru
Bharathi Raja Chakravarthi
Christoph Purschke
Chu Ren Huang
Dirk Hovy
Similar Authors:-
Julio Gonzales
Synny Diwakar
Jochen Schopp
Mariana Kaiseler
Eddi Gbery
2024
2022
2021
2020
2019
2018
2017
2016
2015
2001
Proceedings of the Eleventh Workshop on NLP for Similar Languages, Varieties, and Dialects (VarDial 2024)
VarDial
WS
Yves Scherrer |
Tommi Jauhiainen |
Nikola Ljubešić |
Marcos Zampieri |
Preslav Nakov |
Jörg Tiedemann |
Investigating Multilinguality in the Plenary Sessions of the Parliament of Finland with Automatic Language Identification
ParlaCLARIN
WS
Tommi Jauhiainen |
Jussi Piitulainen |
Erik Axelson |
Ute Dieckmann |
Mietta Lennes |
Jyrki Niemi |
Jack Rueter |
Krister Lindén |
Improving Language Coverage on HeLI-OTS
SIGUL
WS
Tommi Jauhiainen |
Krister Lindén |
HeLI-OTS, Off-the-shelf Language Identifier for Text
LREC
Tommi Jauhiainen |
Heidi Jauhiainen |
Krister Lindén |
Italian Language and Dialect Identification and Regional French Variety Detection using Adaptive Naive Bayes
COLING
VarDial
WS
Tommi Jauhiainen |
Heidi Jauhiainen |
Krister Lindén |
Proceedings of the Ninth Workshop on NLP for Similar Languages, Varieties and Dialects
COLING
VarDial
WS
Yves Scherrer |
Tommi Jauhiainen |
Nikola Ljubešić |
Preslav Nakov |
Jörg Tiedemann |
Marcos Zampieri |
Findings of the VarDial Evaluation Campaign 2021
EACL
VarDial
Bharathi Raja Chakravarthi |
Gaman Mihaela |
Radu Tudor Ionescu |
Heidi Jauhiainen |
Tommi Jauhiainen |
Krister Lindén |
Nikola Ljubešić |
Niko Partanen |
Ruba Priyadharshini |
Christoph Purschke |
Eswari Rajagopal |
Yves Scherrer |
Marcos Zampieri |
Naive Bayes-based Experiments in Romanian Dialect Identification
EACL
VarDial
Tommi Jauhiainen |
Heidi Jauhiainen |
Krister Lindén |
Proceedings of the Eighth Workshop on NLP for Similar Languages, Varieties and Dialects
EACL
VarDial
Marcos Zampieri |
Preslav Nakov |
Nikola Ljubešić |
Jörg Tiedemann |
Yves Scherrer |
Tommi Jauhiainen |
Comparing Approaches to Dravidian Language Identification
EACL
VarDial
Tommi Jauhiainen |
Tharindu Ranasinghe |
Marcos Zampieri |
Experiments in Language Variety Geolocation and Dialect Identification
COLING
VarDial
Tommi Jauhiainen |
Heidi Jauhiainen |
Krister Lindén |
A Report on the VarDial Evaluation Campaign 2020
COLING
VarDial
Mihaela Gaman |
Dirk Hovy |
Radu Tudor Ionescu |
Heidi Jauhiainen |
Tommi Jauhiainen |
Krister Lindén |
Nikola Ljubešić |
Niko Partanen |
Christoph Purschke |
Yves Scherrer |
Marcos Zampieri |
Uralic Language Identification (ULI) 2020 shared task dataset and the Wanca 2017 corpora
COLING
VarDial
Tommi Jauhiainen |
Heidi Jauhiainen |
Niko Partanen |
Krister Lindén |
Building Web Corpora for Minority Languages
LREC
WAC
WS
Heidi Jauhiainen |
Tommi Jauhiainen |
Krister Lindén |
A Report on the Third VarDial Evaluation Campaign
NAACL
WS
Marcos Zampieri |
Shervin Malmasi |
Yves Scherrer |
Tanja Samardžić |
Francis Tyers |
Miikka Silfverberg |
Natalia Klyueva |
Tung-Le Pan |
Chu-Ren Huang |
Radu Tudor Ionescu |
Andrei M. Butnaru |
Tommi Jauhiainen |
Language and Dialect Identification of Cuneiform Texts
NAACL
WS
Tommi Jauhiainen |
Heidi Jauhiainen |
Tero Alstola |
Krister Lindén |
Discriminating between Mandarin Chinese and Swiss-German varieties using adaptive language models
NAACL
WS
Tommi Jauhiainen |
Krister Lindén |
Heidi Jauhiainen |
Iterative Language Model Adaptation for Indo-Aryan Language Identification
COLING
VarDial
WS
Tommi Jauhiainen |
Heidi Jauhiainen |
Krister Lindén |
HeLI-based Experiments in Discriminating Between Dutch and Flemish Subtitles
COLING
VarDial
WS
Tommi Jauhiainen |
Heidi Jauhiainen |
Krister Lindén |
HeLI-based Experiments in Swiss German Dialect Identification
COLING
VarDial
WS
Tommi Jauhiainen |
Heidi Jauhiainen |
Krister Lindén |
Evaluation of language identification methods using 285 languages
NoDaLiDa
WS
Tommi Jauhiainen |
Krister Lindén |
Heidi Jauhiainen |
Evaluating HeLI with Non-Linear Mappings
VarDial
WS
Tommi Jauhiainen |
Krister Lindén |
Heidi Jauhiainen |
HeLI, a Word-Based Backoff Method for Language Identification
VarDial
WS
Tommi Jauhiainen |
Krister Lindén |
Heidi Jauhiainen |
Discriminating Similar Languages with Token-Based Backoff
LT4VAR
WS
Tommi Jauhiainen |
Heidi Jauhiainen |
Krister Lindén |
Using existing written language analyzers in understanding natural spoken Finnish
NoDaLiDa
WS
Tommi Jauhiainen |
Linguistic
Task
Approach
Language
Dataset Type
.