NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
From English-Centric to Effective Bilingual: LLMs with Custom Tokenizers for Underrepresented Languages
Artur Kiulian
|
Anton Polishko
|
Mykola Khandoga
|
Yevhen Kostiuk
|
Guillermo Gabrielli
|
Łukasz Gagała
|
Fadi Zaraket
|
Qusai Abu Obaida
|
Hrishikesh Garud
|
Wendy Wing Yee Mak
|
Dmytro Chaplynskyi
|
Selma Amor
|
Grigol Peradze
|
Paper Details:
Month: July
Year: 2025
Location: Vienna, Austria (online)
Venue:
UNLP |
WS |
Citations
URL
No Citations Yet
https://huggingface.co/collections/PolyAgent/
https://dohadictionary.org/
https://github.com/PolyAgent/
https://www.wikipedia.org
https://huggingface.co/EvanD/
https://spacy.io/models/uk#uk_core_news_lg
https://www.fda.gov/food/
https://github.com/PolyAgent/
https://huggingface.co/megantosh/
https://huggingface.co/ychenNLP/
https://huggingface.co/CAMeL-Lab/
Field Of Study