NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning
Yifan Peng
|
Krishna C Puvvada
|
Zhehuai Chen
|
Piotr Zelasko
|
He Huang
|
Kunal Dhawan
|
Ke Hu
|
Shinji Watanabe
|
Jagadeesh Balam
|
Boris Ginsburg
|
Paper Details:
Month: April
Year: 2025
Location: Albuquerque, New Mexico
Venue:
NAACL |
Citations
URL
No Citations Yet
https://github.com/pyf98/NeMo_VoiceTextBlender
https://www.santaclaracounty.org/
https://www.ama.org/
https://www.nih.gov/
https://huggingface.co/google/gemma-2-27b-it
https://catalog.ngc.nvidia.com/orgs/
https://huggingface.co/datasets/Magpie-Align/
https://huggingface.co/nvidia/canary-1b
https://catalog.ngc.nvidia.com/orgs/
https://github.com/EleutherAI/
https://crfm
https://www.imda.gov.sg/how-we-can-help/
https://www.mllp.upv.es/git-pub/
https://huggingface.co/datasets/
https://commonvoice.mozilla.org/en/datasets
Field Of Study