NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Towards Controllable Speech Synthesis in the Era of Large Language Models: A Systematic Survey
Tianxin Xie
|
Yan Rong
|
Pengfei Zhang
|
Wenwu Wang
|
Li Liu
|
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://github.com/imxtx/
https://huggingface.co/datasets/
https://en
https://github.com/facebookresearch/fairseq/tree/main/examples/wav2vec#vq-wav2vec
https://github.com/facebookresearch/fairseq/tree/main/examples/wav2vec
https://github.com/facebookresearch/fairseq/tree/main/examples/hubert
https://github.com/openai/whisper
https://github.com/facebookresearch/fairseq/tree/main/examples/data2vec
https://huggingface.co/facebook/w2v-bert-2.0
https://github.com/wesbz/SoundStream
https://github.com/facebookresearch/encodec
https://github.com/yangdongchao/AcademiCodec
https://github.com/ZhangXInFD/SpeechTokenizer
https://github.com/descriptinc/descript-audio-codec
https://github.com/kyutai-labs/moshi
https://github.com/jishengpeng/WavTokenizer
Field Of Study