Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers [Paper] Chengyi Wang*, Â Sanyuan Chen*, Â Yu Wu*, Â Ziqiang Zhang, Â Long Zhou, Â Shujie Liu, Zhuo Chen, Â Yanqing Liu, Â Huaming Wang, Â Jinyu Li, Â Lei He, Â Sheng Zhao, Â Furu Wei Microsoft Abstract. We introduce a language modeling approach for text to speech synthesis (TTS). Specifically, we train a neural codec language m
{{#tags}}- {{label}}
{{/tags}}