NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers#Text-to-Speech#Microsoft#Generative Models#Audio#Paper#PDF·arxiv.org·Apr 30, 2023NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers