Audio & Music

Audio & Music

615 bookmarks
Newest
VALL-E
VALL-E
VALL-E is a neural codec language model using discrete codes derived from an off-the-shelf neural audio codec model, and regard TTS as a conditional language modeling task rather. VALL-E emerges in-context learning capabilities and can be used to synthesize high-quality personalized speech with only a 3-second enrolled recording of an unseen speaker as a prompt. We also extend VALL-E and train a multi-lingual conditional codec language model. VALL-E X can generate high-quality speech in the target language via just one speech utterance in the source language as a prompt while preserving the unseen speaker’s voice, emotion, and acoustic environment.
·microsoft.com·
VALL-E
Voicery Text-to-Speech
Voicery Text-to-Speech
Voicery creates natural-sounding Text-to-Speech (TTS) engines and custom brand voices for enterprise. Our solutions leverage cutting-edge deep-learning research optimized for your business use-case and technical infrastructure.
·voicery.com·
Voicery Text-to-Speech
ChordU - chords for any song
ChordU - chords for any song
Get piano, ukulele & guitar chords with variations for any song you love, play along with chords, change transpose and many more.
·chordu.com·
ChordU - chords for any song