Audio

Audio

562 bookmarks
Newest
Mureka
Mureka
Capture and amplify the spark of inspiration in your heart on Mureka. Input any musical inspiration, including audio, to create a complete song. Publish the songs you create on Mureka to the store for sale and earn a stable copyright income. Your songs will have the opportunity to be sold to music lovers around the world.
·mureka.ai·
Mureka
open-mmlab/Amphion: Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
open-mmlab/Amphion: Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi...
·github.com·
open-mmlab/Amphion: Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
VALL-E
VALL-E
VALL-E is a neural codec language model using discrete codes derived from an off-the-shelf neural audio codec model, and regard TTS as a conditional language modeling task rather. VALL-E emerges in-context learning capabilities and can be used to synthesize high-quality personalized speech with only a 3-second enrolled recording of an unseen speaker as a prompt. We also extend VALL-E and train a multi-lingual conditional codec language model. VALL-E X can generate high-quality speech in the target language via just one speech utterance in the source language as a prompt while preserving the unseen speaker’s voice, emotion, and acoustic environment.
·microsoft.com·
VALL-E
Voicery Text-to-Speech
Voicery Text-to-Speech
Voicery creates natural-sounding Text-to-Speech (TTS) engines and custom brand voices for enterprise. Our solutions leverage cutting-edge deep-learning research optimized for your business use-case and technical infrastructure.
·voicery.com·
Voicery Text-to-Speech