Voice computing

142 bookmarks
Newest
Nuance - Dragon Speech Recognition
Nuance - Dragon Speech Recognition
Work faster and smarter and speed document creation and automate workflows with the world's best-selling speech recognition solution.
·nuance.com·
Nuance - Dragon Speech Recognition
‎Transcriptor
‎Transcriptor
‎Convert voice to text in real time! The UI couldn't be simpler! You can edit, search and share all your transcriptions. Your transcriptions are automatically saved to iCloud. Supported languages: English, Arabic, Chinese, Dutch, French, German, Hindi, Indonesian, Italian, Japanese, Korean, Polish…
·apps.apple.com·
‎Transcriptor
TTSMaker - Free Text to Speech Online
TTSMaker - Free Text to Speech Online
TTSMaker is a free text-to-speech tool and an online text reader that can convert text to speech, as an AI voice generator, it supports 100+ languages and 300+ voice styles, powerful neural network makes speech sound more natural, you can listen online, or download audio files in mp3, wav format.
·ttsmaker.com·
TTSMaker - Free Text to Speech Online
open-mmlab/Amphion: Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
open-mmlab/Amphion: Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi...
·github.com·
open-mmlab/Amphion: Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
w-okada/voice-changer · GitHub
w-okada/voice-changer · GitHub
リアルタイムボイスチェンジャー Realtime Voice Changer. Contribute to w-okada/voice-changer development by creating an account on GitHub.
·github.com·
w-okada/voice-changer · GitHub
VALL-E
VALL-E
VALL-E is a neural codec language model using discrete codes derived from an off-the-shelf neural audio codec model, and regard TTS as a conditional language modeling task rather. VALL-E emerges in-context learning capabilities and can be used to synthesize high-quality personalized speech with only a 3-second enrolled recording of an unseen speaker as a prompt. We also extend VALL-E and train a multi-lingual conditional codec language model. VALL-E X can generate high-quality speech in the target language via just one speech utterance in the source language as a prompt while preserving the unseen speaker’s voice, emotion, and acoustic environment.
·microsoft.com·
VALL-E
Voicery Text-to-Speech
Voicery Text-to-Speech
Voicery creates natural-sounding Text-to-Speech (TTS) engines and custom brand voices for enterprise. Our solutions leverage cutting-edge deep-learning research optimized for your business use-case and technical infrastructure.
·voicery.com·
Voicery Text-to-Speech