Audio computing

146 bookmarks
SpeechNinja - Type to Speak
SpeechNinja says what you type in real time. It enables people with speech difficulties to speak out loud using a synthesized voice (AAC), and more.
·speechninja.co·
Text to Speech - TTS Online Converter Tools
We developed an online text-to-speech synthesis tool that converts text into natural, smooth human speech. It provides 100+ speakers to choose from, supports multiple languages and dialects as well as mixed Chinese-English text, and lets you configure audio parameters flexibly. It is widely used in news reading, travel navigation, intelligent hardware, and notification broadcasting, and it can convert text content into MP3 files to download and save.
·text-to-speech.online·
Plachtaa/VALL-E-X: An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/ - Plachtaa/VALL-E-X
·github.com·
rsxdalv/tts-generation-webui: TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS) - rsxdalv/tts-generation-webui
·github.com·
OpenAI.fm
An interactive demo for developers to try the new text-to-speech model in the OpenAI API
·openai.fm·
Bland AI | Automate Phone Calls with Conversational AI for Enterprises
Transform your enterprise communication with Bland AI. Automate inbound and outbound phone calls using AI that sounds human. Perfect for sales, customer support, and operations with customizable voices and seamless integrations.
·bland.ai·
Transcriptor
Convert voice to text in real time! The UI couldn't be simpler! You can edit, search and share all your transcriptions. Your transcriptions are automatically saved to iCloud. Supported languages: English, Arabic, Chinese, Dutch, French, German, Hindi, Indonesian, Italian, Japanese, Korean, Polish…
·apps.apple.com·
TTSMaker - Free Text to Speech Online
TTSMaker is a free text-to-speech tool and online text reader that converts text to speech. As an AI voice generator, it supports 100+ languages and 300+ voice styles, and its neural network makes speech sound more natural. You can listen online or download audio files in MP3 or WAV format.
·ttsmaker.com·
open-mmlab/Amphion: Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi...
·github.com·
Parrot AI - Celebrity Voice Generator
Parrot AI is the top celebrity voice generator. Create fun audio clips to roast your friends, send birthday messages, and light up your group chat!
·tryparrotai.com·