Speechelo - The Best Text To Speech Softare
Audio computing
Camb.ai: AI Voice Translation & Dubbing for Videos
Camb.ai is AI-driven video content localization platform built for content creators and media producers. Join 100s of video first companies who use Camb.ai to d
Plachtaa/VALL-E-X: An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/ - Plachtaa/VALL-E-X
rsxdalv/tts-generation-webui: TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS) - rsxdalv/tts-generation-webui
OpenAI.fm
An interactive demo for developers to try the new text-to-speech model in the OpenAI API
Bland AI | Automate Phone Calls with Conversational AI for Enterprises
Transform your enterprise communication with Bland AI. Automate inbound and outbound phone calls using AI that sounds human. Perfect for sales, customer support, and operations with customizable voices and seamless integrations.
AssemblyAI | AI models to transcribe and understand speech
With AssemblyAI's industry-leading Speech AI models, transcribe speech to text and extract insights from your voice data.
Nuance - Dragon Speech Recognition
Work faster and smarter and speed document creation and automate workflows with the world's best-selling speech recognition solution.
Text to Speech: Generate natural sounding voices and voice overs
Download voices as MP3. Create phone announcements, YouTube, Explainer, E-learning Videos and more.
Voicy Speech to Text
Speech to Text Chrome Extension
Write with your voice on every website. AI-powered dictation tool.
DuRT - Speech Recognition
Description will go into a meta tag in head /
Transcriptor
Convert voice to text in real time! The UI couldn't be simpler!
You can edit, search and share all your transcriptions.
Your transcriptions are automatically saved to iCloud.
Supported languages: English, Arabic, Chinese, Dutch, French, German, Hindi, Indonesian, Italian, Japanese, Korean, Polish…
ken107/piper-browser-extension · GitHub
Provides Piper neural text-to-speech voices as a browser extension - ken107/piper-browser-extension
TTSMaker - Free Text to Speech Online
TTSMaker is a free text-to-speech tool and an online text reader that can convert text to speech, as an AI voice generator, it supports 100+ languages and 300+ voice styles, powerful neural network makes speech sound more natural, you can listen online, or download audio files in mp3, wav format.
abus-aikorea/voice-pro · GitHub
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube dow...
X to Voice | ElevenLabs
Analyze your X profile to generate a unique voice using ElevenLabs' new Voice Design feature
open-mmlab/Amphion: Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi...
Parrot AI - Celebrity Voice Generator
Parrot AI is the top celebrity voice generator. Create fun audio clips to roast your friends, send birthday messages, and light up your group chat!
w-okada/voice-changer · GitHub
リアルタイムボイスチェンジャー Realtime Voice Changer. Contribute to w-okada/voice-changer development by creating an account on GitHub.
RVC-Project/Retrieval-based-Voice-Conversion-WebUI: Easily train a good VC model with voice data = 10 mins!
Easily train a good VC model with voice data
neonbjb/tortoise-tts · GitHub
A multi-voice TTS system trained with an emphasis on quality - neonbjb/tortoise-tts
SWivid/F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" - SWivid/F5-TTS
Applio
At the forefront of innovation as an open-source ecosystem that hosts cutting-edge AI voice cloning technologies.
HoldSpeak - Type 3x faster with AI powered voice-to-text
HoldSpeak is a AI-powered app that allows you to type 3x faster
voxforge.org - Free Speech... Recognition (Linux, Windows and Mac)
VALL-E
VALL-E is a neural codec language model using discrete codes derived from an off-the-shelf neural audio codec model, and regard TTS as a conditional language modeling task rather. VALL-E emerges in-context learning capabilities and can be used to synthesize high-quality personalized speech with only a 3-second enrolled recording of an unseen speaker as a prompt. We also extend VALL-E and train a multi-lingual conditional codec language model. VALL-E X can generate high-quality speech in the target language via just one speech utterance in the source language as a prompt while preserving the unseen speaker’s voice, emotion, and acoustic environment.
WhisperSpeech/WhisperSpeech · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface/parler-tts · GitHub
Inference and training library for high-quality TTS models. - huggingface/parler-tts
DiTTo-TTS
coqui-ai/TTS · GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production - coqui-ai/TTS