Edge TTS Demo
SesameAILabs/csm: A Conversational Speech Generation Model
A Conversational Speech Generation Model. Contribute to SesameAILabs/csm development by creating an account on GitHub.
Plachtaa/VALL-E-X: An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/ - Plachtaa/VALL-E-X
rsxdalv/tts-generation-webui: TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS) - rsxdalv/tts-generation-webui
SWivid/F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" - SWivid/F5-TTS
huggingface/parler-tts · GitHub
Inference and training library for high-quality TTS models. - huggingface/parler-tts
coqui-ai/TTS · GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production - coqui-ai/TTS
fishaudio/fish-speech: Brand new TTS solution
Brand new TTS solution. Contribute to fishaudio/fish-speech development by creating an account on GitHub.
mkiol/dsnote: Speech Note Linux app
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation. - mkiol/dsnote
ihuguet/picotts · GitHub
Pico TTS: text to speech voice sinthesizer from SVox, included in Android AOSP - ihuguet/picotts
davidacm/NVDA-IBMTTS-Driver · GitHub
This project is aimed at developing and maintaining the NVDA IBMTTS driver. IBMTTS is a synthesizer similar to Eloquence. Please send your ideas and contributions here! - davidacm/NVDA-IBMTTS-Driver
mush42/sonata-nvda · GitHub
This add-on implements a speech synthesizer driver for NVDA using neural TTS models. It supports Piper - mush42/sonata-nvda
muflone/gespeaker · GitHub
A text to speech GTK+ front-end for eSpeak and mbrola to play a text in many languages with settings for voice, pitch, volume and speed - muflone/gespeaker
LokerL/tts-vue · GitHub
🎤 微软语音合成工具,使用 Electron + Vue + ElementPlus + Vite 构建。 - LokerL/tts-vue
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time - GitHub - CorentinJ/Real-Time-Voice-Cloning: Clone a voice in 5 seconds to generate arbitrary speech in real-time