Edge TTS Demo
travisvn/openai-edge-tts: Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs
Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs - travisvn/openai-edge-tts
gexgd0419/NaturalVoiceSAPIAdapter: Make Azure natural TTS voices accessible to any SAPI 5-compatible application.
Make Azure natural TTS voices accessible to any SAPI 5-compatible application. - gexgd0419/NaturalVoiceSAPIAdapter
resemble-ai/chatterbox: SoTA open-source TTS
SoTA open-source TTS. Contribute to resemble-ai/chatterbox development by creating an account on GitHub.
yl4579/StyleTTS2: StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models - yl4579/StyleTTS2
ProperCode/Work-by-Speech: Windows app which allows efficient work on a computer by speech alone.
Windows app which allows efficient work on a computer by speech alone. - ProperCode/Work-by-Speech
nari-labs/dia: A TTS model capable of generating ltra-realistic dialogue in one pass.
A TTS model capable of generating ultra-realistic dialogue in one pass. - nari-labs/dia
SesameAILabs/csm: A Conversational Speech Generation Model
A Conversational Speech Generation Model. Contribute to SesameAILabs/csm development by creating an account on GitHub.
Plachtaa/VALL-E-X: An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/ - Plachtaa/VALL-E-X
rsxdalv/tts-generation-webui: TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS) - rsxdalv/tts-generation-webui
ken107/piper-browser-extension · GitHub
Provides Piper neural text-to-speech voices as a browser extension - ken107/piper-browser-extension
abus-aikorea/voice-pro · GitHub
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube dow...
open-mmlab/Amphion: Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi...
w-okada/voice-changer · GitHub
リアルタイムボイスチェンジャー Realtime Voice Changer. Contribute to w-okada/voice-changer development by creating an account on GitHub.
RVC-Project/Retrieval-based-Voice-Conversion-WebUI: Easily train a good VC model with voice data = 10 mins!
Easily train a good VC model with voice data
neonbjb/tortoise-tts · GitHub
A multi-voice TTS system trained with an emphasis on quality - neonbjb/tortoise-tts
SWivid/F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" - SWivid/F5-TTS
huggingface/parler-tts · GitHub
Inference and training library for high-quality TTS models. - huggingface/parler-tts
coqui-ai/TTS · GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production - coqui-ai/TTS
fishaudio/fish-speech: Brand new TTS solution
Brand new TTS solution. Contribute to fishaudio/fish-speech development by creating an account on GitHub.
mkiol/dsnote: Speech Note Linux app
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation. - mkiol/dsnote
ihuguet/picotts · GitHub
Pico TTS: text to speech voice sinthesizer from SVox, included in Android AOSP - ihuguet/picotts
thorstenMueller/Thorsten-Voice · GitHub
Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling. - thorstenMueller/Thorsten-Voice
nvaccess/nvda: NVDA, the free and open source Screen Reader for Microsoft Windows
NVDA, the free and open source Screen Reader for Microsoft Windows - nvaccess/nvda
evuraan/mintPiper: Make Linux speak what's on the screen: clearly and securely.
Make Linux speak what's on the screen: clearly and securely. - evuraan/mintPiper
davidacm/NVDA-IBMTTS-Driver · GitHub
This project is aimed at developing and maintaining the NVDA IBMTTS driver. IBMTTS is a synthesizer similar to Eloquence. Please send your ideas and contributions here! - davidacm/NVDA-IBMTTS-Driver
mush42/sonata-nvda · GitHub
This add-on implements a speech synthesizer driver for NVDA using neural TTS models. It supports Piper - mush42/sonata-nvda
CrashXBETAX/Text_To_Speech_Live_WinUI3_Public · GitHub
Contribute to CrashXBETAX/Text_To_Speech_Live_WinUI3_Public development by creating an account on GitHub.
RHVoice/RHVoice: a free and open source speech synthesizer for Russian and other languages
a free and open source speech synthesizer for Russian and other languages - RHVoice/RHVoice
muflone/gespeaker · GitHub
A text to speech GTK+ front-end for eSpeak and mbrola to play a text in many languages with settings for voice, pitch, volume and speed - muflone/gespeaker