Edge TTS Demo
travisvn/openai-edge-tts: Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs
Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs - travisvn/openai-edge-tts
gexgd0419/NaturalVoiceSAPIAdapter: Make Azure natural TTS voices accessible to any SAPI 5-compatible application.
Make Azure natural TTS voices accessible to any SAPI 5-compatible application. - gexgd0419/NaturalVoiceSAPIAdapter
resemble-ai/chatterbox: SoTA open-source TTS
SoTA open-source TTS. Contribute to resemble-ai/chatterbox development by creating an account on GitHub.
namidaco/namida: A Beautiful and Feature-rich Music & Video Player with Youtube Support, Built in Flutter
A Beautiful and Feature-rich Music & Video Player with Youtube Support, Built in Flutter - namidaco/namida
Nokse22/high-tide: Libadwaita TIDAL client for Linux
Libadwaita TIDAL client for Linux. Contribute to Nokse22/high-tide development by creating an account on GitHub.
myshell-ai/OpenVoice: Instant voice cloning by MyShell
Instant voice cloning by MyShell. Contribute to myshell-ai/OpenVoice development by creating an account on GitHub.
yl4579/StyleTTS2: StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models - yl4579/StyleTTS2
Tuneo - Apps on Google Play
Open Source Guitar Tuner
Revisto/drum-machine: A drum machine application, built with Python, GTK4, libadwaita, and Pygame.
A drum machine application, built with Python, GTK4, libadwaita, and Pygame. - Revisto/drum-machine
abus-aikorea/voice-pro · GitHub
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube dow...
ProperCode/Work-by-Speech: Windows app which allows efficient work on a computer by speech alone.
Windows app which allows efficient work on a computer by speech alone. - ProperCode/Work-by-Speech
evuraan/mintPiper: Make Linux speak what's on the screen: clearly and securely.
Make Linux speak what's on the screen: clearly and securely. - evuraan/mintPiper
muflone/gespeaker · GitHub
A text to speech GTK+ front-end for eSpeak and mbrola to play a text in many languages with settings for voice, pitch, volume and speed - muflone/gespeaker
mkiol/dsnote: Speech Note Linux app
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation. - mkiol/dsnote
rsxdalv/tts-generation-webui: TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS) - rsxdalv/tts-generation-webui
neonbjb/tortoise-tts · GitHub
A multi-voice TTS system trained with an emphasis on quality - neonbjb/tortoise-tts
nari-labs/dia: A TTS model capable of generating ltra-realistic dialogue in one pass.
A TTS model capable of generating ultra-realistic dialogue in one pass. - nari-labs/dia
SesameAILabs/csm: A Conversational Speech Generation Model
A Conversational Speech Generation Model. Contribute to SesameAILabs/csm development by creating an account on GitHub.
rakuri255/UltraSinger: AI based tool to convert vocals lyrics and pitch from music to autogenerate Ultrastar Deluxe, Midi and notes. It automatic tapping, adding text, pitch vocals and creates karaoke files
AI based tool to convert vocals lyrics and pitch from music to autogenerate Ultrastar Deluxe, Midi and notes. It automatic tapping, adding text, pitch vocals and creates karaoke files. - rakuri255/...
Plachtaa/VALL-E-X: An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/ - Plachtaa/VALL-E-X
talwat/lowfi: An extremely simple lofi player.
An extremely simple lofi player. Contribute to talwat/lowfi development by creating an account on GitHub.
ravachol/kew: A terminal music player.
A terminal music player. Contribute to ravachol/kew development by creating an account on GitHub.
z-huang/InnerTune: A Material 3 YouTube Music client for Android
A Material 3 YouTube Music client for Android. Contribute to z-huang/InnerTune development by creating an account on GitHub.
gstraube/cythara · GitHub
A musical instrument tuner for Android.
thetwom/Tuner · GitHub
Tuner app.
ken107/piper-browser-extension · GitHub
Provides Piper neural text-to-speech voices as a browser extension - ken107/piper-browser-extension
dweymouth/supersonic: A lightweight and full-featured cross-platform desktop client for self-hosted music servers
A lightweight and full-featured cross-platform desktop client for self-hosted music servers - dweymouth/supersonic
open-mmlab/Amphion: Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi...
nicotine-plus/nicotine-plus: Graphical client for the Soulseek peer-to-peer network
Graphical client for the Soulseek peer-to-peer network - nicotine-plus/nicotine-plus