Tunable - Instrument and Skill Tuner by AffinityBlue

Audio
gstraube/cythara · GitHub
A musical instrument tuner for Android.
thetwom/Tuner · GitHub
Tuner app.
DuRT - Speech Recognition
Description will go into a meta tag in head /
Transcriptor
Convert voice to text in real time! The UI couldn't be simpler!
You can edit, search and share all your transcriptions.
Your transcriptions are automatically saved to iCloud.
Supported languages: English, Arabic, Chinese, Dutch, French, German, Hindi, Indonesian, Italian, Japanese, Korean, Polish…
ken107/piper-browser-extension · GitHub
Provides Piper neural text-to-speech voices as a browser extension - ken107/piper-browser-extension
TTSMaker - Free Text to Speech Online
TTSMaker is a free text-to-speech tool and an online text reader that can convert text to speech, as an AI voice generator, it supports 100+ languages and 300+ voice styles, powerful neural network makes speech sound more natural, you can listen online, or download audio files in mp3, wav format.
Soundslice | Create living sheet music
Learn music better with our living sheet music.
abus-aikorea/voice-pro · GitHub
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube dow...
dweymouth/supersonic: A lightweight and full-featured cross-platform desktop client for self-hosted music servers
A lightweight and full-featured cross-platform desktop client for self-hosted music servers - dweymouth/supersonic
Cozy
Audiobooks on Linux
Draw.Audio - Draw something, then listen to it
Draw.Audio is a free musical sketch-pad for exploring ideas in sound.
X to Voice | ElevenLabs
Analyze your X profile to generate a unique voice using ElevenLabs' new Voice Design feature
open-mmlab/Amphion: Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi...
nicotine-plus/nicotine-plus: Graphical client for the Soulseek peer-to-peer network
Graphical client for the Soulseek peer-to-peer network - nicotine-plus/nicotine-plus
Swing Music
Just playing around.
AutoEq
Automatic headphone equalization
Parrot AI - Celebrity Voice Generator
Parrot AI is the top celebrity voice generator. Create fun audio clips to roast your friends, send birthday messages, and light up your group chat!
w-okada/voice-changer · GitHub
リアルタイムボイスチェンジャー Realtime Voice Changer. Contribute to w-okada/voice-changer development by creating an account on GitHub.
RVC-Project/Retrieval-based-Voice-Conversion-WebUI: Easily train a good VC model with voice data = 10 mins!
Easily train a good VC model with voice data
Moises App: The Musician's App | Vocal Remover & much more
The best app for practicing music. Remove vocals, separate instruments, master your tracks, and remix songs with the power of AI. Try it today!
neonbjb/tortoise-tts · GitHub
A multi-voice TTS system trained with an emphasis on quality - neonbjb/tortoise-tts
SWivid/F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" - SWivid/F5-TTS
Applio
At the forefront of innovation as an open-source ecosystem that hosts cutting-edge AI voice cloning technologies.
FxSound - Boost Volume and Sound Quality on Your PC
This new software boosts sound quality, volume, clarity and bass on your PC. FxSound will make your audio jump out of your speakers.
HoldSpeak - Type 3x faster with AI powered voice-to-text
HoldSpeak is a AI-powered app that allows you to type 3x faster
voxforge.org - Free Speech... Recognition (Linux, Windows and Mac)
VALL-E
VALL-E is a neural codec language model using discrete codes derived from an off-the-shelf neural audio codec model, and regard TTS as a conditional language modeling task rather. VALL-E emerges in-context learning capabilities and can be used to synthesize high-quality personalized speech with only a 3-second enrolled recording of an unseen speaker as a prompt. We also extend VALL-E and train a multi-lingual conditional codec language model. VALL-E X can generate high-quality speech in the target language via just one speech utterance in the source language as a prompt while preserving the unseen speaker’s voice, emotion, and acoustic environment.
Buitar - 首页
WhisperSpeech/WhisperSpeech · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.