Audio & Music

680 bookmarks

Newest

Metavoice - Conversational Speech Model for Voice AI Agents

·tts.metavoice.io·Jul 6, 2025

Metavoice - Conversational Speech Model for Voice AI Agents

yl4579/StyleTTS2: StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models - yl4579/StyleTTS2

TTS model #Source Code: GitHub #Type: Open-Source

·github.com·Jul 6, 2025

yl4579/StyleTTS2: StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Cartesia - The fastest, ultra-realistic voice AI platform

Our mission is to build the next generation of AI: ubiquitous, interactive intelligence that runs wherever you are. We're pioneering the model architectures that will make it possible.

TTS Synthesizer

·cartesia.ai·Jul 6, 2025

Cartesia - The fastest, ultra-realistic voice AI platform

MiniMax - Intelligence with everyone

MiniMax is a leading global technology company and one of the pioneers of large language models (LLMs) in Asia.

TTS model

·minimax.io·Jul 6, 2025

MiniMax - Intelligence with everyone

Muse - AI for Music Producers

Muse is an AI music production tool that helps you create music faster and easier.

Instrument MIDI

·muse.art·Jul 3, 2025

Muse - AI for Music Producers

Strudel REPL

Source-code: https://codeberg.org/uzu/strudel

Virtual instruments #❤️#Source Code: Codeberg #OS Compatibility: web app #Type: Open-Source

·strudel.cc·Jun 18, 2025

Strudel REPL

Tuneo - Apps on Google Play

Open Source Guitar Tuner

Tuner #Source Code: GitHub #Type: Open-Source

·play.google.com·Jun 17, 2025

Tuneo - Apps on Google Play

WhisperBuddy - AI powered transcription macOS app

AI powered transcription macOS app

Speech recognition #OS compatibility: macOS

·whisperbuddy.com·Jun 6, 2025

WhisperBuddy - AI powered transcription macOS app

The Best 355 AI Speech Synthesis AI Tools - Toolify

Best 355 AI Speech Synthesis AI Tools are: ElevenLabs,Adobe Podcast,Speechify,Descript,TTSMaker, and the newest AI Speech Synthesis Tools.

TTS Synthesizer #Content: List

·toolify.ai·May 16, 2025

The Best 355 AI Speech Synthesis AI Tools - Toolify

Revisto/drum-machine: A drum machine application, built with Python, GTK4, libadwaita, and Pygame.

A drum machine application, built with Python, GTK4, libadwaita, and Pygame. - Revisto/drum-machine

DPM (Drum Pad Machine)#Source Code: GitHub #Type: Open-Source

·github.com·May 12, 2025

Revisto/drum-machine: A drum machine application, built with Python, GTK4, libadwaita, and Pygame.

[untitled]

A sacred place for your work-in-progress music

Music player

·untitled.stream·May 7, 2025

[untitled]

ProperCode/Work-by-Speech: Windows app which allows efficient work on a computer by speech alone.

Windows app which allows efficient work on a computer by speech alone. - ProperCode/Work-by-Speech

Voice control #Source Code: GitHub #Type: Open-Source

·github.com·May 4, 2025

ProperCode/Work-by-Speech: Windows app which allows efficient work on a computer by speech alone.

Microsoft Azure - Azure AI Speech

Explore Azure AI Speech to build generative AI apps faster using pre-built or customizable speech AI models.

TTS model #Company: Microsoft

·azure.microsoft.com·May 3, 2025

Microsoft Azure - Azure AI Speech

Orca

Source-code: https://orca.gnome.org/source.html

Screen Reader #Type: Open-Source #Community: GNOME #OS: Linux-based

·orca.gnome.org·May 3, 2025

Orca

Fish Speech

Targeting SOTA TTS solutions.

TTS Synthesizer

·speech.fish.audio·May 3, 2025

Fish Speech

nari-labs/dia: A TTS model capable of generating ltra-realistic dialogue in one pass.

A TTS model capable of generating ultra-realistic dialogue in one pass. - nari-labs/dia

TTS model #Source Code: GitHub #Type: Open-Source

·github.com·May 3, 2025

nari-labs/dia: A TTS model capable of generating ltra-realistic dialogue in one pass.

SesameAILabs/csm: A Conversational Speech Generation Model

A Conversational Speech Generation Model. Contribute to SesameAILabs/csm development by creating an account on GitHub.

TTS Synthesizer #Source Code: GitHub #Type: Open-Source

·github.com·May 3, 2025

SesameAILabs/csm: A Conversational Speech Generation Model

xcribe - Free Privacy focused transcription tool for MacOS.

xcribe is a privacy focused transcription tool for MacOS. It uses the latest in speech recognition technology.

Speech recognition #OS compatibility: macOS

·xcribe.app·May 2, 2025

xcribe - Free Privacy focused transcription tool for MacOS.

SpotCompiled

Music client #OS compatibility: iOS

·spotc.yodaluca.dev·Apr 16, 2025

SpotCompiled

Speech Synthesis Online - free text to speech online converter tools

TTS Synthesizer #❤️#OS Compatibility: web app

·speechsynthesis.online·Apr 13, 2025

Speech Synthesis Online - free text to speech online converter tools

SpeechNinja - Type to Speak

SpeechNinja says what you type in real time. It enables people with speech difficulties to speak out loud using synthesized voice (AAC) and more.

TTS Synthesizer #❤️

·speechninja.co·Apr 2, 2025

SpeechNinja - Type to Speak

Audioread - Listen to Article, PDF, Email in Browser or Podcast App

Listen to any article, PDF, email, etc. in your podcast app, our mobile app, or the browser. Audible-like voice. Make reading as easy as putting in your AirPods. Try Audioread for free.

TTS Synthesizer

·audioread.com·Apr 2, 2025

Audioread - Listen to Article, PDF, Email in Browser or Podcast App

Online Microsoft Sam TTS Generator

Online Microsoft SAM, SAPI4, Bonzi Buddy Text to speech generator

TTS Synthesizer

·tetyys.com·Apr 2, 2025

Online Microsoft Sam TTS Generator

Text To Speech Voices & Downloads - Internet Archive

This collection includes multiple Text To Speech software from multiple providers including voices for multiple languages and many SAPI versions with SAPI 3,...

TTS Synthesizer #Site: Internet Archive #Content: List

·archive.org·Apr 2, 2025

Text To Speech Voices & Downloads - Internet Archive

DAB Music Player | High-Resolution Audio

DAB Music Player lets you search, stream, and download music in up to 24bit/192kHz quality for the ultimate audio experience. Enjoy superior sound quality with our high-resolution music player.

Music player

·dabplayer.vercel.app·Apr 2, 2025

DAB Music Player | High-Resolution Audio

Text to Speech - TTS Online Converter Tools

We developed an online text-to-speech synthesis tool, which converts text into natural and smooth human voice, provides 100+ speakers for you to choose, supports multi-language, multi-dialect and Chinese-English mixing, and can configure audio flexibly parameter. It is widely used in news reading, travel navigation, intelligent hardware and notification broadcasting. And can convert the text content into MP3 files to download and save.

TTS Synthesizer #❤️#OS Compatibility: web app

·text-to-speech.online·Apr 2, 2025

Text to Speech - TTS Online Converter Tools

monohex software

Music player #OS compatibility: macOS

·monohex.com·Mar 28, 2025

monohex software

Mic Drop | The party game that tests your lyrical knowledge, adapted for web

Karaoke

·micdrop.gg·Mar 26, 2025

Mic Drop | The party game that tests your lyrical knowledge, adapted for web

UltraStar Play

Karaoke

·ultrastar-play.com·Mar 26, 2025

UltraStar Play

rakuri255/UltraSinger: AI based tool to convert vocals lyrics and pitch from music to autogenerate Ultrastar Deluxe, Midi and notes. It automatic tapping, adding text, pitch vocals and creates karaoke files

AI based tool to convert vocals lyrics and pitch from music to autogenerate Ultrastar Deluxe, Midi and notes. It automatic tapping, adding text, pitch vocals and creates karaoke files. - rakuri255/...

Karaoke #Software: AI #Source Code: GitHub #Type: Open-Source

·github.com·Mar 26, 2025