Wondera.ai - Your AI Music Co-creator
zeke/kontext-realtime: Create and edit images using your voice
Replicate
MiniMax Audio: Create Lifelike Audio
Unlock our advanced technology to create lifelike speech in multiple languages, with diverse voices and accents.
10 Best Free AI Voice Cloning Tools: Tested
What are the best free AI voice cloning tools? Find out as I test 10 of them, including MiniMax, Descript, VEED, PlayHT, Voice AI, Uberduck, Vocloner, and more.
Proactor AI
Proactor AI is the first proactive AI agent — a context-aware, memory-augmented teammate that transcribes your conversations, identifies needs, and takes real-time action before you ask.
OpenAudio
A Spark Between Voice and Text.
Unmute: Make LLMs listen and speak
Dia 1.6B - a Hugging Face Space by nari-labs
This app converts written text into speech. Users can provide text and an optional audio prompt to guide the voice and style. The app outputs the generated audio.
Aqua Voice
Fast speech-to-text for Mac and Windows. Responses in as little as 450ms. Create prompts, notes, messages, and docs with just your voice.
kyutai-labs/moshi: Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
fixie-ai/ultravox: A fast multimodal LLM for real-time voice
NoteX: AI Note Taking, Voice To Notes
Transform any content into smart study material with NoteX, your all-in-one AI learning companion. NoteX revolutionizes how you capture, understand, and retain information.
Sesame
We believe in a future where computers are lifelike. Where they can see, hear, and collaborate with us – as we do with each other. With this vision, we're designing a new kind of computer.
Notis | Voice to Notion — right from your phone
Capture, organize, and find anything with your voice, right from your phone. Create meeting notes, memos, emails, articles and more when you're away from your desk. Try it for free!
OASIS - Perfect Writing. Zero Effort.
All you do is talk. AI does the rest.
describy
AI voice conversations to get feedback from your users for your web app.
Real-time voice, video, and AI for developers - Daily
Daily is the enterprise WebRTC platform to build real-time voice, video and AI at scale. Industry-leading SLAs, analytics, and APIs since 2016.
Pricing | Rime
Voices.ink: Transcribe Voice Notes Directly into Notion
Transform your spoken ideas into text effortlessly with Voices.ink. Integrate with Notion and elevate your note-taking experience. Try it for free!
Outspeed | Platform for Realtime Voice and Video AI
Outspeed provides networking and inference infrastructure to build fast, real time voice and video AI apps.
Wispr Flow | Effortless Voice Dictation
Flow makes writing quick and clear with seamless voice dictation. It is the fastest, smartest way to type with your voice.
F5-TTS | Free Online AI Text-to-Speech Synthesis Tool
F5-TTS is a free online real-time text-to-speech synthesis tool that leverages AI to generate natural and expressive speech from text input.
Voicenotes: Transcribe notes, meetings & ask AI
Voicenotes is an intelligent note-taker that lets you transcribe voice notes and meetings in 100+ languages.
Turn conversations into results · Supernormal Voice Agents
Supernormal Voice Agents handle the work for you—whether it’s inbound sales, customer support, running employee surveys, or anything else you can dream up. Build any conversational experience to scale yourself and your business.
SpeechBrain: Open-Source Conversational AI for Everyone
LiveKit
Instantly transport audio and video between LLMs and your users.
pipecat-ai/pipecat: Open Source framework for voice and multimodal conversational AI
Voice and Video powered AI apps - Daily
Easily build audio and video WebRTC capabilities into your AI apps and workflows with Daily's APIs, SDKs and infrastructure.
AI Speech Technology | Speech-To-Text API | Speechmatics
Speechmatics offer the most accurate AI speech technology - with AI transcription & real-time translation components. Try our Speech API today!
kyutai: open science AI lab