Voice computing

144 bookmarks

Newest

Sesame

We believe in a future where computers are lifelike. Where they can see, hear, and collaborate with us – as we do with each other. With this vision, we're designing a new kind of computer.

Voice assistant #Software: AI

·sesame.com·Mar 6, 2025

Sesame

AssemblyAI | AI models to transcribe and understand speech

With AssemblyAI's industry-leading Speech AI models, transcribe speech to text and extract insights from your voice data.

Speech To Text

·assemblyai.com·Feb 19, 2025

AssemblyAI | AI models to transcribe and understand speech

Nuance - Dragon Speech Recognition

Work faster and smarter and speed document creation and automate workflows with the world's best-selling speech recognition solution.

Speech To Text

·nuance.com·Dec 28, 2024

Nuance - Dragon Speech Recognition

Text to Speech: Generate natural sounding voices and voice overs

Download voices as MP3. Create phone announcements, YouTube, Explainer, E-learning Videos and more.

Text To Speech interface

·voiceovermaker.io·Dec 28, 2024

Text to Speech: Generate natural sounding voices and voice overs

Web Assist - Surf the Web with just your voice

Web Assist is a browser extension for Edge and Chrome that allows you to browse the web using your voice.

Voice assistant

·webassistextension.com·Dec 28, 2024

Web Assist - Surf the Web with just your voice

Voicy Speech to Text

Speech to Text Chrome Extension Write with your voice on every website. AI-powered dictation tool.

Speech To Text #Software: Extension

·usevoicy.com·Dec 28, 2024

Voicy Speech to Text

DuRT - Speech Recognition

Description will go into a meta tag in head /

Speech recognition #OS compatibility: macOS

·durt.dudufuture.top·Dec 11, 2024

DuRT - Speech Recognition

‎Transcriptor

‎Convert voice to text in real time! The UI couldn't be simpler! You can edit, search and share all your transcriptions. Your transcriptions are automatically saved to iCloud. Supported languages: English, Arabic, Chinese, Dutch, French, German, Hindi, Indonesian, Italian, Japanese, Korean, Polish…

Speech To Text #OS compatibility: macOS

·apps.apple.com·Dec 10, 2024

‎Transcriptor

ken107/piper-browser-extension · GitHub

Provides Piper neural text-to-speech voices as a browser extension - ken107/piper-browser-extension

Text To Speech interface #Source Code: GitHub

·github.com·Dec 4, 2024

ken107/piper-browser-extension · GitHub

TTSMaker - Free Text to Speech Online

TTSMaker is a free text-to-speech tool and an online text reader that can convert text to speech, as an AI voice generator, it supports 100+ languages and 300+ voice styles, powerful neural network makes speech sound more natural, you can listen online, or download audio files in mp3, wav format.

Text To Speech interface

·ttsmaker.com·Dec 4, 2024

TTSMaker - Free Text to Speech Online

abus-aikorea/voice-pro · GitHub

Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube dow...

Text To Speech interface #Source Code: GitHub

·github.com·Nov 30, 2024

abus-aikorea/voice-pro · GitHub

X to Voice | ElevenLabs

Analyze your X profile to generate a unique voice using ElevenLabs' new Voice Design feature

#Software: Open-Source

·xtovoice.com·Nov 2, 2024

X to Voice | ElevenLabs

open-mmlab/Amphion: Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi...

#Source Code: GitHub

·github.com·Oct 28, 2024

Parrot AI - Celebrity Voice Generator

Parrot AI is the top celebrity voice generator. Create fun audio clips to roast your friends, send birthday messages, and light up your group chat!

Text To Speech synthesizer

·tryparrotai.com·Oct 26, 2024

Parrot AI - Celebrity Voice Generator

TopMediai: Premier Destination for AI-Powered Audio Tools & More

At TopMediai, explore AI audio tools such as voice cloning, text-to-speech, AI song cover generator, alongside other AI tools. Make your voice charming today.

Text To Speech synthesizer

·topmediai.com·Oct 26, 2024

TopMediai: Premier Destination for AI-Powered Audio Tools & More

w-okada/voice-changer · GitHub

リアルタイムボイスチェンジャー Realtime Voice Changer. Contribute to w-okada/voice-changer development by creating an account on GitHub.

#Source Code: GitHub

·github.com·Oct 26, 2024

w-okada/voice-changer · GitHub

RVC-Project/Retrieval-based-Voice-Conversion-WebUI: Easily train a good VC model with voice data = 10 mins!

Easily train a good VC model with voice data

#Source Code: GitHub

·github.com·Oct 26, 2024

RVC-Project/Retrieval-based-Voice-Conversion-WebUI: Easily train a good VC model with voice data = 10 mins!

neonbjb/tortoise-tts · GitHub

A multi-voice TTS system trained with an emphasis on quality - neonbjb/tortoise-tts

Text To Speech synthesizer #Source Code: GitHub #❤️

·github.com·Oct 19, 2024

neonbjb/tortoise-tts · GitHub

SWivid/F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" - SWivid/F5-TTS

Text To Speech synthesizer #Source Code: GitHub

·github.com·Oct 19, 2024

SWivid/F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Applio

At the forefront of innovation as an open-source ecosystem that hosts cutting-edge AI voice cloning technologies.

Speech recognition #Software: Open-Source

·applio.org·Oct 17, 2024

Applio

HoldSpeak - Type 3x faster with AI powered voice-to-text

HoldSpeak is a AI-powered app that allows you to type 3x faster

Speech To Text

·holdspeak.com·Oct 4, 2024

HoldSpeak - Type 3x faster with AI powered voice-to-text

voxforge.org - Free Speech... Recognition (Linux, Windows and Mac)

Speech recognition

·voxforge.org·Oct 2, 2024

voxforge.org - Free Speech... Recognition (Linux, Windows and Mac)

SoundHound | Technology for a voice-enabled world

Voice AI interfaces for hardware devices, services, vehicles, mobile apps, and more powered by SoundHound's conversational intelligence solutions

Text To Speech interface

·soundhound.com·Sep 25, 2024

SoundHound | Technology for a voice-enabled world

VALL-E

VALL-E is a neural codec language model using discrete codes derived from an off-the-shelf neural audio codec model, and regard TTS as a conditional language modeling task rather. VALL-E emerges in-context learning capabilities and can be used to synthesize high-quality personalized speech with only a 3-second enrolled recording of an unseen speaker as a prompt. We also extend VALL-E and train a multi-lingual conditional codec language model. VALL-E X can generate high-quality speech in the target language via just one speech utterance in the source language as a prompt while preserving the unseen speaker’s voice, emotion, and acoustic environment.

Text To Speech synthesizer

·microsoft.com·Sep 25, 2024

VALL-E

WhisperSpeech/WhisperSpeech · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Text To Speech synthesizer

·huggingface.co·Sep 16, 2024

WhisperSpeech/WhisperSpeech · Hugging Face

huggingface/parler-tts · GitHub

Inference and training library for high-quality TTS models. - huggingface/parler-tts

Text To Speech synthesizer #Source Code: GitHub

·github.com·Sep 16, 2024

huggingface/parler-tts · GitHub

DiTTo-TTS

Text To Speech synthesizer

·ditto-tts.github.io·Sep 16, 2024

DiTTo-TTS

coqui-ai/TTS · GitHub

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production - coqui-ai/TTS

Text To Speech synthesizer #Source Code: GitHub

·github.com·Sep 16, 2024

coqui-ai/TTS · GitHub

iSleech | TTS SDK | Speech Recognition (ASR)

Text To Speech synthesizer

·ispeech.org·Sep 15, 2024

iSleech | TTS SDK | Speech Recognition (ASR)

Voicery Text-to-Speech

Voicery creates natural-sounding Text-to-Speech (TTS) engines and custom brand voices for enterprise. Our solutions leverage cutting-edge deep-learning research optimized for your business use-case and technical infrastructure.

Text To Speech synthesizer #Software: AI

·voicery.com·Sep 15, 2024

Voicery Text-to-Speech