Audio computing

170 bookmarks

Newest

voxforge.org - Free Speech... Recognition (Linux, Windows and Mac)

Speech recognition #Type: Open-Source

·voxforge.org·Oct 2, 2024

voxforge.org - Free Speech... Recognition (Linux, Windows and Mac)

VALL-E

VALL-E is a neural codec language model using discrete codes derived from an off-the-shelf neural audio codec model, and regard TTS as a conditional language modeling task rather. VALL-E emerges in-context learning capabilities and can be used to synthesize high-quality personalized speech with only a 3-second enrolled recording of an unseen speaker as a prompt. We also extend VALL-E and train a multi-lingual conditional codec language model. VALL-E X can generate high-quality speech in the target language via just one speech utterance in the source language as a prompt while preserving the unseen speaker’s voice, emotion, and acoustic environment.

TTS Synthesizer

·microsoft.com·Sep 25, 2024

VALL-E

WhisperSpeech/WhisperSpeech · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Speech recognition #Source Code: Hugging Face #Type: Open-Source

·huggingface.co·Sep 16, 2024

WhisperSpeech/WhisperSpeech · Hugging Face

huggingface/parler-tts · GitHub

Inference and training library for high-quality TTS models. - huggingface/parler-tts

TTS Synthesizer #Source Code: GitHub #Type: Open-Source

·github.com·Sep 16, 2024

huggingface/parler-tts · GitHub

DiTTo-TTS

TTS Engine

·ditto-tts.github.io·Sep 16, 2024

DiTTo-TTS

coqui-ai/TTS · GitHub

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production - coqui-ai/TTS

TTS Synthesizer #Source Code: GitHub #Type: Open-Source

·github.com·Sep 16, 2024

coqui-ai/TTS · GitHub

Voicery Text-to-Speech

Voicery creates natural-sounding Text-to-Speech (TTS) engines and custom brand voices for enterprise. Our solutions leverage cutting-edge deep-learning research optimized for your business use-case and technical infrastructure.

TTS Synthesizer #Software: AI

·voicery.com·Sep 15, 2024

Voicery Text-to-Speech

fishaudio/fish-speech: Brand new TTS solution

Brand new TTS solution. Contribute to fishaudio/fish-speech development by creating an account on GitHub.

TTS Synthesizer #Source Code: GitHub #Type: Open-Source

·github.com·Sep 14, 2024

fishaudio/fish-speech: Brand new TTS solution

Audiomatic

Audio translated automatically using AI voice cloning technology.

TTS Synthesizer #Software: AI

·audiomatic.app·Aug 16, 2024

Audiomatic

Parlatype

GNOME audio player for transcription

Speech recognition

·parlatype.xyz·Jul 22, 2024

Parlatype

bigWav.app - Private audio transcription & annotation

bigWav: free and private audio transcription and annotation

Speech recognition #OS Compatibility: web app

·bigwav.app·Jul 20, 2024

bigWav.app - Private audio transcription & annotation

Lovo - AI Voice Generator: Realistic Text to Speech & Voice Cloning

Award-winning AI Voice Generator and text to speech software with 500+ voices in 100 languages. Realistic AI Voices with Online Video Editor. Clone your own voice.

TTS Synthesizer #Software: AI #OS Compatibility: web app

·lovo.ai·Jul 20, 2024

Lovo - AI Voice Generator: Realistic Text to Speech & Voice Cloning

mkiol/dsnote: Speech Note Linux app

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation. - mkiol/dsnote

TTS Synthesizer #Source Code: GitHub #❤️#Type: Open-Source #OS compatibility: Linux

·github.com·Jul 20, 2024

mkiol/dsnote: Speech Note Linux app

Speech to Note - Voice to Text, Note Speech & Speak Writer Solution

Explore Speech to Note for top-notch voice to text, note speech, and speak writer solutions. Our AI technology powered by GPT-4o ensures easy conversion of your voice into written notes.

Speech recognition #Software: AI

·speechtonote.com·Jul 15, 2024

Speech to Note - Voice to Text, Note Speech & Speak Writer Solution

ihuguet/picotts · GitHub

Pico TTS: text to speech voice sinthesizer from SVox, included in Android AOSP - ihuguet/picotts

TTS Synthesizer #Source Code: GitHub #Type: Open-Source

·github.com·Jul 6, 2024

ihuguet/picotts · GitHub

IBM Watson - Text to Speech

Watson Speech to Text is an API that transcribes speech to text in a variety of languages. It’s available as SaaS or for self-hosting.

TTS Synthesizer #OS Compatibility: web app

·ibm.com·Jul 6, 2024

IBM Watson - Text to Speech

Google Cloud - Text-to-Speech AI

Turn text into natural-sounding speech in 220+ voices across 40+ languages and variants with an API powered by Google’s machine learning technology.

TTS Synthesizer #Company: Alphabet (Google)

·cloud.google.com·Jul 6, 2024

Google Cloud - Text-to-Speech AI

Deutsche AI/KI TTS-Stimme kostenlos mit Thorsten-Voice

Das Thorsten-Voice Projekt stellt kostenlos deutsche, AI/KI erzeugte Text to Speech (TTS) Stimmen bereit die ohne Internet funktionieren.

TTS Synthesizer

·thorsten-voice.de·Jul 6, 2024

Deutsche AI/KI TTS-Stimme kostenlos mit Thorsten-Voice

thorstenMueller/Thorsten-Voice · GitHub

Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling. - thorstenMueller/Thorsten-Voice

TTS Engine #Source Code: GitHub #Type: Open-Source

·github.com·Jul 6, 2024

thorstenMueller/Thorsten-Voice · GitHub

Open Voices

Home page of OVOS

Voice control #Software: AI #Type: Open-Source

·openvoiceos.org·Jul 6, 2024

Open Voices

nvaccess/nvda: NVDA, the free and open source Screen Reader for Microsoft Windows

NVDA, the free and open source Screen Reader for Microsoft Windows - nvaccess/nvda

Screen Reader #Source Code: GitHub #Type: Open-Source

·github.com·Jul 6, 2024

nvaccess/nvda: NVDA, the free and open source Screen Reader for Microsoft Windows

evuraan/mintPiper: Make Linux speak what's on the screen: clearly and securely.

Make Linux speak what's on the screen: clearly and securely. - evuraan/mintPiper

Screen Reader #Source Code: GitHub #Type: Open-Source

·github.com·Jul 6, 2024

evuraan/mintPiper: Make Linux speak what's on the screen: clearly and securely.

davidacm/NVDA-IBMTTS-Driver · GitHub

This project is aimed at developing and maintaining the NVDA IBMTTS driver. IBMTTS is a synthesizer similar to Eloquence. Please send your ideas and contributions here! - davidacm/NVDA-IBMTTS-Driver

TTS Synthesizer #Source Code: GitHub #Type: Open-Source

·github.com·Jul 6, 2024

davidacm/NVDA-IBMTTS-Driver · GitHub

mush42/sonata-nvda · GitHub

This add-on implements a speech synthesizer driver for NVDA using neural TTS models. It supports Piper - mush42/sonata-nvda

TTS Synthesizer #Source Code: GitHub #Type: Open-Source

·github.com·Jul 6, 2024

mush42/sonata-nvda · GitHub

CrashXBETAX/Text_To_Speech_Live_WinUI3_Public · GitHub

Contribute to CrashXBETAX/Text_To_Speech_Live_WinUI3_Public development by creating an account on GitHub.

TTS model #Source Code: GitHub #Type: Open-Source

·github.com·Jul 6, 2024

CrashXBETAX/Text_To_Speech_Live_WinUI3_Public · GitHub

RHVoice/RHVoice: a free and open source speech synthesizer for Russian and other languages

a free and open source speech synthesizer for Russian and other languages - RHVoice/RHVoice

TTS Engine #Source Code: GitHub #Type: Open-Source

·github.com·Jul 6, 2024

RHVoice/RHVoice: a free and open source speech synthesizer for Russian and other languages

RHVoice.org

TTS Synthesizer

·rhvoice.org·Jul 6, 2024

RHVoice.org

muflone/gespeaker · GitHub

A text to speech GTK+ front-end for eSpeak and mbrola to play a text in many languages with settings for voice, pitch, volume and speed - muflone/gespeaker

TTS Synthesizer #Source Code: GitHub #Type: Open-Source

·github.com·Jul 6, 2024

muflone/gespeaker · GitHub