Search Audio computing

Found 46 bookmarks

Newest

ggml-org/whisper.cpp: Port of OpenAI's Whisper model in C/C++

Port of OpenAI's Whisper model in C/C++. Contribute to ggml-org/whisper.cpp development by creating an account on GitHub.

Speech recognition #Source Code: GitHub #Type: Open-Source #Software: AI

·github.com·Dec 10, 2025

ggml-org/whisper.cpp: Port of OpenAI's Whisper model in C/C++

rwth-i6/rasr: The RWTH ASR Toolkit.

The RWTH ASR Toolkit. Contribute to rwth-i6/rasr development by creating an account on GitHub.

Speech recognition #Source Code: GitHub #Type: Open-Source

·github.com·Dec 10, 2025

rwth-i6/rasr: The RWTH ASR Toolkit.

CMUSphinx Open Source Speech Recognition

Source-code: https://github.com/cmusphinx/pocketsphinx/

Speech recognition #Source Code: GitHub #Type: Open-Source

·cmusphinx.github.io·Dec 10, 2025

CMUSphinx Open Source Speech Recognition

Otosaku/OtosakuTTS-iOS · GitHub

Swift library for offline text-to-speech synthesis on iOS/macOS. Generate natural speech directly on device using CoreML-optimized FastPitch and HiFiGAN models. No internet required, fully priv...

TTS Engine #Source Code: GitHub #Type: Open-Source #Software: Library

·github.com·Dec 10, 2025

Otosaku/OtosakuTTS-iOS · GitHub

Edge TTS Demo

Source-code: https://github.com/andresayac/edge-tts, https://github.com/andresayac/edge-tts-php

TTS Synthesizer #Source Code: GitHub #Type: Open-Source

·edge-tts.dayax.net·Oct 17, 2025

Edge TTS Demo

gexgd0419/NaturalVoiceSAPIAdapter: Make Azure natural TTS voices accessible to any SAPI 5-compatible application.

Make Azure natural TTS voices accessible to any SAPI 5-compatible application. - gexgd0419/NaturalVoiceSAPIAdapter

TTS model #Source Code: GitHub #Type: Open-Source

·github.com·Oct 17, 2025

gexgd0419/NaturalVoiceSAPIAdapter: Make Azure natural TTS voices accessible to any SAPI 5-compatible application.

resemble-ai/chatterbox: SoTA open-source TTS

SoTA open-source TTS. Contribute to resemble-ai/chatterbox development by creating an account on GitHub.

TTS Engine #Source Code: GitHub #Type: Open-Source #Software: AI

·github.com·Sep 4, 2025

resemble-ai/chatterbox: SoTA open-source TTS

yl4579/StyleTTS2: StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models - yl4579/StyleTTS2

TTS model #Source Code: GitHub #Type: Open-Source

·github.com·Jul 6, 2025

yl4579/StyleTTS2: StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

ProperCode/Work-by-Speech: Windows app which allows efficient work on a computer by speech alone.

Windows app which allows efficient work on a computer by speech alone. - ProperCode/Work-by-Speech

Voice control #Source Code: GitHub #Type: Open-Source

·github.com·May 4, 2025

ProperCode/Work-by-Speech: Windows app which allows efficient work on a computer by speech alone.

nari-labs/dia: A TTS model capable of generating ltra-realistic dialogue in one pass.

A TTS model capable of generating ultra-realistic dialogue in one pass. - nari-labs/dia

TTS model #Source Code: GitHub #Type: Open-Source

·github.com·May 3, 2025

nari-labs/dia: A TTS model capable of generating ltra-realistic dialogue in one pass.

SesameAILabs/csm: A Conversational Speech Generation Model

A Conversational Speech Generation Model. Contribute to SesameAILabs/csm development by creating an account on GitHub.

TTS Synthesizer #Source Code: GitHub #Type: Open-Source

·github.com·May 3, 2025

SesameAILabs/csm: A Conversational Speech Generation Model

Plachtaa/VALL-E-X: An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/ - Plachtaa/VALL-E-X

TTS Synthesizer #Source Code: GitHub #Type: Open-Source

·github.com·Mar 23, 2025

Plachtaa/VALL-E-X: An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

TTS-WebUI

Source-code: https://github.com/rsxdalv/TTS-WebUI

TTS Synthesizer #❤️#Source Code: GitHub #Type: Open-Source

·ttswebui.com·Mar 23, 2025

TTS-WebUI

ken107/piper-browser-extension · GitHub

Provides Piper neural text-to-speech voices as a browser extension - ken107/piper-browser-extension

TTS model #Source Code: GitHub #Type: Open-Source

·github.com·Dec 4, 2024

ken107/piper-browser-extension · GitHub

abus-aikorea/voice-pro · GitHub

Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube dow...

Speech recognition #Source Code: GitHub #❤️#Type: Open-Source

·github.com·Nov 30, 2024

abus-aikorea/voice-pro · GitHub

open-mmlab/Amphion: Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi...

#Source Code: GitHub #Type: Open-Source

·github.com·Oct 28, 2024

w-okada/voice-changer · GitHub

リアルタイムボイスチェンジャー Realtime Voice Changer. Contribute to w-okada/voice-changer development by creating an account on GitHub.

#Source Code: GitHub #Type: Open-Source

·github.com·Oct 26, 2024

w-okada/voice-changer · GitHub

RVC-Project/Retrieval-based-Voice-Conversion-WebUI: Easily train a good VC model with voice data = 10 mins!

Easily train a good VC model with voice data

#Source Code: GitHub #Type: Open-Source

·github.com·Oct 26, 2024

RVC-Project/Retrieval-based-Voice-Conversion-WebUI: Easily train a good VC model with voice data = 10 mins!

neonbjb/tortoise-tts · GitHub

A multi-voice TTS system trained with an emphasis on quality - neonbjb/tortoise-tts

TTS model #Source Code: GitHub #❤️#Type: Open-Source

·github.com·Oct 19, 2024

neonbjb/tortoise-tts · GitHub

SWivid/F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" - SWivid/F5-TTS

TTS Synthesizer #Source Code: GitHub #Type: Open-Source

·github.com·Oct 19, 2024

SWivid/F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

huggingface/parler-tts · GitHub

Inference and training library for high-quality TTS models. - huggingface/parler-tts

TTS Synthesizer #Source Code: GitHub #Type: Open-Source

·github.com·Sep 16, 2024

huggingface/parler-tts · GitHub

coqui-ai/TTS · GitHub

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production - coqui-ai/TTS

TTS Synthesizer #Source Code: GitHub #Type: Open-Source

·github.com·Sep 16, 2024

coqui-ai/TTS · GitHub

fishaudio/fish-speech: Brand new TTS solution

Brand new TTS solution. Contribute to fishaudio/fish-speech development by creating an account on GitHub.

TTS Synthesizer #Source Code: GitHub #Type: Open-Source

·github.com·Sep 14, 2024

fishaudio/fish-speech: Brand new TTS solution

mkiol/dsnote: Speech Note Linux app

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation. - mkiol/dsnote

TTS Synthesizer #Source Code: GitHub #❤️#Type: Open-Source #OS compatibility: Linux

·github.com·Jul 20, 2024

mkiol/dsnote: Speech Note Linux app

ihuguet/picotts · GitHub

Pico TTS: text to speech voice sinthesizer from SVox, included in Android AOSP - ihuguet/picotts

TTS Synthesizer #Source Code: GitHub #Type: Open-Source

·github.com·Jul 6, 2024

ihuguet/picotts · GitHub

thorstenMueller/Thorsten-Voice · GitHub

Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling. - thorstenMueller/Thorsten-Voice

TTS Engine #Source Code: GitHub #Type: Open-Source

·github.com·Jul 6, 2024

thorstenMueller/Thorsten-Voice · GitHub

nvaccess/nvda: NVDA, the free and open source Screen Reader for Microsoft Windows

NVDA, the free and open source Screen Reader for Microsoft Windows - nvaccess/nvda

Screen Reader #Source Code: GitHub #Type: Open-Source

·github.com·Jul 6, 2024

nvaccess/nvda: NVDA, the free and open source Screen Reader for Microsoft Windows

evuraan/mintPiper: Make Linux speak what's on the screen: clearly and securely.

Make Linux speak what's on the screen: clearly and securely. - evuraan/mintPiper

Screen Reader #Source Code: GitHub #Type: Open-Source

·github.com·Jul 6, 2024

evuraan/mintPiper: Make Linux speak what's on the screen: clearly and securely.

davidacm/NVDA-IBMTTS-Driver · GitHub

This project is aimed at developing and maintaining the NVDA IBMTTS driver. IBMTTS is a synthesizer similar to Eloquence. Please send your ideas and contributions here! - davidacm/NVDA-IBMTTS-Driver

TTS Synthesizer #Source Code: GitHub #Type: Open-Source

·github.com·Jul 6, 2024

davidacm/NVDA-IBMTTS-Driver · GitHub

mush42/sonata-nvda · GitHub

This add-on implements a speech synthesizer driver for NVDA using neural TTS models. It supports Piper - mush42/sonata-nvda

TTS Synthesizer #Source Code: GitHub #Type: Open-Source

·github.com·Jul 6, 2024

mush42/sonata-nvda · GitHub