-语音-文字-视频-

327 bookmarks

Newest

Video To Blog

Instantly convert videos into high quality, SEO optimized blog posts complete with screenshots, AI generated images, internal/external links, CTAs, and more. Try for free.

音视频转文字

·videotoblog.ai·Jan 8, 2024

Video To Blog

GitHub - Vaibhavs10/insanely-fast-whisper

音视频转文字

·github.com·Jan 8, 2024

GitHub - Vaibhavs10/insanely-fast-whisper

Reccap - Reccap 使 YouTube 视频可浏览

Break down videos into visually rich, structured insights. Perfect for students, professionals, and lifelong learners seeking to master complex content quickly.

音视频转文字

·reccap.it·Jan 8, 2024

Reccap - Reccap 使 YouTube 视频可浏览

pods.ee | AI tool for podcast listeners

音视频转文字

·pods.ee·Jan 8, 2024

pods.ee | AI tool for podcast listeners

Moonvalley: Animate your ideas

The imagination research company building ML video and image models that captivate.

文字转音视频

·moonvalley.ai·Jan 8, 2024

Moonvalley: Animate your ideas

Podcasts - A Free Multi-Platform Podcast Player

Podcasts is a light weight browser extension. It will change the way you listen to podcast.

音视频转文字

·podcasts.bluepill.life·Jan 8, 2024

Podcasts - A Free Multi-Platform Podcast Player

alibaba-damo-academy/FunASR: A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc. - modelscope/FunASR

音视频转文字

·github.com·Jan 8, 2024

alibaba-damo-academy/FunASR: A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.

TalkNotes - Turn messy thoughts into actionable notes. Fast.

Turn hours of note taking into seconds. Record voice notes, and let the AI transcribe & structure them into actionable text. Create task lists, transcripts, blog posts, and more! Works in 50+ languages.

音视频转文字

·talknotes.io·Jan 8, 2024

TalkNotes - Turn messy thoughts into actionable notes. Fast.

distil-whisper,用于语音识别的 Whisper 的蒸馏变体,下载distil-whisper的源码_GitHub_酷徒速度提高 6 倍，尺寸缩小 50%，字错误率控制在 1% 以内。

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate. - huggingface/distil-whisper

音视频转文字

·github.com·Jan 8, 2024

distil-whisper,用于语音识别的 Whisper 的蒸馏变体,下载distil-whisper的源码_GitHub_酷徒速度提高 6 倍，尺寸缩小 50%，字错误率控制在 1% 以内。

Whisper Turbo

Transcribe any audio file - completely free!

文字转音视频

·whisper-turbo.com·Jan 8, 2024

Whisper Turbo

GitHub - LokerL/tts-vue at 1.9.10 --- GitHub - LokerL/tts-vue 在 1.9.10

🎤 微软语音合成工具，使用 Electron + Vue + ElementPlus + Vite 构建。

文字转音视频

·github.com·Jan 8, 2024

GitHub - LokerL/tts-vue at 1.9.10 --- GitHub - LokerL/tts-vue 在 1.9.10

Steve.AI | AI Video Generator Tool to create videos using Text

Patented AI video Generator to create +5 video styles. Best AI tool with Hyper-realistic voices, +300 Characters, and +50 GenerativeAI templates.

文字转音视频

·steve.ai·Jan 8, 2024

Steve.AI | AI Video Generator Tool to create videos using Text

guillaumekln/faster-whisper: Faster Whisper transcription with CTranslate2 --- guillaumekln/faster-whisper：使用 CTranslate2 加快 Whisper 转录速度

Faster Whisper transcription with CTranslate2.

音视频转文字

·github.com·Jan 8, 2024

guillaumekln/faster-whisper: Faster Whisper transcription with CTranslate2 --- guillaumekln/faster-whisper：使用 CTranslate2 加快 Whisper 转录速度

WhisperNotes：通过音频捕捉您的想法

Capture your thoughts via audio and let WhisperNotes transcribe them into text based notes via the power of AI. Can tag, search and edit notes.

音视频转文字

·whispernotes.xyz·Jan 8, 2024

WhisperNotes：通过音频捕捉您的想法

Cockatoo - 使用 AI 将音频和视频转换为文本

音视频转文字

·cockatoo.com·Jan 8, 2024

Cockatoo - 使用 AI 将音频和视频转换为文本

Home - Dr. Lambda

音视频转文字

·drlambda.ai·Jan 8, 2024

Home - Dr. Lambda

玄墨。使用 AI 创建视频、图像和 3D 对象。

Genmo trains the world's best open video generation models. Create incredible videos with AI at Genmo

文字转音视频

·genmo.ai·Jan 8, 2024

玄墨。使用 AI 创建视频、图像和 3D 对象。

记录员/总结员

This is a recorder using rev that's recording and then summarizing

音视频转文字

·summarai.app·Jan 8, 2024

记录员/总结员

Seamless Communication Translation Demo

Create translations that follow your speech style. Translate from nearly 100 input languages into 35 output languages. This is a translation research demo powered by AI.

音视频转文字

·seamless.metademolab.com·Jan 8, 2024

Seamless Communication Translation Demo

Accurate AI Transcriptions in Minutes | Powered by Riverside --- 几分钟内准确的 AI 转录 |由河滨提供动力

Transcribe audio and video in 100+ languages with just a few clicks! Riverside's transcriber offers accurate AI transcriptions completely free!

音视频转文字

·riverside.fm·Jan 8, 2024

Accurate AI Transcriptions in Minutes | Powered by Riverside --- 几分钟内准确的 AI 转录 |由河滨提供动力

Whisper Web - Xenova 的拥抱空间

Discover amazing ML apps made by the community

音视频转文字

·huggingface.co·Jan 8, 2024

Whisper Web - Xenova 的拥抱空间

open-mmlab/Amphion：Amphion (/æmˈfaɪən/) 是一个用于音频、音乐和语音生成的工具包。其目的是支持可重复的研究，并帮助初级研究人员和工程师开始音频、音乐和语音生成研究和开发领域。

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi...

文字转音视频

·github.com·Jan 8, 2024

WonderJourney

Project website for WonderJourney

文字转音视频

·kovenyu.com·Jan 8, 2024

WonderJourney

录制

音视频转文字

·transcribe.bloat.app·Jan 8, 2024

录制

Whisper JAX - sanchit-gandhi 的拥抱空间

Discover amazing ML apps made by the community

音视频转文字

·huggingface.co·Jan 8, 2024

Whisper JAX - sanchit-gandhi 的拥抱空间

GitHub - Const-me/Whisper: OpenAI 的 Whisper 自动语音识别 (ASR) 模型的高性能 GPGPU 推理 --- Whisper,OpenAI 的 Whisper 自动语音识别 (ASR) 模型的性能 GPGPU 推理,下载Whisper 的源码_GitHub _帮酷

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model - Const-me/Whisper

音视频转文字

·github.com·Jan 8, 2024

GitHub - ggerganov/whisper.cpp: Port of OpenAI's Whisper model in C/C++ --- Whisper.cpp,C/C++ 中 OpenAI 的 Whisper 模型的端口,下载whisper.cpp的源码_GitHub_帮酷

Port of OpenAI's Whisper model in C/C++.

音视频转文字

·github.com·Jan 8, 2024

GitHub - ggerganov/whisper.cpp: Port of OpenAI's Whisper model in C/C++ --- Whisper.cpp,C/C++ 中 OpenAI 的 Whisper 模型的端口,下载whisper.cpp的源码_GitHub_帮酷

Turn ideas into videos | AI video creator | invideo AI --- 将想法转化为视频 | AI视频创作者 |我羡慕人工智能

Type your video idea and get a full-length with generated with AI clips, stock media, voiceover, subtitles and much more.

文字转音视频

·invideo.io·Jan 8, 2024

Turn ideas into videos | AI video creator | invideo AI --- 将想法转化为视频 | AI视频创作者 |我羡慕人工智能

GitHub - m-bain/whisperX: WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) --- WhisperX,WhisperX: 使用字级时间戳（& Diarization）的自动语音识别,下载whisperX的源码_GitHub_帮酷

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) - m-bain/whisperX

音视频转文字

·github.com·Jan 8, 2024

DeepSpeech/doc/index.rst at r0.9 · mozilla/DeepSpeech · GitHub --- DeepSpeech/doc/index.rst at r0.9 · mozilla/DeepSpeech · GitHub

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. - mozilla/DeepSpeech

音视频转文字

·github.com·Jan 8, 2024

DeepSpeech/doc/index.rst at r0.9 · mozilla/DeepSpeech · GitHub --- DeepSpeech/doc/index.rst at r0.9 · mozilla/DeepSpeech · GitHub