Audio computing

172 bookmarks
NVIDIA PersonaPlex: Natural Conversational AI With Any Role and Voice - NVIDIA ADLR
We introduce PersonaPlex, a full-duplex conversational AI model that enables natural conversations with customizable voices and roles. PersonaPlex handles interruptions and backchannels while maintaining any chosen persona, outperforming existing systems on conversational dynamics and task adherence.
·research.nvidia.com·
QwenLM/Qwen3-TTS: Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice cloning.
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice...
·github.com·
Dolphin - ScreenReader
Dolphin ScreenReader is fast and reliable screen reading software for blind people. Customise for a fully accessible screen reading experience.
·yourdolphin.com·
Microsoft Speech SDK 5.1 - Microsoft Download Center
The Microsoft Speech SDK 5.1 adds Automation support to the features of the previous version of the Speech SDK. You can now use the Win32 Speech API (SAPI) to develop speech applications with Visual Basic®, ECMAScript, and other Automation languages.
·microsoft.com·