GitHub - guillaumekln/faster-whisper: Faster Whisper transcription with CTranslate2
GitHub - KoljaB/AIVoiceChat: Low-latency AI companion voice talk in 60 lines of code using faster_whisper and ElevenLabs input streaming
GitHub - ZakiGll/story_teller
GitHub - xorbitsai/inference: Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
GitHub - huginn/huginn: Create agents that monitor and act on your behalf. Your agents are standing by!
ElevenLabs Text Input Streaming demo for LLMs
GitHub - Anemolo/Infinite-Lights
joonspk-research/generative_agents
googlecreativelab/quickdraw-dataset: Documentation on how to access and use the Quick, Draw! Dataset.
Retrieval-based-Voice-Conversion-WebUI/docs/training_tips_en.md at main · RVC-Project/Retrieval-based-Voice-Conversion-WebUI
GitHub - tin2tin/Generative_AI: Text to video, image and audio in Blender Video Sequence Editor using Modelscope, Zeroscope (SD, XL, upscale to XL), Animov, Potat1, Stable Diffusion (1.5, 2.0, XL), Deep Floyd IF, AudioLDM and Bark.