articleGitHub Trending — Python
microsoft/VibeVoice
VibeVoice is an open-source family of frontier voice AI models (ASR and TTS) using 7.5 Hz continuous speech tokenizers and a next-token diffusion framework guided by an LLM. It enables 60-minute long-form ASR with diarization and custom hotwords, and 90-minute multi-speaker TTS with up to 4 speakers and multilingual support. The project ships Colab demos, Hugging Face releases, finetuning code, and a real-time streaming variant.
published APR 28, 2026★★★★★
Read the sourcegithub.com/microsoft/VibeVoice
[*] Opens in a new tab · no tracking on Lantern's side
- Source
- GitHub Trending — Python
- Ingested
- APR 28, 2026 · 08:40
- Editorial score
- 4.0 / 5