FeedCette semaineArticle
articleGitHub Trending — Python

microsoft/VibeVoice

VibeVoice is an open-source family of frontier voice AI models (ASR and TTS) using 7.5 Hz continuous speech tokenizers and a next-token diffusion framework guided by an LLM. It enables 60-minute long-form ASR with diarization and custom hotwords, and 90-minute multi-speaker TTS with up to 4 speakers and multilingual support. The project ships Colab demos, Hugging Face releases, finetuning code, and a real-time streaming variant.

publié 28 AVR. 2026★★★★
Lire la sourcegithub.com/microsoft/VibeVoice
[*] Ouvre dans un nouvel onglet · pas de tracking côté Lantern
Source
GitHub Trending — Python
Ingéré
28 AVR. 2026 · 08:40
Score édito
4.0 / 5