FeedCette semaineArticle
articleSimon Willison

microsoft/VibeVoice

Microsoft's VibeVoice is a Whisper-style ASR model with speaker diarization and an MIT license. The article demos a Mac-based workflow (mlx-audio + 4-bit MLX) to transcribe a podcast, reports timing and memory usage, and notes that audio must be split for runs longer than an hour.

publié 27 AVR. 2026★★★★
Lire la sourcesimonwillison.net/2026/Apr/27/vibevoice/#atom-everything
[*] Ouvre dans un nouvel onglet · pas de tracking côté Lantern
Source
Simon Willison
Ingéré
27 AVR. 2026 · 08:40
Score édito
4.0 / 5