FeedThis weekArticle
articleSimon Willison

microsoft/VibeVoice

Microsoft's VibeVoice is a Whisper-style ASR model with speaker diarization and an MIT license. The article demos a Mac-based workflow (mlx-audio + 4-bit MLX) to transcribe a podcast, reports timing and memory usage, and notes that audio must be split for runs longer than an hour.

published APR 27, 2026★★★★
Read the sourcesimonwillison.net/2026/Apr/27/vibevoice/#atom-everything
[*] Opens in a new tab · no tracking on Lantern's side
Source
Simon Willison
Ingested
APR 27, 2026 · 08:40
Editorial score
4.0 / 5