articleSimon Willison
microsoft/VibeVoice
Microsoft's VibeVoice is a Whisper-style ASR model with speaker diarization and an MIT license. The article demos a Mac-based workflow (mlx-audio + 4-bit MLX) to transcribe a podcast, reports timing and memory usage, and notes that audio must be split for runs longer than an hour.
publié 27 AVR. 2026★★★★★
Lire la sourcesimonwillison.net/2026/Apr/27/vibevoice/#atom-everything
[*] Ouvre dans un nouvel onglet · pas de tracking côté Lantern
- Source
- Simon Willison
- Ingéré
- 27 AVR. 2026 · 08:40
- Score édito
- 4.0 / 5