articleHuggingFace Blog
Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H
Surfer-H is a web-native agent powered by the Holo1 family of Action Vision Language Models (VLMs), enabling GUI automation and precise UI localization. Holo1-3B and Holo1-7B achieve up to 76.2% UI localization accuracy on benchmarks, with open-source releases on Hugging Face and a WebClick benchmark of 1,639 tasks. Surfer-H operates entirely in-browser with a three-component architecture (Policy, Localizer, Validator) and claims cost-efficient performance.
publié 03 JUIN 2025★★★★★
Lire la sourcehuggingface.co/blog/Hcompany/holo1
[*] Ouvre dans un nouvel onglet · pas de tracking côté Lantern
- Source
- HuggingFace Blog
- Ingéré
- 03 JUIN 2025 · 19:10
- Score édito
- 4.0 / 5