FeedCette semaineArticle
articleHuggingFace Blog

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

Surfer-H is a web-native agent powered by the Holo1 family of Action Vision Language Models (VLMs), enabling GUI automation and precise UI localization. Holo1-3B and Holo1-7B achieve up to 76.2% UI localization accuracy on benchmarks, with open-source releases on Hugging Face and a WebClick benchmark of 1,639 tasks. Surfer-H operates entirely in-browser with a three-component architecture (Policy, Localizer, Validator) and claims cost-efficient performance.

publié 03 JUIN 2025★★★★
Lire la sourcehuggingface.co/blog/Hcompany/holo1
[*] Ouvre dans un nouvel onglet · pas de tracking côté Lantern
Source
HuggingFace Blog
Ingéré
03 JUIN 2025 · 19:10
Score édito
4.0 / 5