FeedThis weekArticle
articleHuggingFace Blog

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

Surfer-H is a web-native agent powered by the Holo1 family of Action Vision Language Models (VLMs), enabling GUI automation and precise UI localization. Holo1-3B and Holo1-7B achieve up to 76.2% UI localization accuracy on benchmarks, with open-source releases on Hugging Face and a WebClick benchmark of 1,639 tasks. Surfer-H operates entirely in-browser with a three-component architecture (Policy, Localizer, Validator) and claims cost-efficient performance.

published JUN 03, 2025★★★★
Read the sourcehuggingface.co/blog/Hcompany/holo1
[*] Opens in a new tab · no tracking on Lantern's side
Source
HuggingFace Blog
Ingested
JUN 03, 2025 · 19:10
Editorial score
4.0 / 5