FeedCette semaineArticle
articleHuggingFace Blog

TextQuests: How Good are LLMs at Text-Based Video Games?

TextQuests introduces a benchmark to evaluate LLM-based agents in text-based interactive games, emphasizing long-context reasoning and learning through exploration. It uses 25 classic Infocom games with two modes (With Clues / No Clues) and measures Game Progress and Harm over up to 500 steps, preserving full history for long-context evaluation.

publié 12 AOÛT 2025★★★★
Lire la sourcehuggingface.co/blog/textquests
[*] Ouvre dans un nouvel onglet · pas de tracking côté Lantern
Source
HuggingFace Blog
Ingéré
12 AOÛT 2025 · 19:10
Score édito
4.0 / 5