Article · HuggingFace Blog

TextQuests: How Good are LLMs at Text-Based Video Games?

TextQuests introduces a benchmark to evaluate LLM-based agents in text-based interactive games, emphasizing long-context reasoning and learning through exploration. It uses 25 classic Infocom games with two modes (With Clues / No Clues) and measures Game Progress and Harm over up to 500 steps, preserving full history for long-context evaluation.
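The evaluation setup summarized above (a fixed step budget, with the full interaction history kept available to the agent) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: `ToyGame`, `run_episode`, `scripted_policy`, and the checkpoint-fraction progress score are hypothetical names and definitions, not the TextQuests code or API. The Harm metric and the With Clues / No Clues modes are not modeled.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

MAX_STEPS = 500  # step budget stated in the summary

# One history entry: ("observation" or "action", text)
Entry = Tuple[str, str]


@dataclass
class ToyGame:
    """Hypothetical stand-in for a text game; not the TextQuests API."""
    script: List[str]  # non-empty list of actions that advance the game, in order
    position: int = 0

    def reset(self) -> str:
        self.position = 0
        return "You are standing at the start."

    def step(self, action: str) -> Tuple[str, bool]:
        # Advance only when the agent issues the next scripted action.
        if self.position < len(self.script) and action == self.script[self.position]:
            self.position += 1
        done = self.position == len(self.script)
        return f"Checkpoint {self.position} of {len(self.script)}.", done

    def progress(self) -> float:
        # Assumed progress measure: fraction of checkpoints reached.
        return self.position / len(self.script)


def run_episode(game: ToyGame,
                policy: Callable[[List[Entry]], str],
                max_steps: int = MAX_STEPS) -> Tuple[float, int, List[Entry]]:
    """Run one episode, passing the full, untruncated history to the policy."""
    history: List[Entry] = [("observation", game.reset())]
    steps, done = 0, False
    while not done and steps < max_steps:
        action = policy(history)  # the policy sees every prior turn
        observation, done = game.step(action)
        history.append(("action", action))
        history.append(("observation", observation))
        steps += 1
    return game.progress(), steps, history


# Usage: a scripted policy that counts its own past actions in the history.
script = ["open door", "go north", "take lamp"]


def scripted_policy(history: List[Entry]) -> str:
    turns_taken = sum(1 for kind, _ in history if kind == "action")
    return script[turns_taken % len(script)]


progress, steps, history = run_episode(ToyGame(script), scripted_policy)
print(progress, steps, len(history))
```

In a real evaluation the policy would be an LLM call whose prompt is built from `history`; because the history is never truncated, the prompt grows with every step, which is what makes this kind of setup a long-context test.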

Published August 12, 2025

Read the source: huggingface.co/blog/textquests

Source: HuggingFace Blog
Ingested: August 12, 2025 · 19:10
Editorial score: 4.0 / 5