Article · HuggingFace Blog
TextQuests: How Good are LLMs at Text-Based Video Games?
TextQuests introduces a benchmark to evaluate LLM-based agents in text-based interactive games, emphasizing long-context reasoning and learning through exploration. It uses 25 classic Infocom games with two modes (With Clues / No Clues) and measures Game Progress and Harm over up to 500 steps, preserving full history for long-context evaluation.
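The evaluation protocol summarized above can be pictured with a minimal sketch. It assumes a hypothetical game interface: `reset`, `step`, `StepResult`, `make_toy_game`, and the way progress and harm are computed here are all illustrative and are not the TextQuests API. It shows only the general shape of a loop that caps an episode at 500 steps, hands the agent the full untruncated history at every step, and tracks a progress score and a harm score.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

MAX_STEPS = 500  # step budget mentioned in the summary


@dataclass
class StepResult:
    observation: str
    progress: float  # hypothetical: fraction of the game completed so far
    harm: float      # hypothetical: harm incurred by this single action
    done: bool


@dataclass
class Episode:
    # Full (observation, action) history; never truncated.
    history: List[Tuple[str, str]] = field(default_factory=list)
    progress: float = 0.0
    harm: float = 0.0


Agent = Callable[[List[Tuple[str, str]], str], str]


def run_episode(reset: Callable[[], str],
                step: Callable[[str], StepResult],
                agent: Agent,
                max_steps: int = MAX_STEPS) -> Episode:
    """Play one episode, passing the whole history to the agent each step."""
    episode = Episode()
    observation = reset()
    for _ in range(max_steps):
        action = agent(episode.history, observation)
        result = step(action)
        episode.history.append((observation, action))
        episode.progress = max(episode.progress, result.progress)
        episode.harm += result.harm
        observation = result.observation
        if result.done:
            break
    return episode


def make_toy_game(goal_steps: int = 3):
    """Toy stand-in for a text game: walk north `goal_steps` times to win."""
    state = {"moves": 0}

    def reset() -> str:
        state["moves"] = 0
        return "You are standing in an open field."

    def step(action: str) -> StepResult:
        if action == "north":
            state["moves"] += 1
        return StepResult(
            observation=f"You have walked north {state['moves']} time(s).",
            progress=state["moves"] / goal_steps,
            harm=1.0 if action == "attack" else 0.0,
            done=state["moves"] >= goal_steps,
        )

    return reset, step


if __name__ == "__main__":
    reset, step = make_toy_game()
    episode = run_episode(reset, step, lambda history, obs: "north")
    print(len(episode.history), episode.progress, episode.harm)
```

In a real run the `agent` callable would wrap an LLM call whose prompt is built from the entire `history`, which is what makes the setting a long-context test.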
Published 12 August 2025
Read the source: huggingface.co/blog/textquests
- Source: HuggingFace Blog
- Ingested: 12 August 2025 · 19:10
- Editorial score: 4.0 / 5