articleHuggingFace Blog
TextQuests: How Good are LLMs at Text-Based Video Games?
TextQuests introduces a benchmark to evaluate LLM-based agents in text-based interactive games, emphasizing long-context reasoning and learning through exploration. It uses 25 classic Infocom games with two modes (With Clues / No Clues) and measures Game Progress and Harm over up to 500 steps, preserving full history for long-context evaluation.
published AUG 12, 2025★★★★★
Read the sourcehuggingface.co/blog/textquests
[*] Opens in a new tab · no tracking on Lantern's side
- Source
- HuggingFace Blog
- Ingested
- AUG 12, 2025 · 19:10
- Editorial score
- 4.0 / 5