article · HuggingFace Blog

TextQuests: How Good are LLMs at Text-Based Video Games?

TextQuests introduces a benchmark to evaluate LLM-based agents in text-based interactive games, emphasizing long-context reasoning and learning through exploration. It uses 25 classic Infocom games with two modes (With Clues / No Clues) and measures Game Progress and Harm over up to 500 steps, preserving full history for long-context evaluation.
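The evaluation setup described above can be sketched as a simple loop. This is a minimal illustration, not the benchmark's actual harness: `ToyGame`, `StepResult`, `run_episode`, and `scripted_agent` are hypothetical names, the toy game stands in for a real Infocom title, and the way progress and harm are scored here is an assumption. Only the 500-step cap, the two modes, and the full-history requirement come from the summary.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple

MAX_STEPS = 500  # per-game step cap taken from the summary


@dataclass
class StepResult:
    observation: str
    progress: float  # assumed: fraction of checkpoints reached so far, in [0, 1]
    harm: float      # assumed: harm score attributed to this single action
    done: bool


class ToyGame:
    """Hypothetical two-checkpoint game; a stand-in, not a real Infocom title."""

    CHECKPOINTS = ["open door", "go north"]

    def __init__(self) -> None:
        self.reached = 0

    def reset(self) -> str:
        self.reached = 0
        return "You are in a room with a closed door."

    def step(self, action: str) -> StepResult:
        # Assumption: one specific action is flagged as harmful.
        harm = 1.0 if action == "break window" else 0.0
        if self.reached < len(self.CHECKPOINTS) and action == self.CHECKPOINTS[self.reached]:
            self.reached += 1
        progress = self.reached / len(self.CHECKPOINTS)
        done = self.reached == len(self.CHECKPOINTS)
        return StepResult(f"You {action}.", progress, harm, done)


# An agent maps the full interaction history to its next action.
Agent = Callable[[List[str]], str]


def run_episode(
    game: ToyGame,
    agent: Agent,
    clues: Optional[str] = None,  # "With Clues" mode passes hint text; "No Clues" passes None
    max_steps: int = MAX_STEPS,
) -> Tuple[float, float, List[str]]:
    history: List[str] = []
    if clues is not None:
        history.append(clues)
    history.append(game.reset())
    progress, total_harm = 0.0, 0.0
    for _ in range(max_steps):
        action = agent(history)  # the agent sees the full, untruncated history
        result = game.step(action)
        history += [f"> {action}", result.observation]
        progress = result.progress
        total_harm += result.harm
        if result.done:
            break
    return progress, total_harm, history


def scripted_agent(history: List[str]) -> str:
    """Hypothetical agent that replays a fixed script, one action per turn."""
    script = ["break window", "open door", "go north"]
    turn = sum(1 for line in history if line.startswith("> "))
    return script[min(turn, len(script) - 1)]
```

Calling `run_episode(ToyGame(), scripted_agent)` runs the "No Clues" variant; passing a `clues` string places that text at the start of the history instead. Because the history is never truncated, the context handed to the agent grows with every step, which is what makes the setting a long-context test.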

published AUG 12, 2025

Read the source: huggingface.co/blog/textquests


Source: HuggingFace Blog
Ingested: AUG 12, 2025 · 19:10
Editorial score: 4.0 / 5