FeedThis weekArticle
articleHuggingFace Blog

Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models

Kimina-Prover-72B uses a Test-Time Reinforcement Learning (TTRL) search to autonomously discover and reuse lemmas for long-horizon Lean proofs, plus an error-fixing module that interprets Lean messages for targeted corrections. It achieves a state-of-the-art miniF2F performance (92.2% pass rate) and shows pass@1/32/1024 of 63.9/84.0/87.7, with two distilled variants released (8B and 1.7B).

published JUL 10, 2025★★★★
Read the sourcehuggingface.co/blog/AI-MO/kimina-prover
[*] Opens in a new tab · no tracking on Lantern's side
Source
HuggingFace Blog
Ingested
JUL 10, 2025 · 19:10
Editorial score
4.0 / 5