Article · HuggingFace Blog

Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models

Kimina-Prover-72B uses Test-Time Reinforcement Learning (TTRL) search to autonomously discover and reuse lemmas in long-horizon Lean proofs, together with an error-fixing module that reads Lean error messages and applies targeted corrections. It reaches a state-of-the-art 92.2% pass rate on miniF2F and reports pass@1/32/1024 of 63.9%/84.0%/87.7%. Two distilled variants (8B and 1.7B) are also released.
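
The summary does not say how the pass@k figures were computed. As a rough illustration only, the sketch below uses the unbiased pass@k estimator that is common in code-generation and theorem-proving evaluations, assuming n sampled proofs per problem of which c are accepted by the checker. The function name and parameters are hypothetical and are not taken from the Kimina-Prover release.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem (illustrative, not the authors' code).

    n: proofs sampled for the problem
    c: how many of those proofs were verified
    k: attempt budget being evaluated

    Uses the common unbiased estimator 1 - C(n - c, k) / C(n, k), i.e. one
    minus the probability that k samples drawn without replacement from the
    n attempts are all failures.
    """
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    # math.comb returns 0 when k > n - c, so the estimate is then exactly 1.0.
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Average pass@k over problems given (n, c) pairs per problem."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

A benchmark-level figure such as pass@32 would then be the mean of this per-problem estimate, provided each problem was sampled at least 32 times.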

Published: JUL 10, 2025

Read the source: huggingface.co/blog/AI-MO/kimina-prover

Source: HuggingFace Blog
Ingested: JUL 10, 2025 · 19:10
Editorial score: 4.0 / 5