articleHuggingFace Blog
Open R1: Update #3
Open R1: Update #3 reports progress on reproducing the code-reasoning aspects of DeepSeek-R1, including a new CodeForces-CoTs dataset with ~100k samples in C++ and Python. It introduces the IOI benchmark and OlympicCoder models (7B and 32B), showing that training on CodeForces-CoTs yields top-tier performance, with OlympicCoder-32B outperforming several open-weight models. The post also covers dataset construction, benchmarks, and practical tips for scaling and prompting, such as prefilling with
publié 11 MARS 2025★★★★★
Lire la sourcehuggingface.co/blog/open-r1/update-3
[*] Ouvre dans un nouvel onglet · pas de tracking côté Lantern
- Source
- HuggingFace Blog
- Ingéré
- 11 MARS 2025 · 19:10
- Score édito
- 4.0 / 5