FeedCette semaineArticle
articleHuggingFace Blog

Open R1: Update #3

Open R1: Update #3 reports progress on reproducing the code-reasoning aspects of DeepSeek-R1, including a new CodeForces-CoTs dataset with ~100k samples in C++ and Python. It introduces the IOI benchmark and OlympicCoder models (7B and 32B), showing that training on CodeForces-CoTs yields top-tier performance, with OlympicCoder-32B outperforming several open-weight models. The post also covers dataset construction, benchmarks, and practical tips for scaling and prompting, such as prefilling with

publié 11 MARS 2025★★★★
Lire la sourcehuggingface.co/blog/open-r1/update-3
[*] Ouvre dans un nouvel onglet · pas de tracking côté Lantern
Source
HuggingFace Blog
Ingéré
11 MARS 2025 · 19:10
Score édito
4.0 / 5