FeedCette semaineArticle
articleHuggingFace Blog

Finetuning olmOCR to be a faithful OCR-Engine

Researchers fine-tuned olmOCR-7B-0225-preview to preserve header and footer information, making it a more faithful OCR engine for business documents. They created a dataset of 8,000 documents with Qwen2.5-VL-72B-Instruct, trained with 4 gradient accumulation steps on 8xH100 for 2.5 epochs, and evaluated on header/footer-inclusive data using document anchoring. The result is a practical improvement for invoices and other layout-rich texts.

publié 22 AVR. 2025★★★★
Lire la sourcehuggingface.co/blog/tngtech/finetuning-olmocr-to-be-a-faithful-ocr-engine
[*] Ouvre dans un nouvel onglet · pas de tracking côté Lantern
Source
HuggingFace Blog
Ingéré
22 AVR. 2025 · 19:10
Score édito
4.0 / 5