FeedThis weekArticle
articleHuggingFace Blog

Finetuning olmOCR to be a faithful OCR-Engine

Researchers fine-tuned olmOCR-7B-0225-preview to preserve header and footer information, making it a more faithful OCR engine for business documents. They created a dataset of 8,000 documents with Qwen2.5-VL-72B-Instruct, trained with 4 gradient accumulation steps on 8xH100 for 2.5 epochs, and evaluated on header/footer-inclusive data using document anchoring. The result is a practical improvement for invoices and other layout-rich texts.

published APR 22, 2025★★★★
Source
HuggingFace Blog
Ingested
APR 22, 2025 · 19:10
Editorial score
4.0 / 5