Article · HuggingFace Blog
Finetuning olmOCR to be a faithful OCR-Engine
Researchers fine-tuned olmOCR-7B-0225-preview to preserve header and footer information, making it a more faithful OCR engine for business documents. They created a dataset of 8,000 documents with Qwen2.5-VL-72B-Instruct, trained with 4 gradient accumulation steps on 8xH100 for 2.5 epochs, and evaluated on header/footer-inclusive data using document anchoring. The result is a practical improvement for invoices and other layout-rich texts.
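As a rough illustration of how the training figures in the summary relate to one another, the sketch below derives an effective batch size and an approximate optimizer step count. The per-device batch size is not stated in the summary, so the value of 1 used here is purely an assumption, as is treating all 8,000 documents as training data; the function name `training_schedule` is hypothetical and not taken from the source.

```python
import math


def training_schedule(num_docs, num_gpus, grad_accum_steps, epochs,
                      per_device_batch_size=1):
    """Estimate effective batch size and optimizer steps for a data-parallel run.

    per_device_batch_size is an ASSUMPTION (the summary does not state it).
    """
    # Each optimizer step consumes one micro-batch per GPU per accumulation step.
    effective_batch = per_device_batch_size * num_gpus * grad_accum_steps
    # Optimizer steps needed to see every document once.
    steps_per_epoch = math.ceil(num_docs / effective_batch)
    # Fractional epochs (e.g. 2.5) simply scale the step count.
    total_steps = math.ceil(steps_per_epoch * epochs)
    return effective_batch, steps_per_epoch, total_steps


# Figures from the summary: 8,000 documents, 8 GPUs, 4 accumulation steps, 2.5 epochs.
effective_batch, steps_per_epoch, total_steps = training_schedule(
    num_docs=8000, num_gpus=8, grad_accum_steps=4, epochs=2.5
)
print(f"effective batch size: {effective_batch}")
print(f"optimizer steps per epoch: {steps_per_epoch}")
print(f"total optimizer steps: {total_steps}")
```

A different per-device batch size, or a held-out evaluation split, would change these numbers proportionally.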
Published 22 Apr. 2025
Read the source: huggingface.co/blog/tngtech/finetuning-olmocr-to-be-a-faithful-ocr-engine
- Source: HuggingFace Blog
- Ingested: 22 Apr. 2025 · 19:10
- Editorial score: 4.0 / 5