Finetuning olmOCR to be a faithful OCR-Engine
Researchers fine-tuned olmOCR-7B-0225-preview to preserve header and footer information, making it a more faithful OCR engine for business documents. They created a dataset of 8,000 documents with Qwen2.5-VL-72B-Instruct, trained with 4 gradient accumulation steps on 8xH100 for 2.5 epochs, and evaluated on header/footer-inclusive data using document anchoring. The result is a practical improvement for invoices and other layout-rich texts.