Agentic OCR
Agentic OCR enables automatic editing of OCR results using vision language models, which can improve accuracy for complex tables (merged cells, nested headers, etc) and tricky text (handwriting, small symbols). To enable agentic OCR, use theenhance.agentic
parameter:
When to use agentic OCR?
Consider using agentic OCR when:- Accuracy is critical for your application, and you’re seeing small discrepancies in standard OCR.
- You’re willing to accept a small increase in processing time and cost (2x credits) for improved accuracy.
OCR system
For advanced users, thesettings.ocr_system
parameter allows you to specify which OCR system to use:
- standard (default): Our best multilingual OCR system that handles documents with languages of all kinds, including English, Spanish, Italian, Portuguese, French, German, and many others.
- legacy: Only supports Germanic languages (English, German, Dutch, etc.) and is available for backwards compatibility.