PageIndex OCR is a long-context OCR model designed to preserve the global structure of documents. It recognizes the hierarchy and semantic relationships that span document pages, aiming to address issues common in traditional OCR. In our internal benchmarks, it outperforms other solutions such as Mistral and Contextual AI.
- Blog: https://pageindex.ai/blog/ocr
- API: https://docs.pageindex.ai/quickstart
- Dashboard: https://dash.pageindex.ai
Feedback is welcome.