olmOCR
Overview
olmOCR is AllenAI’s open-source document recognition pipeline and model family that converts PDFs and images into clean text, preserving reading order, tables, equations, and handwriting.
About Ai2
Ai2 is a 501(c)(3) non-profit AI research institute founded in 2014 by the late Paul Allen (Microsoft co-founder), dedicated to conducting high-impact, open AI research and engineering for the common good, including open language models (OLMo), scientific AI tools (Semantic Scholar, Asta), environmental AI platforms, and embodied robotics research.
