Nanonets-OCR2-3B – OCR model that transforms documents into structured markdown

12 points 4 comments 19 hours ago
igorpcosta

Wow, OCR is now basically a general domain. I remember when I spent like a year trying to create one for receipts. Took me 6 months of data curation to prepare.

Nice job, the scores are superb.

prats226

Yes, and its not just OCR (Optical Character Recognition), it understands layouts, captures signatures, charts, watermarks etc so way beyond just characters

PixelPanda

Excited to share Nanonets-OCR2, a state-of-the-art suite of models designed for advanced image-to-markdown conversion and Visual Question Answering (VQA).

Live Demo -> https://docstrange.nanonets.com/

Blog -> https://nanonets.com/research/nanonets-ocr-2/

antant13

wow. so many use cases. nice job.

Made by @calebRussel