Nanonets-OCR2-3B – OCR model that transforms documents into structured markdown
12 points 4 comments 19 hours ago
PixelPanda
Excited to share Nanonets-OCR2, a state-of-the-art suite of models designed for advanced image-to-markdown conversion and Visual Question Answering (VQA).
Live Demo -> https://docstrange.nanonets.com/
antant13
wow. so many use cases. nice job.
Wow, OCR is now basically a general domain. I remember spending about a year trying to build an OCR model for receipts; the data curation alone took me 6 months.
Nice job, the scores are superb.
Yes, and it's not just OCR (Optical Character Recognition): it understands layouts and captures signatures, charts, watermarks, etc., so it goes way beyond just characters.