Joined December 2016
313 Photos and videos
Pinned Tweet
1 Aug 2025
Your LLMs are hungry for data, but documents are messy 😩. DocStrange is the answer! Our open-source solution turns any document into clean, LLM-ready data with one command. Give your models what they need. šŸ”— github.com/NanoNets/docstran… šŸ”— pypi.org/project/docstrange/
4
10
29
2,470
Nanonets OCR-3 is live. This is the most accurate OCR model in the world currently. 87.4 on OLM-OCR (Global #1) 85.9 on IDP Leaderboard (Global #1) 90.5 on OmniDocBench OCR-3 also ships with two critical features that foundational models and VLMs miss today - confidence scores and bounding boxes.
8
20
36
608
Nanonets OCR-3 is the only OCR model you'll need in your agentic stack. The model API exposes five endpoints - /parse - structured markdown /extract - structured outputs in your schema /split - classify or split outputs based on content /chunk - context-aware chunks optimized for RAG /vqa - grounded answers with bboxes over sources We've specifically fine-tuned the model on edge cases where OCR repeatedly fails - complex tables, forms, non-trivial layouts.
5
136
With bounding boxes, you get exact coordinates for every extracted element. Use them for - 1. RAG citations 2. Feeding specific document regions to agents 3. Agent observability With confidence scores, you can measure reliability of every extraction. Pass high-confidence outputs directly, route low-confidence outputs to human review or a larger model. Use them to push your net accuracy to near 100%.
1
6
190
Nanonets retweeted

1
1
3
137
Nanonets retweeted
Introducing Nanonets-OCR2: a lightweight 3B VLM that transforms documents into clean, structured Markdown and is capable of VQA. We have trained the model on close to 3 million documents. It is multi-lingual and can handle handwritten documents.
9
7
26
1,354
Nanonets retweeted
Replying to @nanonets
@nanonets just shipped Nanonets-OCR2: new 3B VLM for OCR! LaTeX equations, tables, handwriting, charts, multilingual - it does it all! You can try it against your data with one command via @huggingface Jobs - no local GPU needed! The HF Jobs command/output from the model šŸ‘‡
6
14
72
24,421
1 Aug 2025
Your LLMs are hungry for data, but documents are messy 😩. DocStrange is the answer! Our open-source solution turns any document into clean, LLM-ready data with one command. Give your models what they need. šŸ”— github.com/NanoNets/docstran… šŸ”— pypi.org/project/docstrange/
4
10
29
2,470
1 Aug 2025
✨ Features: • One-line installation • Intelligent OCR • Custom field extraction • JSON schema support • Zero-config cloud processing Perfect for AI training data prep! pip install docstrange #OpenSource #LLMReadyData #DocumentProcessing #Python #AI
1
441
Nanonets retweeted
We're excited to shareĀ Nanonets-OCR-s, a powerful and lightweight (3B) VLM model that converts documents into clean, structuredĀ Markdown. This model is trained to understand document structure and content context, like tables, equations, images, plots, watermarks, checkboxes, etc
12
11
40
2,262
9 May 2025
Just launched: The smartest AI-powered Resume Builder — 100% free, no strings attached! Try it now: resume.nanonets.com #jobsearch #resume #AItools #resumetips #careerdevelopment #nanonets

1
411
19 Mar 2025
#changelog Flexible Model Training Options. Our ā€œImprove Modelā€ feature has evolved to provide greater flexibility and precision in selecting training files. šŸ‘‰ changelog.nanonets.com/flexi…
1
3
402
17 Feb 2025
#changelog Identify New Templates in Instant Learning Models. You can now identify and flag new templates in Instant Learning Models through an approval rule. . šŸ‘‰ changelog.nanonets.com/ident…
1
368
Nanonets retweeted
12 Mar 2024
AI-based workflow automation platform @nanonets has raised Series B funding of $29 Mn (INR 240 Cr) šŸ‘‡ The latest round takes Nanonets’ total funding to $40 Mn to date. The startup plans to deploy the fresh proceeds for research and development, improving algorithms for handling unstructured data and launching new products. Besides, it aims to fuel growth by scaling up marketing and sales efforts to capitalise on the increasing demand for AI-based solutions. Founded in 2017 by Sarthak Jain and Prathamesh Juvatkar, Nanonets helps businesses leverage AI to make workflow automation easier. To read more in detail, click here: 4-2.co/3wOTp4x #startup #fundraising #investment #artificialintelligence
2
11
2,573
Nanonets retweeted
Replying to @nanonets
@nanonets raises $29M led by @Accel for AI-driven workflow automation. Offering no-code solutions to extract insights from documents, integrating with ERP systems for streamlined processes. Funds to fuel R&D and scale sales efforts. #Nanonets #AIStartup #Accel
1
2
7
903