PaddleOCR

Name: PaddleOCR
Author: PaddlePaddle

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit

by PaddlePaddle

Rating

0.0

Votes

score

Downloads

total

Price

Free

Access token required

Works With

Claude CodeCursorWindsurfVS CodeDeveloper tool

About

Global Leading OCR Toolkit & Document AI Engine

](https://pepy.tech/projects/paddleocr) [

](https://www.paddleocr.com) [ [](../LICENSE)

PaddleOCR converts PDF documents and images into structured, LLM-ready data (JSON/Markdown) with industry-leading accuracy. With 70k+ Stars and trusted by top-tier projects like Dify, RAGFlow, and Cherry Studio, PaddleOCR is the bedrock for building intelligent RAG and Agentic applications.

🚀 Key Features

📄 Intelligent Document Parsing (LLM-Ready)

Transforming messy visuals into structured data for the LLM era.

SOTA Document VLM: Featuring PaddleOCR-VL-1.5 (0.9B), the industry's leading lightweight vision-language model for document parsing. It excels in parsing complex documents across 5 major "Real-World" challenges: Warping, Scanning, Screen Photography, Illumination, and Skewed documents, with structured outputs in Markdown and JSON formats.
Structure-Aware Conversion: Powered by PP-StructureV3, seamlessly convert complex PDFs and images into Markdown or JSON. Unlike the PaddleOCR-VL series models, it provides more fine-grained coordinate information, including table cell coordinates, text coordinates, and more.
Production-Ready Efficiency: Achieve commercial-grade accuracy with an ultra-small footprint. Outperforms numerous closed-source solutions in public benchmarks while remaining resource-efficient for edge/cloud deployment.

🔍 Universal Text Recognition (Scene OCR)

The global gold standard for high-speed, multilingual text spotting.

100+ Languages Supported: Native recognition for a vast global library. Our PP-OCRv5 single-model solution elegantly handles multilingual mixed documents (Chinese, English, Japanese, Pinyin, etc.).
Complex Element Mastery: Beyond standard text recognition, we support natural scene text spotting across a wide range of environments, including IDs, street views, books, and industrial components
Performance Leap: PP-OCRv5 delivers a 13% accuracy boost over previous versions, maintaining the "Extreme Efficiency" that PaddleOCR is famous for.

Don't lose this

Three weeks from now, you'll want PaddleOCR again. Will you remember where to find it?

Save it to your library and the next time you need PaddleOCR, it’s one tap away — from any AI app you use. Group it into a bench with the rest of the team for that kind of task and you can pull the whole stack at once.

⚡ Pro tip for geeks: add a-gnt 🤵🏻‍♂️ as a custom connector in Claude or a custom GPT in ChatGPT — one click and your library is right there in the chat. Or, if you’re in an editor, install the a-gnt MCP server and say “use my [bench name]” in Claude Code, Cursor, VS Code, or Windsurf.

🤵🏻‍♂️

a-gnt's Take

Our honest review

This plugs directly into your AI and gives it new abilities it didn't have before. Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit. Once connected, just ask your AI to use it. It's completely free and works across most major AI apps.

Tips for getting started

Tap "Get" above, pick your AI app, and follow the steps. Most installs take under 30 seconds.

Design & Media

What's New

Version 1.0.03 months ago

Imported from GitHub

Ratings & Reviews

0.0

out of 5

0 ratings

No reviews yet. Be the first to share your experience.

View Source Code

Featured in Benches

MCP Server Starter Pack

8 tools · by joey-io

Context7

DEV

Up-to-date docs for any library, instantly

by Upstash

Ref Tools MCP

DEV

Up-to-date documentation for thousands of public repos

by ref-tools

Puppeteer

DEV

Control a web browser — navigate, screenshot, and interact with pages

by Anthropic

Supabase MCP

DEV

Connect AI agents to Supabase database, auth, and edge functions

by supabase-community

BrowserStack MCP

DEV

Cross-browser testing platform for AI agents

by browserstack

FHIR Healthcare MCP

DEV

Healthcare data via FHIR API integration

by wso2