PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 80+ languages.

PaddleOCR
54.4k
Beginner Friendly
External Contribution: High
External Merge Rate: Medium
Avg. Time to Merge: Medium
Good First Issues: 1 issue

Languages

Python: 77.3%
C++: 14.7%
Shell: 5.2%
Java: 1.2%
CMake: 0.4%
Cuda: 0.4%
C: 0.2%
JavaScript: 0.2%
Linker Script: 0.2%
Makefile: 0.2%

Good First Issues

Ideal for first-time contributors
