A self-hosted PDF OCR API that converts scanned documents to markdown. Powered by PaddleOCR-VL, runs on GPU via Docker.
-
Updated
Apr 19, 2026 - Python
URL: http://github.com/topics/multilingual-ocr
.githubassets.com/assets/primer-a33d805aa3bce2cb.css" />A self-hosted PDF OCR API that converts scanned documents to markdown. Powered by PaddleOCR-VL, runs on GPU via Docker.
🔍 廖工AI设计实战出品 | LiaoGong-OCR — easyocr+tesseract双引擎OCR,15条预处理链,手机拍屏数字识别87%准确率 | Dual-engine OCR with 15 benchmarked preprocessing chains, 87% phone-photo digit accuracy
Multilingual structured OCR (11+ languages, CJK-tuned) — MCP server with verified per-character bboxes for AI agents
Add a description, image, and links to the multilingual-ocr topic page so that developers can more easily learn about it.
To associate your repository with the multilingual-ocr topic, visit your repo's landing page and select "manage topics."