padpdf-ocr

PaddleOCR for Chinese PDFs

Rationale:

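For day-to-day recognition of Chinese PDFs, the options are either Adobe Acrobat, which is reasonably accurate but is paid software, or tesseract, which only works well on very high-resolution scans. This project combines pymupdf and paddleocr to recognize text in scanned Chinese PDFs and apply simple layout to the result.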

Usage:

python pdf-ocr.py <pdf file path>
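
The pymupdf + paddleocr combination described in the rationale can be sketched roughly as follows. This is a minimal illustration, not the actual contents of pdf-ocr.py: the names `group_into_lines`, `ocr_pdf`, `y_tolerance`, and `dpi` are hypothetical, the layout rule (merging boxes whose vertical centers are close) is an assumed stand-in for the script's "simple layout" step, and the PaddleOCR call assumes the 2.x-style `PaddleOCR(...).ocr(img, cls=True)` interface, whose return shape differs between releases.

```python
from typing import List, Sequence, Tuple

# One OCR hit: four (x, y) corner points plus the recognized text.
Box = Sequence[Sequence[float]]
Hit = Tuple[Box, str]


def group_into_lines(hits: List[Hit], y_tolerance: float = 10.0) -> List[str]:
    """Merge OCR hits with similar vertical centers into left-to-right lines.

    Assumed layout heuristic for illustration only.
    """
    def center_y(box: Box) -> float:
        return sum(p[1] for p in box) / len(box)

    def left_x(box: Box) -> float:
        return min(p[0] for p in box)

    rows = []  # each row: [reference_y, [(x, text), ...]]
    for box, text in sorted(hits, key=lambda h: center_y(h[0])):
        cy = center_y(box)
        if rows and abs(cy - rows[-1][0]) <= y_tolerance:
            rows[-1][1].append((left_x(box), text))
        else:
            rows.append([cy, [(left_x(box), text)]])

    # Chinese text has no inter-word spaces, so fragments are joined directly.
    return [
        "".join(t for _, t in sorted(items, key=lambda it: it[0]))
        for _, items in rows
    ]


def ocr_pdf(pdf_path: str, dpi: int = 200) -> List[List[str]]:
    """Render each page with PyMuPDF, OCR it with PaddleOCR, return lines per page."""
    # Heavy dependencies are imported lazily so the layout helper above
    # can be used and tested without them.
    import fitz  # PyMuPDF
    import numpy as np
    from paddleocr import PaddleOCR

    engine = PaddleOCR(use_angle_cls=True, lang="ch")  # assumed 2.x-style API
    pages = []
    with fitz.open(pdf_path) as doc:
        for page in doc:
            pix = page.get_pixmap(dpi=dpi)  # RGB without alpha by default
            rgb = np.frombuffer(pix.samples, dtype=np.uint8).reshape(
                pix.height, pix.width, pix.n
            )
            bgr = np.ascontiguousarray(rgb[:, :, ::-1])  # OpenCV-style channel order
            result = engine.ocr(bgr, cls=True)
            raw = result[0] or []  # a page with no text may yield None
            hits = [(box, text) for box, (text, _score) in raw]
            pages.append(group_into_lines(hits))
    return pages
```

The layout helper is kept free of OCR dependencies so it can be tried on hand-written boxes; `ocr_pdf("scan.pdf")` (a placeholder file name) would then return one list of text lines per page, assuming pymupdf, numpy, and paddleocr are installed.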
