├── .github ├── ISSUE_TEMPLATE │ ├── bug.md │ └── config.yml └── workflows │ └── gen_whl_to_pypi.yml ├── .gitignore ├── .pre-commit-config.yaml ├── LICENSE ├── README.md ├── cliff.toml ├── demo.py ├── docs └── docs.md ├── rapidocr_pdf ├── __init__.py ├── main.py └── utils │ ├── __init__.py │ ├── logger.py │ └── utils.py ├── requirements.txt ├── setup.py └── tests ├── test_files ├── direct_and_image.pdf ├── direct_extract.pdf └── image.pdf └── test_main.py /.github/ISSUE_TEMPLATE/bug.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/.github/ISSUE_TEMPLATE/bug.md -------------------------------------------------------------------------------- /.github/ISSUE_TEMPLATE/config.yml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/.github/ISSUE_TEMPLATE/config.yml -------------------------------------------------------------------------------- /.github/workflows/gen_whl_to_pypi.yml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/.github/workflows/gen_whl_to_pypi.yml -------------------------------------------------------------------------------- /.gitignore: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/.gitignore -------------------------------------------------------------------------------- /.pre-commit-config.yaml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/.pre-commit-config.yaml -------------------------------------------------------------------------------- /LICENSE: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/LICENSE -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/README.md -------------------------------------------------------------------------------- /cliff.toml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/cliff.toml -------------------------------------------------------------------------------- /demo.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/demo.py -------------------------------------------------------------------------------- /docs/docs.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/docs/docs.md -------------------------------------------------------------------------------- /rapidocr_pdf/__init__.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/rapidocr_pdf/__init__.py -------------------------------------------------------------------------------- /rapidocr_pdf/main.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/rapidocr_pdf/main.py -------------------------------------------------------------------------------- /rapidocr_pdf/utils/__init__.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/rapidocr_pdf/utils/__init__.py -------------------------------------------------------------------------------- /rapidocr_pdf/utils/logger.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/rapidocr_pdf/utils/logger.py -------------------------------------------------------------------------------- /rapidocr_pdf/utils/utils.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/rapidocr_pdf/utils/utils.py -------------------------------------------------------------------------------- /requirements.txt: -------------------------------------------------------------------------------- 1 | filetype>=1.2.0 2 | pymupdf 3 | rapidocr>=2.0.7 4 | colorlog 5 | onnxruntime -------------------------------------------------------------------------------- /setup.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/setup.py -------------------------------------------------------------------------------- /tests/test_files/direct_and_image.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/tests/test_files/direct_and_image.pdf -------------------------------------------------------------------------------- /tests/test_files/direct_extract.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/tests/test_files/direct_extract.pdf -------------------------------------------------------------------------------- /tests/test_files/image.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/tests/test_files/image.pdf -------------------------------------------------------------------------------- /tests/test_main.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/RapidAI/RapidOCRPDF/HEAD/tests/test_main.py --------------------------------------------------------------------------------