├── .github ├── dependabot.yml └── workflows │ ├── ci.yml │ └── codeql-analysis.yml ├── .gitignore ├── CHANGELOG.md ├── Dockerfile ├── LICENSE ├── Makefile ├── README.md ├── exploration-notebooks └── .gitkeep ├── img ├── 0.png └── unstructured_logo.png ├── lib └── libstdc++.so.6 ├── logger_config.yaml ├── pipeline-notebooks ├── .gitkeep └── pipeline-paddleocr.ipynb ├── prepline_paddleocr ├── __init__.py └── api │ ├── __init__.py │ ├── app.py │ └── paddleocr.py ├── preprocessing-pipeline-family.yaml ├── requirements ├── base.in ├── base.txt ├── dev.in ├── dev.txt ├── test.in └── test.txt ├── sample-docs ├── .gitkeep └── sample-receipt.jpg ├── scripts ├── check-and-format-notebooks.py ├── docker-build.sh ├── shellcheck.sh ├── test-doc-pipeline-apis-consistent.sh └── version-sync.sh ├── setup.cfg └── test_paddleocr └── api ├── .gitkeep └── test_paddleocr.py /.github/dependabot.yml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/.github/dependabot.yml -------------------------------------------------------------------------------- /.github/workflows/ci.yml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/.github/workflows/ci.yml -------------------------------------------------------------------------------- /.github/workflows/codeql-analysis.yml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/.github/workflows/codeql-analysis.yml -------------------------------------------------------------------------------- /.gitignore: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/.gitignore -------------------------------------------------------------------------------- /CHANGELOG.md: -------------------------------------------------------------------------------- 1 | ## 0.0.1 2 | -------------------------------------------------------------------------------- /Dockerfile: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/Dockerfile -------------------------------------------------------------------------------- /LICENSE: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/LICENSE -------------------------------------------------------------------------------- /Makefile: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/Makefile -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/README.md -------------------------------------------------------------------------------- /exploration-notebooks/.gitkeep: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /img/0.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/img/0.png -------------------------------------------------------------------------------- /img/unstructured_logo.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/img/unstructured_logo.png -------------------------------------------------------------------------------- /lib/libstdc++.so.6: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/lib/libstdc++.so.6 -------------------------------------------------------------------------------- /logger_config.yaml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/logger_config.yaml -------------------------------------------------------------------------------- /pipeline-notebooks/.gitkeep: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /pipeline-notebooks/pipeline-paddleocr.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/pipeline-notebooks/pipeline-paddleocr.ipynb -------------------------------------------------------------------------------- /prepline_paddleocr/__init__.py: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /prepline_paddleocr/api/__init__.py: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /prepline_paddleocr/api/app.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/prepline_paddleocr/api/app.py -------------------------------------------------------------------------------- /prepline_paddleocr/api/paddleocr.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/prepline_paddleocr/api/paddleocr.py -------------------------------------------------------------------------------- /preprocessing-pipeline-family.yaml: -------------------------------------------------------------------------------- 1 | name: paddleocr 2 | version: 0.0.1 3 | -------------------------------------------------------------------------------- /requirements/base.in: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/requirements/base.in -------------------------------------------------------------------------------- /requirements/base.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/requirements/base.txt -------------------------------------------------------------------------------- /requirements/dev.in: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/requirements/dev.in -------------------------------------------------------------------------------- /requirements/dev.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/requirements/dev.txt -------------------------------------------------------------------------------- /requirements/test.in: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/requirements/test.in -------------------------------------------------------------------------------- /requirements/test.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/requirements/test.txt -------------------------------------------------------------------------------- /sample-docs/.gitkeep: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /sample-docs/sample-receipt.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/sample-docs/sample-receipt.jpg -------------------------------------------------------------------------------- /scripts/check-and-format-notebooks.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/scripts/check-and-format-notebooks.py -------------------------------------------------------------------------------- /scripts/docker-build.sh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/scripts/docker-build.sh -------------------------------------------------------------------------------- /scripts/shellcheck.sh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/scripts/shellcheck.sh -------------------------------------------------------------------------------- /scripts/test-doc-pipeline-apis-consistent.sh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/scripts/test-doc-pipeline-apis-consistent.sh -------------------------------------------------------------------------------- /scripts/version-sync.sh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/scripts/version-sync.sh -------------------------------------------------------------------------------- /setup.cfg: -------------------------------------------------------------------------------- 1 | [flake8] 2 | max-line-length = 100 3 | exclude = 4 | prepline_*/api 5 | -------------------------------------------------------------------------------- /test_paddleocr/api/.gitkeep: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /test_paddleocr/api/test_paddleocr.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Unstructured-IO/pipeline-paddleocr/HEAD/test_paddleocr/api/test_paddleocr.py --------------------------------------------------------------------------------