├── .github
    └── workflows
    │   └── python-publish.yml
├── .gitignore
├── Dockerfile
├── LICENSE
├── README.md
├── XPDF.jpg
├── app.py
├── azure-pipelines.yml
├── data_extraction.postman_collection.json
├── docker-compose.yml
├── extraction.py
├── global_common.py
├── requirements.txt
├── splitting.py
└── tests
    └── tests


/.github/workflows/python-publish.yml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ahmedkhemiri95/PDFs-TextExtract/HEAD/.github/workflows/python-publish.yml
--------------------------------------------------------------------------------
/.gitignore:
--------------------------------------------------------------------------------
__pycache__
--------------------------------------------------------------------------------
/Dockerfile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ahmedkhemiri95/PDFs-TextExtract/HEAD/Dockerfile
--------------------------------------------------------------------------------
/LICENSE:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ahmedkhemiri95/PDFs-TextExtract/HEAD/LICENSE
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ahmedkhemiri95/PDFs-TextExtract/HEAD/README.md
--------------------------------------------------------------------------------
/XPDF.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ahmedkhemiri95/PDFs-TextExtract/HEAD/XPDF.jpg
--------------------------------------------------------------------------------
/app.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ahmedkhemiri95/PDFs-TextExtract/HEAD/app.py
--------------------------------------------------------------------------------
/azure-pipelines.yml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ahmedkhemiri95/PDFs-TextExtract/HEAD/azure-pipelines.yml
--------------------------------------------------------------------------------
/data_extraction.postman_collection.json:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ahmedkhemiri95/PDFs-TextExtract/HEAD/data_extraction.postman_collection.json
--------------------------------------------------------------------------------
/docker-compose.yml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ahmedkhemiri95/PDFs-TextExtract/HEAD/docker-compose.yml
--------------------------------------------------------------------------------
/extraction.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ahmedkhemiri95/PDFs-TextExtract/HEAD/extraction.py
--------------------------------------------------------------------------------
/global_common.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ahmedkhemiri95/PDFs-TextExtract/HEAD/global_common.py
--------------------------------------------------------------------------------
/requirements.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ahmedkhemiri95/PDFs-TextExtract/HEAD/requirements.txt
--------------------------------------------------------------------------------
/splitting.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ahmedkhemiri95/PDFs-TextExtract/HEAD/splitting.py
--------------------------------------------------------------------------------
/tests/tests:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ahmedkhemiri95/PDFs-TextExtract/HEAD/tests/tests
--------------------------------------------------------------------------------