├── .gitignore
├── LICENSE
├── README.md
├── deleteYourPDF
    ├── __init__.py
    └── actions.py
├── delete_your_pdf.egg-info
    ├── PKG-INFO
    ├── SOURCES.txt
    ├── dependency_links.txt
    ├── requires.txt
    └── top_level.txt
├── dist
    ├── delete_your_pdf-1.0.2-py3-none-any.whl
    ├── delete_your_pdf-1.0.2.tar.gz
    ├── delete_your_pdf-1.0.3-py3-none-any.whl
    └── delete_your_pdf-1.0.3.tar.gz
└── setup.py

/.gitignore:
--------------------------------------------------------------------------------
1 | __pycache__/
--------------------------------------------------------------------------------
/LICENSE:
--------------------------------------------------------------------------------
1 | Apache License 2 | Version 2.0, January 2004 3 | http://www.apache.org/licenses/ 4 | 5 | TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION 6 | 7 | 1. Definitions. 8 | 9 | "License" shall mean the terms and conditions for use, reproduction, 10 | and distribution as defined by Sections 1 through 9 of this document. 11 | 12 | "Licensor" shall mean the copyright owner or entity authorized by 13 | the copyright owner that is granting the License. 14 | 15 | "Legal Entity" shall mean the union of the acting entity and all 16 | other entities that control, are controlled by, or are under common 17 | control with that entity. For the purposes of this definition, 18 | "control" means (i) the power, direct or indirect, to cause the 19 | direction or management of such entity, whether by contract or 20 | otherwise, or (ii) ownership of fifty percent (50%) or more of the 21 | outstanding shares, or (iii) beneficial ownership of such entity. 22 | 23 | "You" (or "Your") shall mean an individual or Legal Entity 24 | exercising permissions granted by this License. 25 | 26 | "Source" form shall mean the preferred form for making modifications, 27 | including but not limited to software source code, documentation 28 | source, and configuration files.
29 | 30 | "Object" form shall mean any form resulting from mechanical 31 | transformation or translation of a Source form, including but 32 | not limited to compiled object code, generated documentation, 33 | and conversions to other media types. 34 | 35 | "Work" shall mean the work of authorship, whether in Source or 36 | Object form, made available under the License, as indicated by a 37 | copyright notice that is included in or attached to the work 38 | (an example is provided in the Appendix below). 39 | 40 | "Derivative Works" shall mean any work, whether in Source or Object 41 | form, that is based on (or derived from) the Work and for which the 42 | editorial revisions, annotations, elaborations, or other modifications 43 | represent, as a whole, an original work of authorship. For the purposes 44 | of this License, Derivative Works shall not include works that remain 45 | separable from, or merely link (or bind by name) to the interfaces of, 46 | the Work and Derivative Works thereof. 47 | 48 | "Contribution" shall mean any work of authorship, including 49 | the original version of the Work and any modifications or additions 50 | to that Work or Derivative Works thereof, that is intentionally 51 | submitted to Licensor for inclusion in the Work by the copyright owner 52 | or by an individual or Legal Entity authorized to submit on behalf of 53 | the copyright owner. For the purposes of this definition, "submitted" 54 | means any form of electronic, verbal, or written communication sent 55 | to the Licensor or its representatives, including but not limited to 56 | communication on electronic mailing lists, source code control systems, 57 | and issue tracking systems that are managed by, or on behalf of, the 58 | Licensor for the purpose of discussing and improving the Work, but 59 | excluding communication that is conspicuously marked or otherwise 60 | designated in writing by the copyright owner as "Not a Contribution." 
61 | 62 | "Contributor" shall mean Licensor and any individual or Legal Entity 63 | on behalf of whom a Contribution has been received by Licensor and 64 | subsequently incorporated within the Work. 65 | 66 | 2. Grant of Copyright License. Subject to the terms and conditions of 67 | this License, each Contributor hereby grants to You a perpetual, 68 | worldwide, non-exclusive, no-charge, royalty-free, irrevocable 69 | copyright license to reproduce, prepare Derivative Works of, 70 | publicly display, publicly perform, sublicense, and distribute the 71 | Work and such Derivative Works in Source or Object form. 72 | 73 | 3. Grant of Patent License. Subject to the terms and conditions of 74 | this License, each Contributor hereby grants to You a perpetual, 75 | worldwide, non-exclusive, no-charge, royalty-free, irrevocable 76 | (except as stated in this section) patent license to make, have made, 77 | use, offer to sell, sell, import, and otherwise transfer the Work, 78 | where such license applies only to those patent claims licensable 79 | by such Contributor that are necessarily infringed by their 80 | Contribution(s) alone or by combination of their Contribution(s) 81 | with the Work to which such Contribution(s) was submitted. If You 82 | institute patent litigation against any entity (including a 83 | cross-claim or counterclaim in a lawsuit) alleging that the Work 84 | or a Contribution incorporated within the Work constitutes direct 85 | or contributory patent infringement, then any patent licenses 86 | granted to You under this License for that Work shall terminate 87 | as of the date such litigation is filed. 88 | 89 | 4. Redistribution. 
You may reproduce and distribute copies of the 90 | Work or Derivative Works thereof in any medium, with or without 91 | modifications, and in Source or Object form, provided that You 92 | meet the following conditions: 93 | 94 | (a) You must give any other recipients of the Work or 95 | Derivative Works a copy of this License; and 96 | 97 | (b) You must cause any modified files to carry prominent notices 98 | stating that You changed the files; and 99 | 100 | (c) You must retain, in the Source form of any Derivative Works 101 | that You distribute, all copyright, patent, trademark, and 102 | attribution notices from the Source form of the Work, 103 | excluding those notices that do not pertain to any part of 104 | the Derivative Works; and 105 | 106 | (d) If the Work includes a "NOTICE" text file as part of its 107 | distribution, then any Derivative Works that You distribute must 108 | include a readable copy of the attribution notices contained 109 | within such NOTICE file, excluding those notices that do not 110 | pertain to any part of the Derivative Works, in at least one 111 | of the following places: within a NOTICE text file distributed 112 | as part of the Derivative Works; within the Source form or 113 | documentation, if provided along with the Derivative Works; or, 114 | within a display generated by the Derivative Works, if and 115 | wherever such third-party notices normally appear. The contents 116 | of the NOTICE file are for informational purposes only and 117 | do not modify the License. You may add Your own attribution 118 | notices within Derivative Works that You distribute, alongside 119 | or as an addendum to the NOTICE text from the Work, provided 120 | that such additional attribution notices cannot be construed 121 | as modifying the License. 
122 | 123 | You may add Your own copyright statement to Your modifications and 124 | may provide additional or different license terms and conditions 125 | for use, reproduction, or distribution of Your modifications, or 126 | for any such Derivative Works as a whole, provided Your use, 127 | reproduction, and distribution of the Work otherwise complies with 128 | the conditions stated in this License. 129 | 130 | 5. Submission of Contributions. Unless You explicitly state otherwise, 131 | any Contribution intentionally submitted for inclusion in the Work 132 | by You to the Licensor shall be under the terms and conditions of 133 | this License, without any additional terms or conditions. 134 | Notwithstanding the above, nothing herein shall supersede or modify 135 | the terms of any separate license agreement you may have executed 136 | with Licensor regarding such Contributions. 137 | 138 | 6. Trademarks. This License does not grant permission to use the trade 139 | names, trademarks, service marks, or product names of the Licensor, 140 | except as required for reasonable and customary use in describing the 141 | origin of the Work and reproducing the content of the NOTICE file. 142 | 143 | 7. Disclaimer of Warranty. Unless required by applicable law or 144 | agreed to in writing, Licensor provides the Work (and each 145 | Contributor provides its Contributions) on an "AS IS" BASIS, 146 | WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or 147 | implied, including, without limitation, any warranties or conditions 148 | of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A 149 | PARTICULAR PURPOSE. You are solely responsible for determining the 150 | appropriateness of using or redistributing the Work and assume any 151 | risks associated with Your exercise of permissions under this License. 152 | 153 | 8. Limitation of Liability. 
In no event and under no legal theory, 154 | whether in tort (including negligence), contract, or otherwise, 155 | unless required by applicable law (such as deliberate and grossly 156 | negligent acts) or agreed to in writing, shall any Contributor be 157 | liable to You for damages, including any direct, indirect, special, 158 | incidental, or consequential damages of any character arising as a 159 | result of this License or out of the use or inability to use the 160 | Work (including but not limited to damages for loss of goodwill, 161 | work stoppage, computer failure or malfunction, or any and all 162 | other commercial damages or losses), even if such Contributor 163 | has been advised of the possibility of such damages. 164 | 165 | 9. Accepting Warranty or Additional Liability. While redistributing 166 | the Work or Derivative Works thereof, You may choose to offer, 167 | and charge a fee for, acceptance of support, warranty, indemnity, 168 | or other liability obligations and/or rights consistent with this 169 | License. However, in accepting such obligations, You may act only 170 | on Your own behalf and on Your sole responsibility, not on behalf 171 | of any other Contributor, and only if You agree to indemnify, 172 | defend, and hold each Contributor harmless for any liability 173 | incurred by, or claims asserted against, such Contributor by reason 174 | of your accepting any such warranty or additional liability. 175 | 176 | END OF TERMS AND CONDITIONS 177 | 178 | APPENDIX: How to apply the Apache License to your work. 179 | 180 | To apply the Apache License to your work, attach the following 181 | boilerplate notice, with the fields enclosed by brackets "[]" 182 | replaced with your own identifying information. (Don't include 183 | the brackets!) The text should be enclosed in the appropriate 184 | comment syntax for the file format. 
We also recommend that a 185 | file or class name and description of purpose be included on the 186 | same "printed page" as the copyright notice for easier 187 | identification within third-party archives. 188 | 189 | Copyright [yyyy] [name of copyright owner] 190 | 191 | Licensed under the Apache License, Version 2.0 (the "License"); 192 | you may not use this file except in compliance with the License. 193 | You may obtain a copy of the License at 194 | 195 | http://www.apache.org/licenses/LICENSE-2.0 196 | 197 | Unless required by applicable law or agreed to in writing, software 198 | distributed under the License is distributed on an "AS IS" BASIS, 199 | WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. 200 | See the License for the specific language governing permissions and 201 | limitations under the License. 202 | -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- 1 | # Delete-Your-PDF 2 | 3 | Delete your PDF is a set of tools to export information from your PDFs so you can delete them. 
4 |
5 | Image files can be passed in as either base64 strings (in the `data:image/png;base64,...` form this library returns) or BytesIO objects.
6 |
7 | Pip Library: [https://pypi.org/project/delete-your-pdf/](https://pypi.org/project/delete-your-pdf/)
8 |
9 | Pip Repo: [https://github.com/darefail/Delete-Your-PDF](https://github.com/darefail/Delete-Your-PDF)
10 |
11 | ## Live Demo
12 |
13 | **Live Demo:** [https://pdf.darefail.com](https://pdf.darefail.com)
14 |
15 | **Demo Opensource Repo:** [https://github.com/DareFail/AI-Video-Boilerplate-Pro](https://github.com/DareFail/AI-Video-Boilerplate-Pro)
16 |
17 |
18 | ## Installation
19 |
20 | ```sh
21 | pip install delete-your-pdf
22 | ```
23 |
24 | ## How to use
25 |
26 | **countPdfPages**: Counts the number of pages in a PDF and returns an int
27 | ```python
28 | from deleteYourPDF import countPdfPages
29 |
30 | numberOfPages = countPdfPages(file="PDF_FILE_HERE")
31 | ```
32 |
33 | **pdfToImagePages**: Converts a PDF to a list of pages, each a PNG image encoded as a base64 string
34 | ```python
35 | from deleteYourPDF import pdfToImagePages
36 |
37 | # Return a list containing all pages in order as images
38 | listOfImagePages = pdfToImagePages(file="PDF_FILE_HERE")
39 |
40 | # Return a list containing only an image of page 7
41 | listOfImagePages = pdfToImagePages(file="PDF_FILE_HERE", page_number=7)
42 | ```
43 |
44 | **imageWidthHeight**: Gets the width and height of an image in pixels as a dictionary, e.g. {"width": 100, "height": 100}
45 | ```python
46 | from deleteYourPDF import imageWidthHeight
47 |
48 | image_dimensions = imageWidthHeight(file="IMAGE_FILE_HERE")
49 |
50 | width = image_dimensions["width"]
51 | height = image_dimensions["height"]
52 | ```
53 |
54 | **cropRotateImage**: Crops and rotates an image and returns a PNG image as a base64 string
55 | ```python
56 | from deleteYourPDF import cropRotateImage
57 |
58 | # Returns the top-left 100x100 square of an image, rotated 90 degrees to the right; the new image dimensions expand to fit the rotation
59 | croppedAndRotatedImage = cropRotateImage(file="IMAGE_FILE_HERE", x=0, y=0, width=100, height=100, rotation_degrees=90)
60 |
61 | # Returns the top-left 100x100 square of an image, rotated 30 degrees to the right, keeping the original 100x100 dimensions
62 | croppedAndRotatedImage = cropRotateImage(file="IMAGE_FILE_HERE", x=0, y=0, width=100, height=100, rotation_degrees=30, expand_for_rotation=False)
63 | ```
64 |
65 | **imageToText_Roboflow**: Converts an image to text with Roboflow OCR and returns a string
66 | ```python
67 | from deleteYourPDF import imageToText_Roboflow
68 |
69 | # Returns the text found in the image
70 | text = imageToText_Roboflow(file="IMAGE_FILE_HERE", api_key="ROBOFLOW_API_KEY_HERE")
71 | ```
72 |
73 | ## Example 1: Convert the top 100 pixels of all pages of a PDF to a list of text
74 | ```python
75 | from deleteYourPDF import pdfToImagePages, imageToText_Roboflow, cropRotateImage, imageWidthHeight
76 |
77 | listOfText = []
78 |
79 | listOfImagePages = pdfToImagePages(file="PDF_FILE_HERE")
80 |
81 | for imagePage in listOfImagePages:
82 |     image_dimensions = imageWidthHeight(file=imagePage)
83 |
84 |     width = image_dimensions["width"]
85 |     height = image_dimensions["height"]
86 |
87 |     croppedAndRotatedImage = cropRotateImage(file=imagePage, x=0, y=0, width=width, height=100)
88 |     listOfText.append(imageToText_Roboflow(file=croppedAndRotatedImage, api_key="ROBOFLOW_API_KEY_HERE"))
89 |
90 | print(listOfText)
91 | ```
92 |
93 | ## Example 2: Rotate a 100x100 box in the center of page 7 of a PDF 90 degrees to the right and print the text
94 | ```python
95 | from deleteYourPDF import countPdfPages, pdfToImagePages, imageToText_Roboflow, cropRotateImage, imageWidthHeight
96 |
97 | if countPdfPages(file="PDF_FILE_HERE") >= 7:
98 |     imagePage = pdfToImagePages(file="PDF_FILE_HERE", page_number=7)[0]  # a one-item list is returned
99 |     image_dimensions = imageWidthHeight(file=imagePage)
100 |
101 |     width = image_dimensions["width"]
102 |     height = image_dimensions["height"]
103 |     x = (width - 100) // 2
104 |     y = (height - 100) // 2
105 |
106 |     croppedAndRotatedImage = cropRotateImage(file=imagePage, x=x, y=y, width=100, height=100, rotation_degrees=90)
107 |     print(imageToText_Roboflow(file=croppedAndRotatedImage, api_key="ROBOFLOW_API_KEY_HERE"))
108 | ```
109 |
110 | ## Acknowledgements
111 |
112 | Thanks to Roboflow for sponsoring this project. Get your free API key at: [Roboflow](https://roboflow.com/)
113 |
114 | ## License
115 |
116 | Distributed under the Apache 2.0 License. See `LICENSE` for more information.
117 |
118 | ## Contact
119 |
120 | Twitter: [@darefailed](https://twitter.com/darefailed)
121 |
122 | YouTube: [How to Video coming soon](https://www.youtube.com/@darefail)
123 |
124 | Project Link: [https://github.com/darefail/Delete-Your-PDF](https://github.com/darefail/Delete-Your-PDF)
125 |
126 |
127 | ## Update Package
128 | ```sh
129 | python3 -m build
130 | python3 -m twine upload dist/*
131 | ```
132 |
--------------------------------------------------------------------------------
/deleteYourPDF/__init__.py:
--------------------------------------------------------------------------------
1 | from .actions import *
--------------------------------------------------------------------------------
/deleteYourPDF/actions.py:
--------------------------------------------------------------------------------
1 | import pypdfium2 as pdfium
2 | import base64
3 | import requests
4 | from io import BytesIO
5 | from PIL import Image
6 |
7 |
8 | def countPdfPages(file: str) -> int:
9 |     pdf = pdfium.PdfDocument(file)
10 |     return len(pdf)
11 |
12 |
13 | def pdfToImagePages(file: str, page_number: int = None) -> list:
14 |     imagePages = []
15 |     pdf = pdfium.PdfDocument(file)
16 |
17 |     if page_number:
18 |         # page_number is 1-based; render only that page
19 |         page = pdf[page_number - 1]
20 |         image = page.render(scale=4).to_pil()
21 |         buffer = BytesIO()
22 |         image.save(buffer, format="PNG")
23 |         img_str = base64.b64encode(buffer.getvalue()).decode()
24 |         img_str = f"data:image/png;base64,{img_str}"
25 |         imagePages.append(img_str)
26 |     else:
27 |         # No page given: render every page in order
28 |         for i in range(len(pdf)):
29 |             page = pdf[i]
30 |             image = page.render(scale=4).to_pil()
31 |             buffer = BytesIO()
32 |             image.save(buffer, format="PNG")
33 |             img_str = base64.b64encode(buffer.getvalue()).decode()
34 |             img_str = f"data:image/png;base64,{img_str}"
35 |             imagePages.append(img_str)
36 |
37 |     return imagePages
38 |
39 |
40 | def cropRotateImage(file, x: int, y: int, width: int, height: int, rotation_degrees: int = 0, expand_for_rotation: bool = True) -> str:
41 |
42 |     if isinstance(file, str):
43 |         # Strip the "data:image/png;base64," prefix before decoding
44 |         base64_data = file.split(",")[1]
45 |         image_data = base64.b64decode(base64_data)
46 |         image = Image.open(BytesIO(image_data))
47 |
48 |     elif isinstance(file, BytesIO):
49 |         image = Image.open(file)
50 |
51 |     else:
52 |         raise TypeError("file must be a base64 string or a BytesIO object")
53 |
54 |     cropped_image = image.crop((x, y, x+width, y+height))
55 |
56 |     # PIL rotates counterclockwise, so invert the angle to rotate to the right
57 |     rotation_degrees = (360 - rotation_degrees) % 360
58 |
59 |     rotated_image = cropped_image.rotate(rotation_degrees, expand=expand_for_rotation)
60 |
61 |     buffered = BytesIO()
62 |     rotated_image.save(buffered, format = "PNG")
63 |
64 |     img_str = base64.b64encode(buffered.getvalue()).decode()
65 |
66 |     img_str = f"data:image/png;base64,{img_str}"
67 |
68 |     return img_str
69 |
70 |
71 | def imageWidthHeight(file) -> dict:
72 |
73 |     if isinstance(file, str):
74 |         base64_data = file.split(",")[1]
75 |         image_data = base64.b64decode(base64_data)
76 |         image = Image.open(BytesIO(image_data))
77 |
78 |     elif isinstance(file, BytesIO):
79 |         image = Image.open(file)
80 |
81 |     else:
82 |         raise TypeError("file must be a base64 string or a BytesIO object")
83 |
84 |     widthHeight = image.size
85 |
86 |     return {
87 |         "width": widthHeight[0],
88 |         "height": widthHeight[1],
89 |     }
90 |
91 |
92 |
93 | def imageToText_Roboflow(file, api_key: str) -> str:
94 |
95 |     if isinstance(file, str):
96 |         base64_data = file.split(",")[1]
97 |         image_data = base64.b64decode(base64_data)
98 |         image = Image.open(BytesIO(image_data))
99 |
100 |     elif isinstance(file, BytesIO):
101 |         image = Image.open(file)
102 |
103 |     else:
104 |         raise TypeError("file must be a base64 string or a BytesIO object")
105 |
106 |     # Drop the alpha channel before sending the image for OCR
107 |     if image.mode == 'RGBA':
108 |         rgb_image = image.convert('RGB')
109 |     else:
110 |         rgb_image = image
111 |
112 |     byte_arr = BytesIO()
113 |     rgb_image.save(byte_arr, format='PNG')
114 |     encoded_image = base64.encodebytes(byte_arr.getvalue()).decode('ascii')
115 |
116 |     data = {
117 |         "image": {
118 |             "type": "base64",
119 |             "value": encoded_image
120 |         }
121 |     }
122 |
123 |     ocr_results = requests.post("https://infer.roboflow.com/doctr/ocr?api_key=" + api_key, json=data).json()
124 |
125 |     return ocr_results["result"]
126 |
--------------------------------------------------------------------------------
/delete_your_pdf.egg-info/PKG-INFO:
--------------------------------------------------------------------------------
1 | Metadata-Version: 2.1
2 | Name: delete-your-pdf
3 | Version: 1.0.3
4 | Summary: Crop, Rotate, and extract text from your PDFs so you can delete them
5 | Home-page: https://github.com/DareFail/delete-your-pdf
6 | Author: James Steinberg
7 | Author-email: jamespsteinberg@gmail.com
8 | Description-Content-Type: text/markdown
9 | License-File: LICENSE
10 | Requires-Dist: pypdfium2
11 | Requires-Dist: pillow
12 |
13 | # Delete-Your-PDF
14 |
15 | Delete your PDF is a set of tools to export information from your PDFs so you can delete them.
--------------------------------------------------------------------------------
/delete_your_pdf.egg-info/SOURCES.txt:
--------------------------------------------------------------------------------
1 | LICENSE
2 | README.md
3 | setup.py
4 | deleteYourPDF/__init__.py
5 | deleteYourPDF/actions.py
6 | delete_your_pdf.egg-info/PKG-INFO
7 | delete_your_pdf.egg-info/SOURCES.txt
8 | delete_your_pdf.egg-info/dependency_links.txt
9 | delete_your_pdf.egg-info/requires.txt
10 | delete_your_pdf.egg-info/top_level.txt
--------------------------------------------------------------------------------
/delete_your_pdf.egg-info/dependency_links.txt:
--------------------------------------------------------------------------------
1 |
2 |
--------------------------------------------------------------------------------
/delete_your_pdf.egg-info/requires.txt:
--------------------------------------------------------------------------------
1 | pypdfium2
2 | pillow
3 |
--------------------------------------------------------------------------------
/delete_your_pdf.egg-info/top_level.txt:
--------------------------------------------------------------------------------
1 | deleteYourPDF
2 |
--------------------------------------------------------------------------------
/dist/delete_your_pdf-1.0.2-py3-none-any.whl:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/DareFail/Delete-Your-PDF/5544356dc2a217cda5e1efb9d54268609e6eddb9/dist/delete_your_pdf-1.0.2-py3-none-any.whl
--------------------------------------------------------------------------------
/dist/delete_your_pdf-1.0.2.tar.gz:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/DareFail/Delete-Your-PDF/5544356dc2a217cda5e1efb9d54268609e6eddb9/dist/delete_your_pdf-1.0.2.tar.gz
--------------------------------------------------------------------------------
/dist/delete_your_pdf-1.0.3-py3-none-any.whl:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/DareFail/Delete-Your-PDF/5544356dc2a217cda5e1efb9d54268609e6eddb9/dist/delete_your_pdf-1.0.3-py3-none-any.whl
--------------------------------------------------------------------------------
/dist/delete_your_pdf-1.0.3.tar.gz:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/DareFail/Delete-Your-PDF/5544356dc2a217cda5e1efb9d54268609e6eddb9/dist/delete_your_pdf-1.0.3.tar.gz
--------------------------------------------------------------------------------
/setup.py:
--------------------------------------------------------------------------------
1 | from setuptools import setup
2 |
3 | # read the contents of your README file
4 | from pathlib import Path
5 | this_directory = Path(__file__).parent
6 | long_description = (this_directory / "README.md").read_text()
7 |
8 | setup(name='delete-your-pdf',
9 |       version='1.0.3',
10 |       description='Crop, Rotate, and extract text from your PDFs so you can delete them',
11 |       author='James Steinberg',
12 |       author_email='jamespsteinberg@gmail.com',
13 |       url='https://github.com/DareFail/delete-your-pdf',
14 |       packages=['deleteYourPDF'],
15 |       install_requires=['pypdfium2', 'pillow', 'requests'],  # actions.py imports requests
16 |       long_description=long_description,
17 |       long_description_content_type='text/markdown',
18 | )
--------------------------------------------------------------------------------
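The image functions in `deleteYourPDF/actions.py` accept either a `BytesIO` object or a string, and they parse a string by splitting on the comma and base64-decoding what follows. A plain file path or a bare base64 string without a `data:...;base64,` prefix would therefore not be read correctly. The sketch below shows one way to prepare such a string from raw image bytes. The helper names `to_data_uri` and `from_data_uri` are hypothetical and are not part of the package; only the standard library is used.

```python
import base64
from io import BytesIO


def to_data_uri(raw: bytes, mime: str = "image/png") -> str:
    """Wrap raw image bytes in the data-URI string form (hypothetical helper)."""
    encoded = base64.b64encode(raw).decode("ascii")
    return f"data:{mime};base64,{encoded}"


def from_data_uri(data_uri: str) -> BytesIO:
    """Undo the wrapping: take everything after the first comma and decode it."""
    _, _, payload = data_uri.partition(",")
    return BytesIO(base64.b64decode(payload))


# Round trip on placeholder bytes (not a real image)
uri = to_data_uri(b"abc")
restored = from_data_uri(uri).getvalue()
```

For a file on disk, reading it in binary mode and passing the bytes to `to_data_uri` would give a string that could then be used as the `file` argument, assuming the bytes are an image Pillow can open.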