├── .gitignore ├── Makefile ├── README.md └── docs ├── processInbox.sh └── src └── .keep /.gitignore: -------------------------------------------------------------------------------- 1 | /docs/src 2 | /docs/dst 3 | .DS_Store -------------------------------------------------------------------------------- /Makefile: -------------------------------------------------------------------------------- 1 | 2 | run: 3 | @docker run -it -v $(shell pwd)/docs:/ocr --entrypoint "/bin/bash" jbarlow83/ocrmypdf -c 'cd /ocr && ./processInbox.sh' -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- 1 | # easy-ocr 2 | 3 | 1. put scanned PDFs in `src` folder 4 | 2. do `make run` 5 | 3. grab OCR'd filed out of `dst` folder 6 | 7 | ## requirements 8 | 9 | * have docker running 10 | * have make installed 11 | -------------------------------------------------------------------------------- /docs/processInbox.sh: -------------------------------------------------------------------------------- 1 | #!/bin/bash 2 | mkdir dst || true 3 | cd src 4 | FILES=* 5 | for f in $FILES 6 | do 7 | echo "Processing '$f'" 8 | # take action on each file. $f store current file name 9 | /usr/local/bin/ocrmypdf -l eng+deu "$f" "../dst/$f" 10 | done 11 | -------------------------------------------------------------------------------- /docs/src/.keep: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/extrawurst/easy-ocr/e9dfb784c236b07b010d940c8f485002d8e584ff/docs/src/.keep --------------------------------------------------------------------------------