├── .gitignore ├── README.md ├── bin ├── download_hrc_pdfs.py ├── extract_json_field.py ├── extract_text_from_pdfs.sh ├── split_emails.py └── strip_statedept_headers.py └── data ├── .gitignore ├── hrc-data-model.png ├── neo4j_export.10k.csv └── neo4j_export.csv /.gitignore: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/agussman/hrc-email/HEAD/.gitignore -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/agussman/hrc-email/HEAD/README.md -------------------------------------------------------------------------------- /bin/download_hrc_pdfs.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/agussman/hrc-email/HEAD/bin/download_hrc_pdfs.py -------------------------------------------------------------------------------- /bin/extract_json_field.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/agussman/hrc-email/HEAD/bin/extract_json_field.py -------------------------------------------------------------------------------- /bin/extract_text_from_pdfs.sh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/agussman/hrc-email/HEAD/bin/extract_text_from_pdfs.sh -------------------------------------------------------------------------------- /bin/split_emails.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/agussman/hrc-email/HEAD/bin/split_emails.py -------------------------------------------------------------------------------- /bin/strip_statedept_headers.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/agussman/hrc-email/HEAD/bin/strip_statedept_headers.py -------------------------------------------------------------------------------- /data/.gitignore: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /data/hrc-data-model.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/agussman/hrc-email/HEAD/data/hrc-data-model.png -------------------------------------------------------------------------------- /data/neo4j_export.10k.csv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/agussman/hrc-email/HEAD/data/neo4j_export.10k.csv -------------------------------------------------------------------------------- /data/neo4j_export.csv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/agussman/hrc-email/HEAD/data/neo4j_export.csv --------------------------------------------------------------------------------