├── cookiecutter.json
├── hooks
    ├── post_gen_project.sh
    └── pre_gen_project.py
├── readme.md
└── {{cookiecutter.project_slug}}
    ├── .export_rmarkdown.R
    ├── .first_install.py
    ├── .gitignore
    ├── .nbconvert_templates
        └── ap_report
        │   ├── ap.svg
        │   ├── conf.json
        │   ├── index.html.j2
        │   └── static
        │       └── style.css
    ├── .set_kernel_path.sh
    ├── README.md
    ├── _quarto.yml
    ├── analysis
        ├── .gitkeep
        ├── archive
        │   └── .gitkeep
        └── notebook_templates
        │   └── ap_data_team
        │       ├── quarto.ipynb
        │       └── rmarkdown.ipynb
    ├── data
        ├── .gitignore
        ├── documentation
        │   └── .gitignore
        ├── handmade
        │   └── .gitignore
        ├── html_reports
        │   └── .gitignore
        ├── processed
        │   └── .gitignore
        ├── public
        │   └── .gitignore
        └── source
        │   └── .gitignore
    ├── etl
        └── .gitkeep
    ├── publish
        └── .gitkeep
    └── scratch
        └── .gitkeep


/cookiecutter.json:
--------------------------------------------------------------------------------
 1 | {
 2 |   "full_name": "Firstname Lastname",
 3 |   "email": "",
 4 |   "project_name": "New Project",
 5 |   "project_slug": "{{ cookiecutter.project_name.lower().replace(' ', '-') }}",
 6 |   "project_short_description": "TK: short project description",
 7 |   "_copy_without_render": [
 8 |     "analysis/*",
 9 |     ".nbconvert_templates/*"
10 |   ]
11 | }
12 | 


--------------------------------------------------------------------------------
/hooks/post_gen_project.sh:
--------------------------------------------------------------------------------
 1 | #!/bin/bash
 2 | ## This post project generation script only runs if pipenv is on the machine
 3 | command -v pipenv >/dev/null 2>&1 || { echo >&2 "pipenv not found.  Aborting startup script."; exit 1; }
 4 | 
 5 | ## Run first_install script
 6 | #### This is meant to be run when people first clone the project.
 7 | #### Running it here to add jupyter data directory env variable, to set the RETICULATE_PYTHON r env
 8 | ###### variable, to set up the jupyter lab template directory/enable its server,
 9 | ###### and to set up the git solution for changing cwd in an analysis file.
10 | python ./.first_install.py
11 | 


--------------------------------------------------------------------------------
/hooks/pre_gen_project.py:
--------------------------------------------------------------------------------
 1 | import re
 2 | import sys
 3 | 
 4 | SLUG_REGEX = r'^[a-zA-Z0-9][-_a-zA-Z0-9]+$'
 5 | slug = '{{ cookiecutter.project_slug }}'
 6 | 
 7 | if not re.match(SLUG_REGEX, slug):
 8 |     print(f'ERROR: {slug} is not a valid project slug!')
 9 |     sys.exit(1)
10 | 


--------------------------------------------------------------------------------
/readme.md:
--------------------------------------------------------------------------------
  1 | # AP Python Cookiecutter
  2 | 
  3 | This is a project template powered by [Cookiecutter](https://github.com/cookiecutter/cookiecutter) for use with [datakit-project](https://github.com/associatedpress/datakit-project/).
  4 | 
  5 | **Structure**
  6 | 
  7 | ```
  8 | .
  9 | ├── README.md
 10 | ├── analysis
 11 | │   └── archive
 12 | ├── data
 13 | │   ├── documentation
 14 | │   ├── html_reports
 15 | │   ├── manual
 16 | │   ├── processed
 17 | │   ├── public
 18 | │   └── source
 19 | ├── etl
 20 | ├── publish
 21 | └── scratch
 22 | ```
 23 | 
 24 | - `README.md`
 25 |   - Project-specific readme with boilerplate for data projects.
 26 | - `analysis`
 27 |   - This is where we keep all of our jupyter ipython notebooks that contain analysis for the project.
 28 |     - Notebooks in this folder can ingest data from either `data/source` (if that data comes from the source in a workable format) or `data/processed` (if the data required some prep).
 29 |     - Dataframes from analysis notebooks should be written out to `data/processed`
 30 |   - `analysis/archive`: Notebooks that leave the scope of the project but should also remain in the project history will be placed here.
 31 |   - Note that only `.Rmd` linked to `.ipynb` via `Jupytext` are commited, `.ipynb` are in the `.gitignore` because `.ipynb` metadata frequently disrupts version control whenever a notebook is opened or interacted with, while `.Rmd` files only keep track of code.
 32 | - `data`
 33 |   - This is the directory used with our `datakit-data` plugin.
 34 |   - `data/documentation`
 35 |     - Documentation on data files should go here - data dictionaries, manuals, interview notes.
 36 |   - `data/html_reports`
 37 |     - Contains rendered html of our analysis notebooks, the results of calling `pipenv run export_rmarkdown` on a notebook.
 38 |   - `data/manual`
 39 |     - Contains data that has been manually altered (e.g. excel workbooks with inconsistent string errors requiring eyes on every row).
 40 |   - `data/processed`
 41 |     - Contains data that has either been transformed from an `etl` script or output from an `analysis` jupyter notebook.
 42 |     - Data that has been transformed from an `etl` script will follow a naming convention: `etl_{file_name}.[csv,json...]`
 43 |   - `data/public`
 44 |     - Public-facing data files go here - data files which are 'live'.
 45 |   - `data/source`: contains raw, untouched data.
 46 | - `etl`
 47 |   - This is where we keep python scripts involved with collecting data and prepping it for analysis.
 48 |   - These files should be scripts, they should not be jupyter notebooks.
 49 | - `publish`
 50 |   - This directory holds all the documents in the project that will be public facing (e.g. data.world documents).
 51 | - `scratch`
 52 |   - This directory contains output that will not be used in the project in its final form.
 53 |   - Common cases are filtered tables or quick visualizations for reporters
 54 |   - This directory is not git tracked.
 55 | 
 56 | **Our `.gitignore`**
 57 | 
 58 | ```
 59 | *.vim
 60 | .env
 61 | .Renviron
 62 | .venv
 63 | .quarto
 64 | .DS_Store
 65 | .ipynb_checkpoints
 66 | 
 67 | analysis/*.ipynb
 68 | analysis/archive/*.ipynb
 69 | !analysis/notebook_templates/*.ipynb
 70 | 
 71 | data/
 72 | !data/source/.gitkeep
 73 | !data/manual/.gitkeep
 74 | !data/processed/.gitkeep
 75 | !data/html_reports/.gitkeep
 76 | !data/public/.gitkeep
 77 | !data/documentation/.gitkeep
 78 | 
 79 | scratch/
 80 | !scratch/.gitkeep
 81 | ```
 82 | 
 83 | ## Usage
 84 | 
 85 | These steps assume configuration for [datakit-project](https://github.com/associatedpress/datakit-project) are complete.
 86 | 
 87 | - If you'd like to keep a local version of this template on your computer, git clone this repository to where your cookiecutters live:
 88 | 
 89 | ```
 90 | cd path/to/.cookiecutters
 91 | git clone git@github.com/associatedpress/cookiecutter-python-project.git
 92 | ```
 93 | 
 94 | - Now, when starting a new project with `datakit-project`, reference the cookiecutter in your filesystem. This creates a `pipenv` virtual environment and a ipython kernel for jupyter notebooks that will have the name of the `project_slug`.
 95 | 
 96 | ```
 97 | datakit project create --template path/to/.cookiecutters/cookiecutter-python-project`
 98 | ```
 99 | 
100 | If you'd like to avoid specifying the template each time, you can edit `~/.datakit/plugins/datakit-project/config.json` to use this template by default:
101 | 
102 | ```
103 |  {"default_template": "/path/to/.cookiecutters/cookiecutter-python-project"}
104 | ```
105 | 
106 | ### Full virtual environment setup. From package management to rendering analyses.
107 | 
108 | This python template should get AP data journalists set up quickly with a virtual environment, allowing them to clone a project and quickly install all the packages required to run ETL and analysis files. 
109 | 
110 | **Setup**
111 | 
112 | *This is the required setup to get the full python package management functionality provided by this template:*
113 | 
114 | - [Pyenv](https://github.com/pyenv/pyenv) to manage our python installations. `brew install pyenv`
115 | 
116 |   - We need to install a python with shared libraries via `pyenv` using the option `--enable-shared`. This gives us the ability to interact with our R install, should we ever wish to write R code in an R cell in Jupyter, or use R from a python instance using the python library `rpy2`. If we were to install version 3.9.13, for example: `env PYTHON_CONFIGURE_OPTS="--enable-shared" pyenv install 3.9.13`.
117 | 
118 | - [Pipenv](https://pipenv.pypa.io/en/latest/) to manage the python packages necessary for our project. We switch to our python with shared libraries we installed earlier (in this case version 3.9.13) with `pyenv global 3.9.13` and then pip install pipenv `python -m pip install pipenv`. It is possible to brew install pipenv, but those who maintain pipenv do not maintain that brew install of the software. They suggest pip installing.
119 | 
120 | - [Quarto](https://quarto.org/) to render our analysis notebooks. To install, we use the [CLI installer](https://quarto.org/docs/get-started/) available on their site.
121 | 
122 | - Finally, we install `datakit` on the pyenv python with shared libraries. The config files we set up when we first installed datakit will work with datakit installs across different versions of python. `python -m pip install datakit-gitlab datakit-project datakit-data`.
123 | 
124 | **Workflow**
125 | 
126 | *Starting a new project*
127 | - `datakit project create` will kick off the typical datakit cookiecutter project creation, but this template runs an additional script after constructing the AP analysis folder tree. Briefly, this script sets up the project for pipenv and installs our typical analysis packages. You can find this script in your project: `.first_install.py`. A more detailed description for this script will come with an update to the README.
128 | 
129 | - Once the project is created we `cd` into it and run `pipenv shell` Before running `jupyter lab`. Or, we run `pipenv run jupyter lab`. It's up to you which commands to use here. Some people like to have a subshell running via `pipenv shell`, knowing that any command they run in that open subshell will make use of the pipenv environment. Other people like to type in the command every time they want to use the virtual environment with `pipenv run [terminal command]`.
130 | 
131 | - Whenever we need to install a package, we use `pipenv install [some_package]`.
132 | 
133 | - We don't git track `.ipynb` notebooks. Instead, we use [Jupytext](https://jupytext.readthedocs.io/en/latest/) to link our `.ipynb` files to git-tracked `.Rmd` files. This makes `git diff`s much more useful. `git status` shouldn't say our analysis changed because we ran a cell again. This makes sure it doesn't.
134 | 
135 | - When we start an analysis notebook, we use the folder tree in Jupyter Lab to get to our analysis folder and open a new Launcher Window (`shift + command + L`). Under the "Notebook" section, we select the option called "Template". This brings up a dropdown selection menu. Select `ap_data_team` on the top dropdown, and `quarto.ipynb` on the bottom. This should bring up another option to select your ipython kernel. Select the kernel named after your project.
136 |   - At this point, you have an analysis notebook file that is linked to an `.Rmd` with the same name. The first time you save your `.ipynb` file, you'll see that `.Rmd` appear alongside your `.ipynb` file. If you ever rename the `.ipynb`, the name of the `.Rmd` will change to match it.
137 |   - You can still create a typical `.ipynb` analysis without the template (and without the paired `.Rmd`). Just keep in mind that without a paired `.Rmd` the analysis will not be git-tracked, unless you add an exception for the `.ipynb` file in the `.gitignore`.
138 | 
139 | - While we are coding our analysis, we have the ability through Quarto to preview the rendered html file. Run `quarto preview path/to/analysis.ipynb`.
140 | 
141 | - When we're ready to render and share our analysis, we make sure Quarto executes the cells in the notebook to render fresh output. Run `quarto render path/to/analysis.ipynb --to html --execute`.
142 | 
143 | *Cloning a project*
144 | 
145 | - When you're in the directory where you keep your analysis projects, clone the python project: `git clone git@some.git.domain:path/to/git_project.git`
146 | - `cd` into the project and run `python .first_install.py`
147 |   - This step will create the projects virtual environment, install all necessary packages included in the `Pipfile` using the major python version defined in the `Pipfile`, and use the `.Rmd` files in the project to generate `.ipynb` files to work with. 
148 | 
149 | 
150 | **Legacy rmarkdown rendering**
151 | 
152 | Before we started using Quarto, this template generated R-style html reports via rmarkdown. We did this because rmarkdown generated better tables and more beautiful reports. To achieve it, we would actually pass the Jupytext-paired `.Rmd` file to rmarkdown via an Rscript. This required writing R cells in our analyses to get R style tables. For Altair charts, we'd have to pass the chart json to an R library that knew how to deal with vega charts. These cells wouldn't run until we rendered the report. This is the main reason for switching to Quarto, which allows us to have notebook output that matches what we'll see in the rendered report, and the result is just as beautiful. However, there may come a time, when we find rendering an `.Rmd` via rmarkdown useful. For that reason, we are keeping the rmarkdown rendering script. Keep in mind that to make use of it, you'll need to start an analysis with the Jupyter notebook template `rmarkdown.ipynb`. Then you can render an analysis using that template with `pipenv run export_rmarkdown path/to/analysis.Rmd`.
153 | 
154 | ## Configuration
155 | 
156 | You can set the default name, email, etc. for a project in the `cookiecutter.json` file.
157 | 


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/.export_rmarkdown.R:
--------------------------------------------------------------------------------
 1 | main <- function() {
 2 |   # Exports Rmd as html from the command line.
 3 |   #
 4 |   # Takes one argument:
 5 |   # Rmd file to convert
 6 |   #
 7 |   library(rmarkdown)
 8 |   args <- commandArgs(trailingOnly = TRUE)
 9 |   rmarkdown_file <- args[1]
10 |   render(rmarkdown_file, output_dir='data/html_reports')
11 | }
12 | 
13 | main()


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/.first_install.py:
--------------------------------------------------------------------------------
 1 | import os.path
 2 | import glob
 3 | import argparse
 4 | from subprocess import check_output
 5 | from subprocess import run
 6 | 
 7 | run(['mkdir', './.venv'])
 8 | 
 9 | PYENV_VERSION = "".join(check_output(['pyenv', 'version-name']).decode('utf-8').split())
10 | PYENV_PREFIX = "".join(check_output(['pyenv', 'prefix', f"{PYENV_VERSION}"]).decode('utf-8').split())
11 | 
12 | parser = argparse.ArgumentParser()
13 | parser.add_argument('--python', help='Python version to use with the project')
14 | args = parser.parse_args()
15 | 
16 | if os.path.isfile('./Pipfile'):
17 |     if args.python:
18 |         run(['pipenv', 'install', '--python', f"{args.python}", '--dev'])
19 |     else:
20 |         run(['pipenv', 'install', '--dev'])
21 | else:
22 |     run(['pipenv', 'install', '--python', f"{PYENV_PREFIX}/bin/python", 'ipython', 'ipykernel', 'pandas', 'matplotlib', 'notebook', 'jupyterlab', 'pyarrow', 'altair', 'jupytext', 'jupyterlab_templates', 'itables', 'ap-altair-theme'])
23 |     ## Add this script to the Pipfile, along with the rmarkdown export script
24 |     with open('Pipfile', 'a') as pipfile:
25 |         pipfile.write('\n[scripts]\nexport_rmarkdown = "Rscript .export_rmarkdown.R"')
26 | 
27 | VENV_DIR = "".join(check_output(['pipenv', '--venv']).decode('utf-8').split())
28 | RETICULATE_PYTHON = check_output(['pipenv', 'run', 'which', 'python']).decode('utf-8')
29 | TEMPLATE_PATHS = glob.glob('analysis/notebook_templates/*')
30 | 
31 | # Need to set the Jupyter data directory, this is where jupyter looks for kernels
32 | with open ('.env', 'w') as env_fi:
33 |     env_fi.write(f"JUPYTER_DATA_DIR={VENV_DIR}/share/jupyter/\n")
34 | # Need to tell R which python executable to use. Necessary for exporting rmarkdown reports as html.
35 | with open ('.Renviron', 'w') as Renv_fi:
36 |     Renv_fi.write(f"RETICULATE_PYTHON={RETICULATE_PYTHON}")
37 | 
38 | # Generate ipynb for every markdown file in analysis
39 | run(['pipenv', 'run', 'jupytext', '--set-formats', 'Rmd,ipynb', 'analysis/*.Rmd'])
40 | # Install jupyter template extension and enable the template server
41 | run(['mkdir', f"{VENV_DIR}/share/jupyter/notebook_templates"])
42 | for path in TEMPLATE_PATHS:
43 |     run(['cp', '-r', path, f"{VENV_DIR}/share/jupyter/notebook_templates/"])
44 | # Git solution for changing cwd in analysis files to root of project
45 | run(['pipenv', 'run', 'bash', '.set_kernel_path.sh'])
46 | 


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/.gitignore:
--------------------------------------------------------------------------------
 1 | *.vim
 2 | .env
 3 | .Renviron
 4 | .venv
 5 | .quarto
 6 | .DS_Store
 7 | .ipynb_checkpoints
 8 | 
 9 | analysis/*.ipynb
10 | analysis/archive/*.ipynb
11 | !analysis/notebook_templates/*.ipynb
12 | 
13 | scratch/*
14 | !scratch/.gitkeep
15 | 


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/.nbconvert_templates/ap_report/ap.svg:
--------------------------------------------------------------------------------
 1 | <?xml version="1.0" encoding="UTF-8" standalone="no"?>
 2 | <svg width="75px" height="88px" viewBox="0 0 75 88" version="1.1" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink">
 3 |     <!-- Generator: Sketch 39.1 (31720) - http://www.bohemiancoding.com/sketch -->
 4 |     <title>AP_RGB</title>
 5 |     <desc>Created with Sketch.</desc>
 6 |     <defs>
 7 |         <polygon id="path-1" points="0 87.0434084 75 87.0434084 75 0.23778135 0 0.23778135"></polygon>
 8 |     </defs>
 9 |     <g id="Page-1" stroke="none" stroke-width="1" fill="none" fill-rule="evenodd">
10 |         <g id="AP_RGB">
11 |             <g id="Group-4">
12 |                 <mask id="mask-2" fill="white">
13 |                     <use xlink:href="#path-1"></use>
14 |                 </mask>
15 |                 <g id="Clip-2"></g>
16 |                 <polygon id="Fill-1" fill="#FFFFFF" mask="url(#mask-2)" points="0.000723472669 77.9937299 75 77.9937299 75 0.23778135 0.000723472669 0.23778135"></polygon>
17 |                 <polyline id="Fill-3" fill="#FF322E" mask="url(#mask-2)" points="0 77.9937299 75 77.9937299 75 87.0578778 0.000723472669 87.0578778 0 77.9937299"></polyline>
18 |             </g>
19 |             <polyline id="Fill-5" fill="#000000" points="17.2632637 20.6990354 4.6562701 58.5147106 13.7840836 58.5147106 22.3471061 31.7852894 26.6389871 45.1485531 21.0870579 45.1485531 18.8698553 52.7768489 29.0889068 52.7768489 30.9315916 58.5147106 40.3203376 58.5147106 27.714791 20.6990354 17.2632637 20.6990354"></polyline>
20 |             <path d="M54.6639068,20.6990354 L41.5589228,20.6990354 L41.5589228,58.5147106 L50.6216238,58.5147106 L50.6216238,28.3275723 L54.2734727,28.3275723 C58.5108521,28.3275723 60.9233923,30.3482315 60.9233923,33.9995981 C60.9233923,37.5856109 58.5108521,39.6721061 54.2734727,39.6721061 L53.6867363,39.6721061 L53.6867363,47.3004019 L54.6639068,47.3004019 C64.4438103,47.3004019 70.1814309,42.3774116 70.1814309,33.9995981 C70.1814309,25.4585209 64.4438103,20.6990354 54.6639068,20.6990354" id="Fill-6" fill="#000000"></path>
21 |         </g>
22 |     </g>
23 | </svg>


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/.nbconvert_templates/ap_report/conf.json:
--------------------------------------------------------------------------------
 1 | {
 2 |     "base_template": "lab",
 3 |     "mimetypes": {
 4 |         "text/html": true
 5 |     },
 6 |     "preprocessors": {
 7 |         "100-pygments": {
 8 |             "type": "nbconvert.preprocessors.CSSHTMLHeaderPreprocessor",
 9 |             "enabled": true
10 |         }
11 |     }
12 | }
13 | 


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/.nbconvert_templates/ap_report/index.html.j2:
--------------------------------------------------------------------------------
 1 | {%- extends 'lab/index.html.j2' -%}
 2 | 
 3 | {%- block html_head_css -%}
 4 |   {{ super() }}
 5 |   {{ resources.include_css("static/style.css") }}
 6 | {%- endblock html_head_css -%}
 7 | 
 8 | {% block body_header %}
 9 |   <body class="jp-Notebook theme-light" style="padding: 0; margin: 0;">
10 |   <div class="header">
11 |     <div class="logo-container">
12 |       {% include "ap.svg" %}
13 |     </div>
14 |   </div>
15 |   <div style="padding: 20px; margin: 20px;">
16 | {% endblock body_header %}
17 | 
18 | {% block body_footer %}
19 |   </div>
20 |   </body>
21 | {% endblock body_footer %}


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/.nbconvert_templates/ap_report/static/style.css:
--------------------------------------------------------------------------------
 1 | 
 2 | @import url('https://fonts.googleapis.com/css2?family=Roboto&display=swap');
 3 | body {
 4 |   font-family: "Roboto", sans-serif;
 5 | }
 6 | h1, h2, h3 {
 7 |   font-family: "Roboto", sans-serif
 8 | }
 9 | .header {
10 |   overflow: hidden;
11 |   background-color: #f1f1f1;
12 |   padding: 20px 10px;
13 | }
14 | .logo-container {
15 |   /* margin: auto; */
16 |   width: fit-content;
17 | }
18 | img {
19 |   display: block;
20 |   margin: auto;
21 | }
22 | .jp-RenderedHTMLCommon {
23 |   width: 90%;
24 |   margin: auto;
25 | }
26 | /* not showing code cells */
27 | .jp-InputArea {
28 |   display: none;
29 | }


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/.set_kernel_path.sh:
--------------------------------------------------------------------------------
 1 | #!/bin/bash
 2 | 
 3 | ## Set kernels to run from where git was initialized (the project root)
 4 | ## If git wasn't initialized, notebooks run from where the notebook file is (the default)
 5 | 
 6 | JUPYTER_DATA_DIR=$(jupyter --data-dir)
 7 | KERNEL_PATH=$JUPYTER_DATA_DIR/kernels/python3 ## python3 is hardcoded, is there a way to print the name out?
 8 | VENV_DIR=$(pipenv --venv)
 9 | 
10 | ## Replace the default kernel.json with one that points to kernel.sh
11 | echo -e '{\n "argv": [' > $KERNEL_PATH/kernel.json
12 | echo "  \"$KERNEL_PATH/kernel.sh\"," >> $KERNEL_PATH/kernel.json
13 | echo -e '  "{connection_file}"\n ],\n "display_name": "{{cookiecutter.project_slug}}",\n "language":"python",\n "metadata":{\n "debugger":true\n }\n}' >> $KERNEL_PATH/kernel.json
14 | 
15 | ## kernel.sh first changes directory to git root before invoking the default command to launch a kernel
16 | echo -e '#!/bin/bash\ncd "$(git rev-parse --show-toplevel)"' > $KERNEL_PATH/kernel.sh
17 | echo "exec $VENV_DIR/bin/python -m ipykernel_launcher -f \"\$1\"" >> $KERNEL_PATH/kernel.sh
18 | 
19 | ## Execute permissions
20 | chmod 777 $KERNEL_PATH/kernel.sh
21 | 


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/README.md:
--------------------------------------------------------------------------------
 1 | # {{ cookiecutter.project_name }}
 2 | 
 3 | {{ cookiecutter.project_short_description }}
 4 | 
 5 | *Created by {{ cookiecutter.full_name }} (<{{ cookiecutter.email }}>)*
 6 | 
 7 | *Reporter: {{ cookiecutter.full_name }} (<{{ cookiecutter.email }}>)*
 8 | 
 9 | ## Project goal
10 | 
11 | *TK: Briefly describe this project*
12 | 
13 | ## Project notes
14 | 
15 | ### Staff involved
16 | 
17 | *TK: List people & contact info for people involved in the project*
18 | 
19 | [Responsibility matrix](url-to-responsibility matrix)
20 | 
21 | [HIRUFF Q&A](url-to-hiruff)
22 | 
23 | ### Data sources
24 | 
25 | *TK: List access info & contact info for data sources used in the project*
26 | 
27 | ## Technical
28 | 
29 | *TK: Instructions on how to bootstrap project, run ETL processes, etc.*
30 | 
31 | ### Project setup instructions
32 | 
33 | After cloning the git repo:
34 | 
35 | `datakit data pull` to retrieve the data files.
36 | 
37 | 
38 | *TK: For more complex or unusual projects additional directions follow*
39 | 
40 | ## Data notes
41 | 
42 | *Add important caveats, limitations, and source contact info here.*
43 | 


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/_quarto.yml:
--------------------------------------------------------------------------------
 1 | project:
 2 |   output-dir: data/html_reports
 3 |   execute-dir: project
 4 | number-sections: false
 5 | toc: true
 6 | toc-location: left
 7 | output:
 8 |   html_document:
 9 |     keep_md: false
10 |     keep_ipynb: false
11 | format:
12 |   html:
13 |     theme: simplex
14 |     smooth-scroll: true
15 |     embed-resources: true
16 |     standalone: true
17 |   pdf:
18 |     documentclass: report
19 |     margin-left: 30mm
20 |     margin-right: 30mm
21 | 


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/analysis/.gitkeep:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/associatedpress/cookiecutter-python-project/efc26a366d993177d7da84011d7437acf99f4a55/{{cookiecutter.project_slug}}/analysis/.gitkeep


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/analysis/archive/.gitkeep:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/associatedpress/cookiecutter-python-project/efc26a366d993177d7da84011d7437acf99f4a55/{{cookiecutter.project_slug}}/analysis/archive/.gitkeep


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/analysis/notebook_templates/ap_data_team/quarto.ipynb:
--------------------------------------------------------------------------------
 1 | {
 2 |  "cells": [
 3 |   {
 4 |    "cell_type": "raw",
 5 |    "id": "c250529d-3381-4cf1-b9e1-da590fa12691",
 6 |    "metadata": {},
 7 |    "source": [
 8 |     "---\n",
 9 |     "title: Analysis Notebook\n",
10 |     "author: Author (author@ap.org)\n",
11 |     "date: now\n",
12 |     "execute:\n",
13 |     "  echo: false\n",
14 |     "---"
15 |    ]
16 |   },
17 |   {
18 |    "cell_type": "code",
19 |    "execution_count": 1,
20 |    "id": "5a5a737a-2729-4624-8df3-c3d01c60e0ab",
21 |    "metadata": {
22 |     "vscode": {
23 |      "languageId": "python"
24 |     }
25 |    },
26 |    "outputs": [],
27 |    "source": [
28 |     "import pandas as pd\n",
29 |     "import numpy as np\n",
30 |     "import altair as alt\n",
31 |     "import itables\n",
32 |     "from IPython.display import Markdown,HTML\n",
33 |     "def print_markdown(string):\n",
34 |     "    display(Markdown(string))"
35 |    ]
36 |   },
37 |   {
38 |    "cell_type": "code",
39 |    "execution_count": null,
40 |    "id": "15bac761",
41 |    "metadata": {
42 |     "vscode": {
43 |      "languageId": "python"
44 |     }
45 |    },
46 |    "outputs": [],
47 |    "source": [
48 |     "HTML(\n",
49 |     "'''\n",
50 |     "<style>\n",
51 |     "    canvas.marks { display: block; margin: auto; }\n",
52 |     "    div.vega-embed { width: 100%; }\n",
53 |     "</style>\n",
54 |     "'''\n",
55 |     ")"
56 |    ]
57 |   }
58 |  ],
59 |  "metadata": {
60 |   "jupytext": {
61 |    "formats": "ipynb,Rmd"
62 |   },
63 |   "kernelspec": {
64 |    "display_name": "",
65 |    "name": ""
66 |   },
67 |   "language_info": {
68 |    "name": ""
69 |   }
70 |  },
71 |  "nbformat": 4,
72 |  "nbformat_minor": 5
73 | }
74 | 


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/analysis/notebook_templates/ap_data_team/rmarkdown.ipynb:
--------------------------------------------------------------------------------
 1 | {
 2 |  "cells": [
 3 |   {
 4 |    "cell_type": "raw",
 5 |    "metadata": {
 6 |     "tags": []
 7 |    },
 8 |    "source": [
 9 |     "---\n",
10 |     "title: \"Rmarkdown Analysis Notebook\"\n",
11 |     "author: \"Author (author@ap.org)\"\n",
12 |     "date: \"`r format(Sys.time(), '%d %B, %Y')`\"\n",
13 |     "output:\n",
14 |     "  html_document:\n",
15 |     "    fig_width: 8\n",
16 |     "    highlight: haddock\n",
17 |     "    keep_md: false\n",
18 |     "    theme: cerulean\n",
19 |     "    toc: yes\n",
20 |     "    toc_float:\n",
21 |     "      collapsed: false\n",
22 |     "---"
23 |    ]
24 |   },
25 |   {
26 |    "cell_type": "markdown",
27 |    "metadata": {
28 |     "jupyter": {
29 |      "source_hidden": true
30 |     },
31 |     "tags": []
32 |    },
33 |    "source": [
34 |     "```{r, setup, echo=F, include=F}\n",
35 |     "knitr::opts_knit$set(root.dir = '../')\n",
36 |     "knitr::opts_chunk$set(echo=FALSE)\n",
37 |     "library(DT)\n",
38 |     "library(reticulate)\n",
39 |     "library(vegawidget)\n",
40 |     "```"
41 |    ]
42 |   },
43 |   {
44 |    "cell_type": "markdown",
45 |    "metadata": {},
46 |    "source": [
47 |     "## Analysis Notebook"
48 |    ]
49 |   },
50 |   {
51 |    "cell_type": "code",
52 |    "execution_count": 1,
53 |    "metadata": {},
54 |    "outputs": [],
55 |    "source": [
56 |     "import pandas as pd\n",
57 |     "import numpy as np\n",
58 |     "import altair as alt"
59 |    ]
60 |   }
61 |  ],
62 |  "metadata": {
63 |   "jupytext": {
64 |    "formats": "ipynb,Rmd"
65 |   },
66 |   "language_info": {
67 |    "codemirror_mode": {
68 |     "name": "ipython",
69 |     "version": 3
70 |    },
71 |    "file_extension": ".py",
72 |    "mimetype": "text/x-python",
73 |    "name": "python",
74 |    "nbconvert_exporter": "python",
75 |    "pygments_lexer": "ipython3",
76 |    "version": "3.7.5"
77 |   }
78 |  },
79 |  "nbformat": 4,
80 |  "nbformat_minor": 4
81 | }
82 | 


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/data/.gitignore:
--------------------------------------------------------------------------------
1 | *
2 | !.gitignore
3 | !documentation
4 | !handmade
5 | !html_reports
6 | !processed
7 | !public
8 | !source


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/data/documentation/.gitignore:
--------------------------------------------------------------------------------
1 | *
2 | !.gitignore


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/data/handmade/.gitignore:
--------------------------------------------------------------------------------
1 | *
2 | !.gitignore


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/data/html_reports/.gitignore:
--------------------------------------------------------------------------------
1 | *
2 | !.gitignore


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/data/processed/.gitignore:
--------------------------------------------------------------------------------
1 | *
2 | !.gitignore


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/data/public/.gitignore:
--------------------------------------------------------------------------------
1 | *
2 | !.gitignore


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/data/source/.gitignore:
--------------------------------------------------------------------------------
1 | *
2 | !.gitignore


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/etl/.gitkeep:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/associatedpress/cookiecutter-python-project/efc26a366d993177d7da84011d7437acf99f4a55/{{cookiecutter.project_slug}}/etl/.gitkeep


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/publish/.gitkeep:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/associatedpress/cookiecutter-python-project/efc26a366d993177d7da84011d7437acf99f4a55/{{cookiecutter.project_slug}}/publish/.gitkeep


--------------------------------------------------------------------------------
/{{cookiecutter.project_slug}}/scratch/.gitkeep:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/associatedpress/cookiecutter-python-project/efc26a366d993177d7da84011d7437acf99f4a55/{{cookiecutter.project_slug}}/scratch/.gitkeep


--------------------------------------------------------------------------------