├── .flake8
├── .github
    ├── ISSUE_TEMPLATE
    │   ├── feature_request.md
    │   └── improve-parser.md
    ├── PULL_REQUEST_TEMPLATE.md
    └── workflows
    │   └── python-publish.yml
├── .gitignore
├── .pre-commit-config.yaml
├── LICENSE
├── MANIFEST.in
├── README.md
├── docs
    ├── Makefile
    ├── conf.py
    └── index.rst
├── measure_performance
    └── test_data
    │   ├── labeled.xml
    │   ├── multi_word_state_addresses.xml
    │   ├── simple_address_patterns.xml
    │   ├── synthetic_clean_osm_data.xml
    │   ├── synthetic_osm_data.xml
    │   └── us50_test_tagged.xml
├── parse_scripts
    ├── import_osm.py
    ├── parse.py
    └── parse_openaddress.py
├── pyproject.toml
├── raw
    ├── LICENSE.md
    ├── openaddresses
    │   └── us-ia-linn.json
    ├── osm_data.xml
    ├── osm_data_full_addr.xml
    ├── osm_data_street.xml
    ├── us50.test.raw
    ├── us50.test.tagged
    ├── us50.train.raw
    └── us50.train.tagged
├── setup.py
├── tests
    ├── test_labeling.py
    ├── test_tagging.py
    ├── test_token_features.py
    └── test_tokenizing.py
├── training
    ├── README.md
    ├── example_training.xml
    ├── labeled.xml
    ├── multi_word_state_addresses.xml
    ├── openaddress_us_ia_linn.xml
    ├── synthetic_clean_osm_data.xml
    ├── synthetic_osm_data_xml.xml
    ├── unparseable.csv
    ├── us50_messiest_manual_label.xml
    └── us50_train_tagged.xml
└── usaddress
    ├── __init__.py
    └── __init__.pyi

/.flake8:
--------------------------------------------------------------------------------
[flake8]
max-line-length=160
extend-ignore = E203
--------------------------------------------------------------------------------
/.github/ISSUE_TEMPLATE/feature_request.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/.github/ISSUE_TEMPLATE/feature_request.md
--------------------------------------------------------------------------------
/.github/ISSUE_TEMPLATE/improve-parser.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/.github/ISSUE_TEMPLATE/improve-parser.md
--------------------------------------------------------------------------------
/.github/PULL_REQUEST_TEMPLATE.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/.github/PULL_REQUEST_TEMPLATE.md
--------------------------------------------------------------------------------
/.github/workflows/python-publish.yml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/.github/workflows/python-publish.yml
--------------------------------------------------------------------------------
/.gitignore:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/.gitignore
--------------------------------------------------------------------------------
/.pre-commit-config.yaml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/.pre-commit-config.yaml
--------------------------------------------------------------------------------
/LICENSE:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/LICENSE
--------------------------------------------------------------------------------
/MANIFEST.in:
--------------------------------------------------------------------------------
include training/*
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/README.md
--------------------------------------------------------------------------------
/docs/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/docs/Makefile
--------------------------------------------------------------------------------
/docs/conf.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/docs/conf.py
--------------------------------------------------------------------------------
/docs/index.rst:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/docs/index.rst
--------------------------------------------------------------------------------
/measure_performance/test_data/labeled.xml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/measure_performance/test_data/labeled.xml
--------------------------------------------------------------------------------
/measure_performance/test_data/multi_word_state_addresses.xml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/measure_performance/test_data/multi_word_state_addresses.xml
--------------------------------------------------------------------------------
/measure_performance/test_data/simple_address_patterns.xml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/measure_performance/test_data/simple_address_patterns.xml
--------------------------------------------------------------------------------
/measure_performance/test_data/synthetic_clean_osm_data.xml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/measure_performance/test_data/synthetic_clean_osm_data.xml
--------------------------------------------------------------------------------
/measure_performance/test_data/synthetic_osm_data.xml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/measure_performance/test_data/synthetic_osm_data.xml
--------------------------------------------------------------------------------
/measure_performance/test_data/us50_test_tagged.xml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/measure_performance/test_data/us50_test_tagged.xml
--------------------------------------------------------------------------------
/parse_scripts/import_osm.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/parse_scripts/import_osm.py
--------------------------------------------------------------------------------
/parse_scripts/parse.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/parse_scripts/parse.py
--------------------------------------------------------------------------------
/parse_scripts/parse_openaddress.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/parse_scripts/parse_openaddress.py
--------------------------------------------------------------------------------
/pyproject.toml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/pyproject.toml
--------------------------------------------------------------------------------
/raw/LICENSE.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/raw/LICENSE.md
--------------------------------------------------------------------------------
/raw/openaddresses/us-ia-linn.json:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/raw/openaddresses/us-ia-linn.json
--------------------------------------------------------------------------------
/raw/osm_data.xml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/raw/osm_data.xml
--------------------------------------------------------------------------------
/raw/osm_data_full_addr.xml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/raw/osm_data_full_addr.xml
--------------------------------------------------------------------------------
/raw/osm_data_street.xml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/raw/osm_data_street.xml
--------------------------------------------------------------------------------
/raw/us50.test.raw:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/raw/us50.test.raw
--------------------------------------------------------------------------------
/raw/us50.test.tagged:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/raw/us50.test.tagged
--------------------------------------------------------------------------------
/raw/us50.train.raw:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/raw/us50.train.raw
--------------------------------------------------------------------------------
/raw/us50.train.tagged:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/raw/us50.train.tagged
--------------------------------------------------------------------------------
/setup.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/setup.py
--------------------------------------------------------------------------------
/tests/test_labeling.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/tests/test_labeling.py
--------------------------------------------------------------------------------
/tests/test_tagging.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/tests/test_tagging.py
--------------------------------------------------------------------------------
/tests/test_token_features.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/tests/test_token_features.py
--------------------------------------------------------------------------------
/tests/test_tokenizing.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/tests/test_tokenizing.py
--------------------------------------------------------------------------------
/training/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/training/README.md
--------------------------------------------------------------------------------
/training/example_training.xml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/training/example_training.xml
--------------------------------------------------------------------------------
/training/labeled.xml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/training/labeled.xml
--------------------------------------------------------------------------------
/training/multi_word_state_addresses.xml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/training/multi_word_state_addresses.xml
--------------------------------------------------------------------------------
/training/openaddress_us_ia_linn.xml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/training/openaddress_us_ia_linn.xml
--------------------------------------------------------------------------------
/training/synthetic_clean_osm_data.xml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/training/synthetic_clean_osm_data.xml
--------------------------------------------------------------------------------
/training/synthetic_osm_data_xml.xml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/training/synthetic_osm_data_xml.xml
--------------------------------------------------------------------------------
/training/unparseable.csv:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/training/unparseable.csv
--------------------------------------------------------------------------------
/training/us50_messiest_manual_label.xml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/training/us50_messiest_manual_label.xml
--------------------------------------------------------------------------------
/training/us50_train_tagged.xml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/training/us50_train_tagged.xml
--------------------------------------------------------------------------------
/usaddress/__init__.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/usaddress/__init__.py
--------------------------------------------------------------------------------
/usaddress/__init__.pyi:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/datamade/usaddress/HEAD/usaddress/__init__.pyi
--------------------------------------------------------------------------------