├── .github
    └── workflows
    │   └── gh-pages.yml
├── .gitignore
├── CODE_OF_CONDUCT.md
├── CONTRIBUTING.md
├── LICENSE.md
├── README.md
├── csv_example
    ├── csv_evaluation.py
    ├── csv_example.py
    ├── csv_example_input_with_true_ids.csv
    ├── csv_example_messy_input.csv
    └── requirements.txt
├── extended-variables
    ├── officers.csv
    ├── officers.py
    └── requirements.txt
├── gazetteer_example
    ├── README.md
    ├── data
    │   ├── AbtBuy_Abt.csv
    │   └── AbtBuy_Buy.csv
    ├── gazetteer_evaluation.py
    ├── gazetteer_example.py
    ├── gazetteer_postgres_example.py
    ├── requirements-1.x.txt
    └── requirements-2.x.txt
├── mysql_example
    ├── README.md
    ├── mysql.cnf_LOCAL
    ├── mysql_example.py
    ├── mysql_init_db.py
    └── requirements.txt
├── patent_example
    ├── README.md
    ├── patent_evaluation.py
    ├── patent_example.py
    ├── patstat_input.csv
    └── patstat_reference.csv
├── pgsql_big_dedupe_example
    ├── README.md
    ├── pgsql_big_dedupe_example.py
    ├── pgsql_big_dedupe_example_init_db.py
    └── requirements.txt
├── record_linkage_example
    ├── AbtBuy_Abt.csv
    ├── AbtBuy_Buy.csv
    ├── record_linkage_example.py
    ├── record_linkage_example_evaluation.py
    └── requirements.txt
└── requirements.txt

/.github/workflows/gh-pages.yml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/.github/workflows/gh-pages.yml
--------------------------------------------------------------------------------
/.gitignore:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/.gitignore
--------------------------------------------------------------------------------
/CODE_OF_CONDUCT.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/CODE_OF_CONDUCT.md
--------------------------------------------------------------------------------
/CONTRIBUTING.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/CONTRIBUTING.md
--------------------------------------------------------------------------------
/LICENSE.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/LICENSE.md
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/README.md
--------------------------------------------------------------------------------
/csv_example/csv_evaluation.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/csv_example/csv_evaluation.py
--------------------------------------------------------------------------------
/csv_example/csv_example.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/csv_example/csv_example.py
--------------------------------------------------------------------------------
/csv_example/csv_example_input_with_true_ids.csv:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/csv_example/csv_example_input_with_true_ids.csv
--------------------------------------------------------------------------------
/csv_example/csv_example_messy_input.csv:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/csv_example/csv_example_messy_input.csv
--------------------------------------------------------------------------------
/csv_example/requirements.txt:
--------------------------------------------------------------------------------
unidecode
--------------------------------------------------------------------------------
/extended-variables/officers.csv:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/extended-variables/officers.csv
--------------------------------------------------------------------------------
/extended-variables/officers.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/extended-variables/officers.py
--------------------------------------------------------------------------------
/extended-variables/requirements.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/extended-variables/requirements.txt
--------------------------------------------------------------------------------
/gazetteer_example/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/gazetteer_example/README.md
--------------------------------------------------------------------------------
/gazetteer_example/data/AbtBuy_Abt.csv:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/gazetteer_example/data/AbtBuy_Abt.csv
--------------------------------------------------------------------------------
/gazetteer_example/data/AbtBuy_Buy.csv:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/gazetteer_example/data/AbtBuy_Buy.csv
--------------------------------------------------------------------------------
/gazetteer_example/gazetteer_evaluation.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/gazetteer_example/gazetteer_evaluation.py
--------------------------------------------------------------------------------
/gazetteer_example/gazetteer_example.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/gazetteer_example/gazetteer_example.py
--------------------------------------------------------------------------------
/gazetteer_example/gazetteer_postgres_example.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/gazetteer_example/gazetteer_postgres_example.py
--------------------------------------------------------------------------------
/gazetteer_example/requirements-1.x.txt:
--------------------------------------------------------------------------------
dj-database-url
psycopg2
dedupe<2.0.0
--------------------------------------------------------------------------------
/gazetteer_example/requirements-2.x.txt:
--------------------------------------------------------------------------------
dedupe>=2.0.0
unidecode
--------------------------------------------------------------------------------
/mysql_example/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/mysql_example/README.md
--------------------------------------------------------------------------------
/mysql_example/mysql.cnf_LOCAL:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/mysql_example/mysql.cnf_LOCAL
--------------------------------------------------------------------------------
/mysql_example/mysql_example.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/mysql_example/mysql_example.py
--------------------------------------------------------------------------------
/mysql_example/mysql_init_db.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/mysql_example/mysql_init_db.py
--------------------------------------------------------------------------------
/mysql_example/requirements.txt:
--------------------------------------------------------------------------------
mysqlclient
--------------------------------------------------------------------------------
/patent_example/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/patent_example/README.md
--------------------------------------------------------------------------------
/patent_example/patent_evaluation.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/patent_example/patent_evaluation.py
--------------------------------------------------------------------------------
/patent_example/patent_example.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/patent_example/patent_example.py
--------------------------------------------------------------------------------
/patent_example/patstat_input.csv:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/patent_example/patstat_input.csv
--------------------------------------------------------------------------------
/patent_example/patstat_reference.csv:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/patent_example/patstat_reference.csv
--------------------------------------------------------------------------------
/pgsql_big_dedupe_example/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/pgsql_big_dedupe_example/README.md
--------------------------------------------------------------------------------
/pgsql_big_dedupe_example/pgsql_big_dedupe_example.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/pgsql_big_dedupe_example/pgsql_big_dedupe_example.py
--------------------------------------------------------------------------------
/pgsql_big_dedupe_example/pgsql_big_dedupe_example_init_db.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/pgsql_big_dedupe_example/pgsql_big_dedupe_example_init_db.py
--------------------------------------------------------------------------------
/pgsql_big_dedupe_example/requirements.txt:
--------------------------------------------------------------------------------
dj-database-url>=0.3.0
psycopg2>=2.5.4
unidecode>=0.04.16
requests
--------------------------------------------------------------------------------
/record_linkage_example/AbtBuy_Abt.csv:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/record_linkage_example/AbtBuy_Abt.csv
--------------------------------------------------------------------------------
/record_linkage_example/AbtBuy_Buy.csv:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/record_linkage_example/AbtBuy_Buy.csv
--------------------------------------------------------------------------------
/record_linkage_example/record_linkage_example.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/record_linkage_example/record_linkage_example.py
--------------------------------------------------------------------------------
/record_linkage_example/record_linkage_example_evaluation.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dedupeio/dedupe-examples/HEAD/record_linkage_example/record_linkage_example_evaluation.py
--------------------------------------------------------------------------------
/record_linkage_example/requirements.txt:
--------------------------------------------------------------------------------
unidecode
--------------------------------------------------------------------------------
/requirements.txt:
--------------------------------------------------------------------------------
dedupe>=3.0.0
Unidecode==0.4.16
--------------------------------------------------------------------------------