├── .gitignore ├── LICENSE ├── README.md ├── qacrawler ├── README.md ├── __init__.py ├── crawler.py ├── driver_wrapper.py ├── google_dom_info.py ├── jeopardy.py ├── main.py └── sr_parser.py ├── requirements.txt └── tests ├── .DS_Store ├── data ├── cheese - Google Search.html ├── one_result.html ├── parsed.tsv └── tiny_dataset.json ├── main_test.py ├── study_codes.py ├── test_crawler.py └── test_jeopardy.py /.gitignore: -------------------------------------------------------------------------------- 1 | .DS_Store 2 | -------------------------------------------------------------------------------- /LICENSE: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nyu-dl/dl4ir-searchQA/HEAD/LICENSE -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nyu-dl/dl4ir-searchQA/HEAD/README.md -------------------------------------------------------------------------------- /qacrawler/README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nyu-dl/dl4ir-searchQA/HEAD/qacrawler/README.md -------------------------------------------------------------------------------- /qacrawler/__init__.py: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /qacrawler/crawler.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nyu-dl/dl4ir-searchQA/HEAD/qacrawler/crawler.py -------------------------------------------------------------------------------- /qacrawler/driver_wrapper.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nyu-dl/dl4ir-searchQA/HEAD/qacrawler/driver_wrapper.py -------------------------------------------------------------------------------- /qacrawler/google_dom_info.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nyu-dl/dl4ir-searchQA/HEAD/qacrawler/google_dom_info.py -------------------------------------------------------------------------------- /qacrawler/jeopardy.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nyu-dl/dl4ir-searchQA/HEAD/qacrawler/jeopardy.py -------------------------------------------------------------------------------- /qacrawler/main.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nyu-dl/dl4ir-searchQA/HEAD/qacrawler/main.py -------------------------------------------------------------------------------- /qacrawler/sr_parser.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nyu-dl/dl4ir-searchQA/HEAD/qacrawler/sr_parser.py -------------------------------------------------------------------------------- /requirements.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nyu-dl/dl4ir-searchQA/HEAD/requirements.txt -------------------------------------------------------------------------------- /tests/.DS_Store: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nyu-dl/dl4ir-searchQA/HEAD/tests/.DS_Store -------------------------------------------------------------------------------- /tests/data/cheese - Google Search.html: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nyu-dl/dl4ir-searchQA/HEAD/tests/data/cheese - Google Search.html -------------------------------------------------------------------------------- /tests/data/one_result.html: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nyu-dl/dl4ir-searchQA/HEAD/tests/data/one_result.html -------------------------------------------------------------------------------- /tests/data/parsed.tsv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nyu-dl/dl4ir-searchQA/HEAD/tests/data/parsed.tsv -------------------------------------------------------------------------------- /tests/data/tiny_dataset.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nyu-dl/dl4ir-searchQA/HEAD/tests/data/tiny_dataset.json -------------------------------------------------------------------------------- /tests/main_test.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nyu-dl/dl4ir-searchQA/HEAD/tests/main_test.py -------------------------------------------------------------------------------- /tests/study_codes.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nyu-dl/dl4ir-searchQA/HEAD/tests/study_codes.py -------------------------------------------------------------------------------- /tests/test_crawler.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nyu-dl/dl4ir-searchQA/HEAD/tests/test_crawler.py -------------------------------------------------------------------------------- /tests/test_jeopardy.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nyu-dl/dl4ir-searchQA/HEAD/tests/test_jeopardy.py --------------------------------------------------------------------------------