├── .gitattributes ├── README.md ├── __init__.py ├── cookies.pkl ├── db ├── mysql.sql └── scrapy_result.sql ├── js_files ├── jquery.min.js └── jquery.xpath.js ├── rw_cookies.py ├── scrapy.cfg ├── start_scrape_taobao.py ├── taobao_crawler ├── __init__.py ├── items.py ├── pipelines.py ├── settings.py ├── spiders │ ├── __init__.py │ ├── dmoz.py │ ├── taobao_1.py │ └── taobao_2.py └── useragents.txt ├── 淘宝_空调_爬虫结果.xlsx └── 爬取淘宝数据的方法.md /.gitattributes: -------------------------------------------------------------------------------- 1 | js_files/* linguist-vendored 2 | -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/EastonLee/Taobao_Crawler/HEAD/README.md -------------------------------------------------------------------------------- /__init__.py: -------------------------------------------------------------------------------- 1 | __author__ = 'easton' 2 | -------------------------------------------------------------------------------- /cookies.pkl: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /db/mysql.sql: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/EastonLee/Taobao_Crawler/HEAD/db/mysql.sql -------------------------------------------------------------------------------- /db/scrapy_result.sql: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/EastonLee/Taobao_Crawler/HEAD/db/scrapy_result.sql -------------------------------------------------------------------------------- /js_files/jquery.min.js: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/EastonLee/Taobao_Crawler/HEAD/js_files/jquery.min.js -------------------------------------------------------------------------------- /js_files/jquery.xpath.js: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/EastonLee/Taobao_Crawler/HEAD/js_files/jquery.xpath.js -------------------------------------------------------------------------------- /rw_cookies.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/EastonLee/Taobao_Crawler/HEAD/rw_cookies.py -------------------------------------------------------------------------------- /scrapy.cfg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/EastonLee/Taobao_Crawler/HEAD/scrapy.cfg -------------------------------------------------------------------------------- /start_scrape_taobao.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/EastonLee/Taobao_Crawler/HEAD/start_scrape_taobao.py -------------------------------------------------------------------------------- /taobao_crawler/__init__.py: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /taobao_crawler/items.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/EastonLee/Taobao_Crawler/HEAD/taobao_crawler/items.py -------------------------------------------------------------------------------- /taobao_crawler/pipelines.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/EastonLee/Taobao_Crawler/HEAD/taobao_crawler/pipelines.py -------------------------------------------------------------------------------- /taobao_crawler/settings.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/EastonLee/Taobao_Crawler/HEAD/taobao_crawler/settings.py -------------------------------------------------------------------------------- /taobao_crawler/spiders/__init__.py: -------------------------------------------------------------------------------- 1 | # Place here all your scrapy spiders 2 | -------------------------------------------------------------------------------- /taobao_crawler/spiders/dmoz.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/EastonLee/Taobao_Crawler/HEAD/taobao_crawler/spiders/dmoz.py -------------------------------------------------------------------------------- /taobao_crawler/spiders/taobao_1.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/EastonLee/Taobao_Crawler/HEAD/taobao_crawler/spiders/taobao_1.py -------------------------------------------------------------------------------- /taobao_crawler/spiders/taobao_2.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/EastonLee/Taobao_Crawler/HEAD/taobao_crawler/spiders/taobao_2.py -------------------------------------------------------------------------------- /taobao_crawler/useragents.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/EastonLee/Taobao_Crawler/HEAD/taobao_crawler/useragents.txt -------------------------------------------------------------------------------- /淘宝_空调_爬虫结果.xlsx: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/EastonLee/Taobao_Crawler/HEAD/淘宝_空调_爬虫结果.xlsx -------------------------------------------------------------------------------- /爬取淘宝数据的方法.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/EastonLee/Taobao_Crawler/HEAD/爬取淘宝数据的方法.md --------------------------------------------------------------------------------