├── README.md ├── getAsinToSql ├── getAsinToSql │ ├── __init__.py │ ├── __pycache__ │ │ ├── __init__.cpython-36.pyc │ │ ├── items.cpython-36.pyc │ │ ├── middlewares.cpython-36.pyc │ │ ├── pipelines.cpython-36.pyc │ │ └── settings.cpython-36.pyc │ ├── cookies_17.txt │ ├── detail_page │ │ ├── amazon_jp_0918_0.csv │ │ └── amazon_jp_0918_450.csv │ ├── items.py │ ├── main_mysql.py │ ├── middlewares.py │ ├── pipelines.py │ ├── seed_words │ │ ├── 日本5000.xlsx │ │ └── 词频.xlsx │ ├── settings.py │ ├── spiders │ │ ├── AsinToMySql.py │ │ ├── __init__.py │ │ └── __pycache__ │ │ │ ├── AsinToMySql.cpython-36.pyc │ │ │ └── __init__.cpython-36.pyc │ ├── 创建数据库中的表和insert.py │ ├── 搭建cookie池-删除重新获取.py │ └── 搭建cookie池-日本.py └── scrapy.cfg └── 专爬详情页的-日本站使用Tor ├── Amazon_ASIN ├── __pycache__ │ ├── __init__.cpython-36.pyc │ ├── items.cpython-36.pyc │ ├── middlewares.cpython-36.pyc │ ├── pipelines.cpython-36.pyc │ └── settings.cpython-36.pyc ├── detail_page │ ├── __init__.py │ ├── detail_0918_0.csv │ ├── detail_0918_50.csv │ ├── detail_0927_15500.csv │ ├── detail_1007_28500.csv │ └── test_jp.csv ├── items.py ├── main.py ├── middlewares.py ├── pipelines.py ├── settings.py └── spiders │ ├── ASIN.py │ ├── __init__.py │ └── __pycache__ │ ├── ASIN.cpython-36.pyc │ └── __init__.cpython-36.pyc ├── __init__.py ├── requirements.txt ├── scrapy.cfg └── 命令.txt /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/README.md -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/__init__.py: -------------------------------------------------------------------------------- 1 | 2 | -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/__pycache__/__init__.cpython-36.pyc: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/__pycache__/__init__.cpython-36.pyc -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/__pycache__/items.cpython-36.pyc: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/__pycache__/items.cpython-36.pyc -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/__pycache__/middlewares.cpython-36.pyc: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/__pycache__/middlewares.cpython-36.pyc -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/__pycache__/pipelines.cpython-36.pyc: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/__pycache__/pipelines.cpython-36.pyc -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/__pycache__/settings.cpython-36.pyc: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/__pycache__/settings.cpython-36.pyc -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/cookies_17.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/cookies_17.txt -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/detail_page/amazon_jp_0918_0.csv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/detail_page/amazon_jp_0918_0.csv -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/detail_page/amazon_jp_0918_450.csv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/detail_page/amazon_jp_0918_450.csv -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/items.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/items.py -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/main_mysql.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/main_mysql.py -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/middlewares.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/middlewares.py -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/pipelines.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/pipelines.py -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/seed_words/日本5000.xlsx: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/seed_words/日本5000.xlsx -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/seed_words/词频.xlsx: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/seed_words/词频.xlsx -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/settings.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/settings.py -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/spiders/AsinToMySql.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/spiders/AsinToMySql.py -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/spiders/__init__.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/spiders/__init__.py -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/spiders/__pycache__/AsinToMySql.cpython-36.pyc: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/spiders/__pycache__/AsinToMySql.cpython-36.pyc -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/spiders/__pycache__/__init__.cpython-36.pyc: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/spiders/__pycache__/__init__.cpython-36.pyc -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/创建数据库中的表和insert.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/创建数据库中的表和insert.py -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/搭建cookie池-删除重新获取.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/搭建cookie池-删除重新获取.py -------------------------------------------------------------------------------- /getAsinToSql/getAsinToSql/搭建cookie池-日本.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/getAsinToSql/搭建cookie池-日本.py -------------------------------------------------------------------------------- /getAsinToSql/scrapy.cfg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/getAsinToSql/scrapy.cfg -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/__pycache__/__init__.cpython-36.pyc: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/__pycache__/__init__.cpython-36.pyc -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/__pycache__/items.cpython-36.pyc: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/__pycache__/items.cpython-36.pyc -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/__pycache__/middlewares.cpython-36.pyc: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/__pycache__/middlewares.cpython-36.pyc -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/__pycache__/pipelines.cpython-36.pyc: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/__pycache__/pipelines.cpython-36.pyc -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/__pycache__/settings.cpython-36.pyc: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/__pycache__/settings.cpython-36.pyc -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/detail_page/__init__.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/detail_page/__init__.py -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/detail_page/detail_0918_0.csv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/detail_page/detail_0918_0.csv -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/detail_page/detail_0918_50.csv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/detail_page/detail_0918_50.csv -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/detail_page/detail_0927_15500.csv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/detail_page/detail_0927_15500.csv -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/detail_page/detail_1007_28500.csv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/detail_page/detail_1007_28500.csv -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/detail_page/test_jp.csv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/detail_page/test_jp.csv -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/items.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/items.py -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/main.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/main.py -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/middlewares.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/middlewares.py -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/pipelines.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/pipelines.py -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/settings.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/settings.py -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/spiders/ASIN.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/spiders/ASIN.py -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/spiders/__init__.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/spiders/__init__.py -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/spiders/__pycache__/ASIN.cpython-36.pyc: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/spiders/__pycache__/ASIN.cpython-36.pyc -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/Amazon_ASIN/spiders/__pycache__/__init__.cpython-36.pyc: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/Amazon_ASIN/spiders/__pycache__/__init__.cpython-36.pyc -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/__init__.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/__init__.py -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/requirements.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/requirements.txt -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/scrapy.cfg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/scrapy.cfg -------------------------------------------------------------------------------- /专爬详情页的-日本站使用Tor/命令.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jevy146/scrapy_amazon_jp/HEAD/专爬详情页的-日本站使用Tor/命令.txt --------------------------------------------------------------------------------