├── README.md
├── config.py
├── log.py
└── tiktok_spider.py


/README.md:
--------------------------------------------------------------------------------
# tiktok_spider
Scrape Douyin comments (2024)

Note: This project is for hobby exploration only. Do not use it commercially or for illegal research; you alone bear any consequences, and the author accepts no responsibility.


First, fill in the `cookie` field in config.py.

![image-20240301182658129](https://gitee.com/yuejinjianke/tuchuang/raw/master/image/image-20240301182658129.png)

Next, log in to the web version of Douyin, search for a keyword, and open the video whose comments you want to scrape.

The video id appears in the Network tab of the browser developer tools (F12) or in the URL bar.

![image-20240301182815697](https://gitee.com/yuejinjianke/tuchuang/raw/master/image/image-20240301182815697.png)

Enter it here:

![image-20240301182919410](https://gitee.com/yuejinjianke/tuchuang/raw/master/image/image-20240301182919410.png)



Final results:

![image-20240301182958710](https://gitee.com/yuejinjianke/tuchuang/raw/master/image/image-20240301182958710.png)

![image-20240301183021296](https://gitee.com/yuejinjianke/tuchuang/raw/master/image/image-20240301183021296.png)

![image-20240301183029087](https://gitee.com/yuejinjianke/tuchuang/raw/master/image/image-20240301183029087.png)




Original article

https://mp.weixin.qq.com/s/3mZ66SBusCsZg7lqcusMzg

The source code has since been taken down. If you need it, contact the author on WeChat through the official account.

--------------------------------------------------------------------------------
/config.py:
--------------------------------------------------------------------------------
# cookie field [required]
cookie = ''


--------------------------------------------------------------------------------
/log.py:
--------------------------------------------------------------------------------
import logging
from logging.handlers import TimedRotatingFileHandler
import time
import os

class Log_Recorder:
    def __init__(self):
        self.logger = self.get_logger()

    def get_logger(self):
        self.logger = logging.getLogger(__name__)
        # logging.getLogger returns the same object on every call, so skip
        # setup when handlers are already attached (avoids duplicate log lines)
        if self.logger.handlers:
            return self.logger
        # Log format
        formatter = logging.Formatter('[%(asctime)s-%(filename)s][%(funcName)s-%(lineno)d]--%(message)s',
                                      datefmt='%Y-%m-%d %H:%M:%S')
        # Log level
        self.logger.setLevel(logging.DEBUG)
        # Console log
        sh = logging.StreamHandler()
        sh.setFormatter(formatter)
        self.logger.addHandler(sh)
        # Log file, named after today's date and stored under ./logs/
        case_dir = r'./logs/'
        os.makedirs(case_dir, exist_ok=True)
        info_file_name = time.strftime("%Y-%m-%d") + '.log'
        # Rotate at midnight and keep the 7 most recent backups
        info_handler = TimedRotatingFileHandler(filename=case_dir + info_file_name,
                                                when='MIDNIGHT',
                                                interval=1,
                                                backupCount=7,
                                                encoding='utf-8')
        info_handler.setFormatter(formatter)
        self.logger.addHandler(info_handler)
        return self.logger

--------------------------------------------------------------------------------
/tiktok_spider.py:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
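
The README says the video id can be read from the URL bar. As a rough illustration of that step, the sketch below pulls the id out of a URL string. It is not part of the original repository (whose spider source has been removed). The function name `extract_video_id` is hypothetical, and the two URL shapes it handles (`/video/<digits>` in the path and a `modal_id=<digits>` query parameter) are assumptions about how the Douyin web pages expose the id. The id in the example call is made up.

```python
import re
from typing import Optional
from urllib.parse import parse_qs, urlparse


def extract_video_id(url: str) -> Optional[str]:
    """Return the numeric video id found in a Douyin web URL, or None."""
    parsed = urlparse(url)
    # Assumed shape 1: https://www.douyin.com/video/<digits>
    match = re.search(r"/video/(\d+)", parsed.path)
    if match:
        return match.group(1)
    # Assumed shape 2: ...?modal_id=<digits> (video opened as an overlay)
    modal_ids = parse_qs(parsed.query).get("modal_id", [])
    if modal_ids and modal_ids[0].isdigit():
        return modal_ids[0]
    # Neither assumed shape matched
    return None


# Hypothetical id, used only for illustration
print(extract_video_id("https://www.douyin.com/video/1234567890123456789"))
```

If neither shape matches, the function returns `None`, in which case the id would have to be copied by hand from the Network tab as the README describes.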