# pythonCrawl **Repository Path**: xhongc/pythonCrawl ## Basic Information - **Project Name**: pythonCrawl - **Description**: 些许 爬虫项目 - **Primary Language**: HTML - **License**: Not specified - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2018-02-23 - **Last Updated**: 2020-12-19 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README ## 一些爬虫项目
### scrapy-redis+MySQL 实现分布式爬虫 储存Mysql数据库 [lagou](https://github.com/xhongc/pythonCrawl/tree/master/lagou)
### 基于scrapy 实现模拟登陆 [login](https://github.com/xhongc/pythonCrawl/tree/master/login)
### 今日头条/豆瓣图书/淘宝美食/猫眼电影 requests+musql+multiprocessing 多进程爬虫 [爬虫系列](https://github.com/xhongc/pythonCrawl)
### IP 代理池爬虫(自动筛选储存) [IP爬虫](https://github.com/xhongc/pythonCrawl)
### 利用tesseract库实现简单验证码的识别 [验证码识别](https://github.com/xhongc/pythonCrawl)