# pythonCrawl
**Repository Path**: xhongc/pythonCrawl
## Basic Information
- **Project Name**: pythonCrawl
- **Description**: 些许 爬虫项目
- **Primary Language**: HTML
- **License**: Not specified
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No
## Statistics
- **Stars**: 0
- **Forks**: 0
- **Created**: 2018-02-23
- **Last Updated**: 2020-12-19
## Categories & Tags
**Categories**: Uncategorized
**Tags**: None
## README
## 一些爬虫项目
### scrapy-redis+MySQL 实现分布式爬虫 储存Mysql数据库
[lagou](https://github.com/xhongc/pythonCrawl/tree/master/lagou)
### 基于scrapy 实现模拟登陆
[login](https://github.com/xhongc/pythonCrawl/tree/master/login)
### 今日头条/豆瓣图书/淘宝美食/猫眼电影 requests+musql+multiprocessing 多进程爬虫
[爬虫系列](https://github.com/xhongc/pythonCrawl)
### IP 代理池爬虫(自动筛选储存)
[IP爬虫](https://github.com/xhongc/pythonCrawl)
### 利用tesseract库实现简单验证码的识别
[验证码识别](https://github.com/xhongc/pythonCrawl)