# pythonCrawl

**Repository Path**: xhongc/pythonCrawl

## Basic Information

- **Project Name**: pythonCrawl
- **Description**: 些许 爬虫项目
- **Primary Language**: HTML
- **License**: Not specified
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2018-02-23
- **Last Updated**: 2020-12-19

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

## 一些爬虫项目

<br/>

### scrapy-redis+MySQL 实现分布式爬虫 储存Mysql数据库
[lagou](https://github.com/xhongc/pythonCrawl/tree/master/lagou)
<br>
### 基于scrapy 实现模拟登陆
[login](https://github.com/xhongc/pythonCrawl/tree/master/login)
<br>
### 今日头条/豆瓣图书/淘宝美食/猫眼电影 requests+musql+multiprocessing 多进程爬虫
[爬虫系列](https://github.com/xhongc/pythonCrawl)
<br>
### IP 代理池爬虫（自动筛选储存）
[IP爬虫](https://github.com/xhongc/pythonCrawl)
<br>
### 利用tesseract库实现简单验证码的识别
[验证码识别](https://github.com/xhongc/pythonCrawl)