# Python-spider

**Repository Path**: frank_yeats/Python-spider

## Basic Information

- **Project Name**: Python-spider
- **Description**: No description available
- **Primary Language**: Python
- **License**: Not specified
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2020-09-06
- **Last Updated**: 2020-12-19

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

# Python Web Crawlers

[Reference site](https://www.runoob.com/w3cnote/python-spider-intro.html)

1. requests library: fetches HTML pages and submits network requests automatically
2. robots.txt: the Robots Exclusion Standard for web crawlers
3. Beautiful Soup library: parses HTML pages
4. Projects A/B: hands-on projects
5. RE: regular expressions in detail, used to extract key information from pages
6. Scrapy*: an introduction to how web crawlers work and to a professional crawling framework

IDE:

Text editors:

1. IDLE
2. Sublime Text

Integrated tools:

1. VS Code
2. PyCharm
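
Two of the topics in the outline above, robots.txt and regular expressions, can be tried out offline before any page is fetched. The following is a minimal sketch using only the Python standard library (`urllib.robotparser` and `re`). The robots.txt rules, the HTML snippet, and the helper name `allowed_links` are all made up for illustration and do not come from this repository. In a real crawler the HTML would typically come from requests, and Beautiful Soup would usually be a more robust way to extract links than a regular expression.

```python
import re
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, invented for this example.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""

# Hypothetical HTML page, invented for this example.
HTML = """\
<html><body>
<a href="/docs/intro.html">Intro</a>
<a href="/private/admin.html">Admin</a>
<a href="/docs/regex.html">Regex</a>
</body></html>
"""


def allowed_links(robots_txt, html, user_agent="*"):
    """Return the hrefs found in `html` that `robots_txt` lets `user_agent` fetch."""
    parser = RobotFileParser()
    # parse() accepts an iterable of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    # Simple pattern for double-quoted href attributes; good enough for this
    # snippet, but not a general HTML parser.
    hrefs = re.findall(r'href="([^"]+)"', html)
    return [href for href in hrefs if parser.can_fetch(user_agent, href)]


links = allowed_links(ROBOTS_TXT, HTML)
print(links)
```

Here the link under `/private/` is filtered out because the sample rules disallow that path for every user agent, while the two `/docs/` links are kept.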