# webcrawler_buk

**Repository Path**: bukcn/webcrawler_buk

## Basic Information

- **Project Name**: webcrawler_buk
- **License**: Apache-2.0
- **Default Branch**: main

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2025-04-28
- **Last Updated**: 2025-11-24

## README

Install Scrapy and scrapy-deltafetch (the commands below use the Tsinghua PyPI mirror):

    pip install scrapy -i https://pypi.tuna.tsinghua.edu.cn/simple
    pip install scrapy-deltafetch -i https://pypi.tuna.tsinghua.edu.cn/simple

# Run

    scrapy crawl quotes

## Shell

Open an interactive Scrapy shell on the first page and try the selectors:

    scrapy shell 'https://quotes.toscrape.com/page/1/'
    >>> response.css("title")
    >>> response.css("title::text").getall()
    >>> response.css("title::text").get()
    >>> response.css("title::text")[0].get()
    >>> response.css("title::text").re(r"Quotes.*")
    >>> response.css("title::text").re(r"Q\w+")
    >>> response.css("title::text").re(r"(\w+) to (\w+)")
    >>> response.xpath("//title")

# Serve the downloaded site

```shell
cd ....
python3 -m http.server 8001
```
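
The `python3 -m http.server 8001` command above can also be started from Python, which avoids the `cd` step by passing the folder explicitly. The following is a minimal sketch using only the standard library. The helper name `make_server` and the folder name `site` are assumptions for illustration and are not part of this repository.

```python
import functools
import http.server
from pathlib import Path


def make_server(directory, port=8001, host="127.0.0.1"):
    """Build an HTTP server that serves static files from `directory`.

    `make_server` is a hypothetical helper, not part of this project.
    """
    # SimpleHTTPRequestHandler accepts a `directory` keyword (Python 3.7+),
    # so the working directory does not need to change.
    handler = functools.partial(
        http.server.SimpleHTTPRequestHandler, directory=str(directory)
    )
    return http.server.ThreadingHTTPServer((host, port), handler)


if __name__ == "__main__":
    # "site" is a placeholder for the folder holding the downloaded pages.
    server = make_server(Path("site"), port=8001)
    print("Serving on http://127.0.0.1:8001/ (Ctrl+C to stop)")
    try:
        server.serve_forever()
    except KeyboardInterrupt:
        server.server_close()
```

Binding to `127.0.0.1` keeps the server reachable only from the local machine. The command-line form binds to all interfaces by default, so pass `host="0.0.0.0"` if that behavior is wanted.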