Scrapy, a fast high-level web crawling & scraping framework for Python.
Python 44.4k 9.7k
The scrapy.org website
HTML 48 154
Library to populate items using XPath and CSS with a convenient API
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
Python library of web-related functions
Common interface for data container classes
A service daemon to run Scrapy spiders
A pure-Python robots.txt parser with support for modern conventions.
https://mimesniff.spec.whatwg.org/ implementation for Python
This is a sample Scrapy project for educational purposes