It allows you to download a website from the Internet to a local directory, recursively building all directories and fetching HTML, images, and other files from the server to your computer.
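As a rough illustration of how a mirroring tool might lay out downloaded files, the sketch below maps a URL to a path under a local directory so that the site's directory structure is reproduced on disk. The function name `local_path_for`, the `mirror` root directory, and the `index.html` fallback are assumptions made for this example; they are not taken from the tool described above.

```python
import os
from urllib.parse import urlsplit


def local_path_for(url, root="mirror"):
    """Map a URL to a file path under `root`, mirroring the site's layout.

    Hypothetical helper: the name, the default root, and the index.html
    fallback are illustrative choices, not the tool's actual behavior.
    """
    parts = urlsplit(url)
    path = parts.path
    # Treat directory-style URLs (empty path or trailing slash) as index pages.
    if not path or path.endswith("/"):
        path += "index.html"
    # Split on "/" and drop empty, "." and ".." segments so the result
    # always stays inside `root`.
    segments = [s for s in path.split("/") if s and s not in (".", "..")]
    host = parts.hostname or "unknown-host"
    return os.path.join(root, host, *segments)


def ensure_parent_dir(file_path):
    """Create the directories leading up to `file_path` if they are missing."""
    parent = os.path.dirname(file_path)
    if parent:
        os.makedirs(parent, exist_ok=True)
```

A downloader would call `local_path_for` on each fetched URL, call `ensure_parent_dir` on the result, and then write the response body to that path.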
This is a Python-based website crawling script with random time intervals between requests, User-Agent switching, and IP rotation through proxy servers, intended to evade the website's bot detection and avoid getting blocked.
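The three techniques named above could be combined roughly as follows, using only the standard library. This is a minimal sketch, not the script's actual code: the `USER_AGENTS` and `PROXIES` lists hold placeholder values, and the names `build_opener_and_request` and `polite_fetch` are hypothetical.

```python
import itertools
import random
import time
import urllib.request

# Placeholder values for illustration only; a real script would load its own.
USER_AGENTS = [
    "Mozilla/5.0 (X11; Linux x86_64) ExampleBrowser/1.0",
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) ExampleBrowser/2.0",
]
PROXIES = ["http://127.0.0.1:8080", "http://127.0.0.1:8081"]

# Cycle through the proxies so consecutive requests leave from different IPs.
_proxy_cycle = itertools.cycle(PROXIES)


def build_opener_and_request(url):
    """Pick the next proxy and a random User-Agent for one request."""
    proxy = next(_proxy_cycle)
    handler = urllib.request.ProxyHandler({"http": proxy, "https": proxy})
    opener = urllib.request.build_opener(handler)
    request = urllib.request.Request(
        url, headers={"User-Agent": random.choice(USER_AGENTS)}
    )
    return opener, request, proxy


def polite_fetch(url, min_delay=1.0, max_delay=5.0, timeout=10):
    """Sleep for a random interval, then fetch `url` through the next proxy."""
    time.sleep(random.uniform(min_delay, max_delay))
    opener, request, _proxy = build_opener_and_request(url)
    with opener.open(request, timeout=timeout) as response:
        return response.read()
```

Round-robin rotation is only one possible policy; choosing a proxy at random or dropping proxies that fail would fit the same structure.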
This is a project that demonstrates the use of standard Python libraries such as os, urllib, and HTMLParser to create a minimalist webpage crawler that crawls the pages of a website to gather hyperlinks (URLs).
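A crawler of this kind might look roughly like the sketch below. It assumes Python 3, where the parser lives in `html.parser` and the URL helpers in `urllib.parse` and `urllib.request`; the class and function names (`LinkParser`, `extract_links`, `crawl`) and the `max_pages` limit are hypothetical, and the `os` module mentioned above is not needed for this part.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlsplit
from urllib.request import urlopen


class LinkParser(HTMLParser):
    """Collect the href of every <a> tag as an absolute URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative links and drop any #fragment.
                absolute, _fragment = urldefrag(urljoin(self.base_url, value))
                self.links.append(absolute)


def extract_links(html, base_url):
    """Return the absolute URLs of all hyperlinks found in `html`."""
    parser = LinkParser(base_url)
    parser.feed(html)
    return parser.links


def crawl(start_url, max_pages=20):
    """Breadth-first crawl limited to the start URL's host; returns URLs seen."""
    host = urlsplit(start_url).netloc
    seen, queue = {start_url}, deque([start_url])
    fetched = 0
    while queue and fetched < max_pages:
        url = queue.popleft()
        try:
            with urlopen(url, timeout=10) as response:
                html = response.read().decode("utf-8", errors="replace")
        except (OSError, ValueError):
            continue  # skip pages that fail to load
        fetched += 1
        for link in extract_links(html, url):
            if urlsplit(link).netloc == host and link not in seen:
                seen.add(link)
                queue.append(link)
    return seen
```

Keeping `extract_links` separate from `crawl` lets the parsing step be exercised on a plain HTML string without any network access.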