Part of lp.registry.scripts.productreleasefinder.walker View In Hierarchy
This class implements a walker for the HTTP and HTTPS URL schemes. It works by assuming any URL ending with a / is a directory, and every other URL a file. URLs are tested using HEAD to see whether they cause a redirect to one ending with a /.
HTML Directory pages are parsed to find all links within them that lead to deeper URLs; this way it isn't tied to the Apache directory listing format and can actually walk arbitrary trees.
Method | open | Open the HTTP connection. |
Method | close | Close the HTTP connection. |
Method | request | Make an HTTP request. |
Method | isDirectory | Return whether the path is a directory. |
Method | list | Download the HTML index at subdir and scrape for URLs. |
Inherited from WalkerBase:
Method | __init__ | Undocumented |
Method | walk | Walk through the URL. |
Assumes any path ending in a slash is a directory, and any that redirects to a location ending in a slash is also a directory.