The robotparser.py module currently lives in Tools/webchecker. In
preparation for its migration to Lib, I made the following changes:
* renamed the test() function to _test()
* corrected the URLs in _test() so they refer to actual documents
* added an "if __name__ == '__main__'" catcher to invoke _test()
when run as a main program
* added doc strings for the two main methods, parse and can_fetch
* replaced usage of regsub and regex with corresponding re code
| Name |
|---|
| README |
| robotparser.py |
| tktools.py |
| wcgui.py |
| wcmac.py |
| webchecker.py |
| websucker.py |
| wsgui.py |
Webchecker
----------

This is a simple web tree checker, useful to find bad links in a web
tree.  It currently checks links pointing within the same subweb for
validity.  The main program is "webchecker.py".  See its doc string
(or invoke it with the option "-?") for more details.

History:

- Jan 1997.  First release.  The module robotparser.py was written by
Skip Montanaro; the rest is original work by Guido van Rossum.

- May 1999.  Sam Bayer contributed a new version, wcnew.py, which
supports checking internal links (#spam fragments in URLs) and some
other options.

- Nov 1999.  Sam Bayer contributed patches to reintegrate wcnew.py
into webchecker.py, and corresponding mods to wcgui.py and
websucker.py.