cpython/Lib
R. David Murray b579dba119 #1486713: Add a tolerant mode to HTMLParser.
The motivation for adding this option is that the the functionality it
provides used to be provided by sgmllib in Python2, and was used by,
for example, BeautifulSoup.  Without this option, the Python3 version
of BeautifulSoup and the many programs that use it are crippled.

The original patch was by 'kxroberto'.  I modified it heavily but kept his
heuristics and test.  I also added additional heuristics to fix #975556,
#1046092, and part of #6191.  This patch should be completely backward
compatible:  the behavior with the default strict=True is unchanged.
2010-12-03 04:06:39 +00:00
..
concurrent Removes an inefficient spin loop in as_completed 2010-11-17 11:06:29 +00:00
ctypes #9424: Replace deprecated assert* methods in the Python test suite. 2010-11-20 19:04:17 +00:00
curses #8861: remove unused variable. 2010-08-02 19:44:48 +00:00
dbm #9397: remove mention of dbm.bsd which does not exist anymore. 2010-07-29 16:26:56 +00:00
distutils sys.abiflags is not defined on all platforms. 2010-11-25 03:46:44 +00:00
email #8989: add 'domain' keyword to make_msgid. 2010-12-02 21:47:19 +00:00
encodings #7475: add (un)transform method to bytes/bytearray and str, add back codecs that can be used with them from Python 2. 2010-12-02 18:06:51 +00:00
html #1486713: Add a tolerant mode to HTMLParser. 2010-12-03 04:06:39 +00:00
http Fix issue3709 - BaseHTTPRequestHandler will buffer the headers and write only on end_headers call. 2010-11-21 14:36:14 +00:00
idlelib Issue 9222 Fix filetypes for open dialog 2010-11-23 06:01:31 +00:00
importlib #9424: Replace deprecated assert* methods in the Python test suite. 2010-11-20 19:04:17 +00:00
json Issue #10572: Moved json tests to Lib/test/json_tests. 2010-11-30 03:03:30 +00:00
lib2to3 revert changes in inappropiate branch 2010-11-20 19:09:08 +00:00
logging Logging: added stack_info argument. 2010-11-14 21:33:04 +00:00
msilib remove pointless coding cookies 2010-08-30 14:05:50 +00:00
multiprocessing Issue #9244: multiprocessing.pool: Worker crashes if result can't be encoded 2010-11-09 20:55:52 +00:00
plat-aix4
plat-darwin These are present in the trunk but no longer in 3.x, which causes problems 2009-03-04 23:21:07 +00:00
plat-freebsd4 Merged revisions 78644 via svnmerge from 2010-03-04 02:11:41 +00:00
plat-freebsd5 Merged revisions 78644 via svnmerge from 2010-03-04 02:11:41 +00:00
plat-freebsd6 Merged revisions 78644 via svnmerge from 2010-03-04 02:11:41 +00:00
plat-freebsd7 Merged revisions 78644 via svnmerge from 2010-03-04 02:11:41 +00:00
plat-freebsd8 Merged revisions 78644 via svnmerge from 2010-03-04 02:11:41 +00:00
plat-generic
plat-linux2
plat-netbsd1
plat-next3
plat-os2emx Merged revisions 78644 via svnmerge from 2010-03-04 02:11:41 +00:00
plat-sunos5 Merged revisions 78678,78680,78682 via svnmerge from 2010-03-05 03:33:11 +00:00
plat-unixware7
pydoc_data Update pydoc topics. 2010-11-13 06:36:58 +00:00
site-packages
sqlite3 #9424: Replace deprecated assert* methods in the Python test suite. 2010-11-20 19:04:17 +00:00
test #1486713: Add a tolerant mode to HTMLParser. 2010-12-03 04:06:39 +00:00
tkinter Issue #6878: Fixed return type of tkinter methods documented to return lists. 2010-11-22 19:40:51 +00:00
turtledemo Translated German message into English. 2010-11-01 18:42:01 +00:00
unittest Issue 7911: unittest.TestCase.longMessage defaults to True for improved failure messages by default 2010-12-03 00:53:09 +00:00
urllib Fix spelling of Jamie Zawinski's surname in urllib.parse docstring (issue 10606) 2010-12-02 16:41:00 +00:00
wsgiref Implement http://bugs.python.org/issue10155 using And Clover's patch, w/added 2010-11-03 22:39:01 +00:00
xml Fix Issue10205 - XML QName error when different tags have same QName. 2010-11-09 02:36:59 +00:00
xmlrpc Code Changes as per review comments by Antoine Pitrou. 2010-11-18 17:08:48 +00:00
__future__.py barry has already been causing test breakage 2009-04-01 20:38:13 +00:00
__phello__.foo.py
_abcoll.py Issue #10565: Iterator ABC should require both __next__ and __iter__. 2010-11-29 03:56:12 +00:00
_compat_pickle.py Issue #6137: The pickle module now translates module names when loading 2009-06-04 20:32:06 +00:00
_dummy_thread.py Fix bootstrap issues when building without threads 2010-11-05 19:58:28 +00:00
_markupbase.py Take advantage of the frozenset constant optimization. 2010-09-05 23:15:06 +00:00
_pyio.py Issue #10180: Pickling file objects is now explicitly forbidden, since 2010-11-05 21:15:39 +00:00
_strptime.py PEP 8 conformance: class_ -> cls 2010-07-14 13:46:57 +00:00
_threading_local.py Issue #9707: Rewritten reference implementation of threading.local which 2010-09-07 22:06:17 +00:00
_weakrefset.py Issue #7105: Make WeakKeyDictionary and WeakValueDictionary robust against 2010-01-08 17:54:23 +00:00
abc.py add support for abstract class and static methods #5867 2010-08-17 00:52:52 +00:00
aifc.py Merged revisions 72425-72426 via svnmerge from 2009-05-07 18:24:38 +00:00
antigravity.py Another hidden treasure. 2009-06-03 17:21:17 +00:00
argparse.py Fix bug 9340 - argparse parse_known_args didn't work with subparsers 2010-11-02 12:47:22 +00:00
ast.py Fix handling on negative numbers in ast.literal_eval(). 2010-10-08 00:47:45 +00:00
asynchat.py Fixed bugs 760475, 953599, and 1519. This is a translation of changelist 64768 2008-07-07 05:04:12 +00:00
asyncore.py Fix issue 6706: adds new handle_accepted() method to asyncore.dispatcher 2010-10-04 21:08:36 +00:00
base64.py #1466065: add validate option to base64.b64decode 2010-11-11 20:09:20 +00:00
bdb.py Use booleans where applicable. 2010-11-29 20:19:15 +00:00
binhex.py Issues #2715, #6621: Remove unused and undefined references to FSSpec 2009-08-02 20:08:54 +00:00
bisect.py Per the language summit, the optional fastpath imports should use from-import-star. 2009-03-31 17:51:51 +00:00
calendar.py #10092: Properly reset locale in Locale*Calendar classes. The context manager was buggy because setlocale() returns the *new* locale, not the old. Also add a test for this. 2010-10-19 18:54:25 +00:00
cgi.py #2830: add html.escape() helper and move cgi.escape() uses in the standard library to it. It defaults to quote=True and also escapes single quotes, which makes casual use safer. The cgi.escape() interface is not touched, but emits a (silent) PendingDeprecationWarning. 2010-10-15 15:57:45 +00:00
cgitb.py Merged revisions 70980,71059,71225,71234,71241,71243,71249,71251,71255,71266,71299,71329,71397-71398,71486 via svnmerge from 2009-04-11 19:48:14 +00:00
chunk.py
cmd.py Add docstring to cmd.Cmd.do_help() 2010-09-09 03:53:22 +00:00
code.py Merged revisions 75264,75268,75293,75318,75391-75392,75436,75478,75971,76003,76058,76140-76141,76231,76380,76428-76429 via svnmerge from 2009-11-25 17:46:26 +00:00
codecs.py #7475: add (un)transform method to bytes/bytearray and str, add back codecs that can be used with them from Python 2. 2010-12-02 18:06:51 +00:00
codeop.py correct porting error with raise #7161 2009-10-18 00:32:49 +00:00
collections.py Issue 9865: add __sizeof__ to OrderedDict. 2010-09-16 19:10:17 +00:00
colorsys.py Merged revisions 68884,68973,68978,69003,69083,69112-69113 via svnmerge from 2009-01-30 03:39:35 +00:00
compileall.py Fix typos and style in compileall. 2010-11-20 21:53:02 +00:00
configparser.py configparser: read-only attributes to get the section name and parser from a SectionProxy instance 2010-11-21 13:56:42 +00:00
contextlib.py Issue 9110. Adding ContextDecorator to contextlib. This enables the creation of APIs that act as decorators as well as context managers. contextlib.contextmanager changed to use ContextDecorator. 2010-06-30 12:17:50 +00:00
copy.py Fix running the copy module from the command-line (however use{ful,less} it may be). 2010-09-04 17:46:44 +00:00
copyreg.py Rename copy_reg module to copyreg. 2008-05-11 08:55:36 +00:00
cProfile.py #9428: fix running scripts from profile/cProfile with their own name and the right namespace. Same fix as for trace.py in #1690103. 2010-08-02 12:20:23 +00:00
csv.py #5975: add unix_dialect to csv module. 2010-10-27 07:27:06 +00:00
datetime.py Issue 9183: Intern UTC timezone. 2010-10-14 17:03:51 +00:00
decimal.py Add link to specification. 2010-11-30 20:32:59 +00:00
difflib.py Issue 2986: Add autojunk paramater to SequenceMatcher to turn off heuristic. Patch by Terry Reedy, Eli Bendersky, and Simon Cross 2010-11-25 06:12:34 +00:00
dis.py As per python-dev discussion with Eli, properly document and publish dis.show_code 2010-09-10 14:08:04 +00:00
doctest.py More proper closing of files 2010-10-14 22:11:44 +00:00
dummy_threading.py Rename thread to _thread and dummy_thread to _dummy_thread. Issue #2875. 2008-05-25 13:05:15 +00:00
filecmp.py Use new form of with-statement instead of contextlib.nested(). 2009-06-04 00:11:54 +00:00
fileinput.py #1286: allow using fileinput.FileInput as context manager. 2010-07-31 20:08:15 +00:00
fnmatch.py Re-apply r83871. 2010-08-13 16:26:40 +00:00
formatter.py Fix formatter usage of filter(). Bug #3800. 2008-09-09 19:31:25 +00:00
fractions.py Make Fraction(-1).__hash__() return -2 rather than -1 (see issue 10356). 2010-11-13 10:27:38 +00:00
ftplib.py fix ftplib resource warnings 2010-10-31 19:58:07 +00:00
functools.py Issue 10593: Adopt Nick's suggestion for an lru_cache with maxsize=None. 2010-12-01 03:45:41 +00:00
genericpath.py Merged revisions 78018,78035-78040,78042-78043,78046,78048-78052,78054,78059,78075-78080 via svnmerge from 2010-03-14 10:23:39 +00:00
getopt.py remove pointless coding cookies 2010-08-30 14:05:50 +00:00
getpass.py Merged revisions 76000,76016 via svnmerge from 2009-11-01 18:42:17 +00:00
gettext.py Fix interaction of custom translation classes and caching (#9042) 2010-10-04 23:52:37 +00:00
glob.py Merged revisions 70712,70714,70764-70765,70769-70771,70773,70776-70777,70788-70789,70824,70828,70832,70836,70842,70851,70855,70857,70866-70872,70883,70885,70893-70894,70896-70897,70903,70905-70907,70915,70927,70933,70951,70960,70962-70964,70998,71001,71006,71008,71010-71011,71019,71037,71056,71094,71101-71103,71106,71119,71123,71149-71150,71203,71212,71214-71217,71221,71240 via svnmerge from 2009-04-05 19:13:16 +00:00
gzip.py #10465: fix broken delegation in __getattr__ of _PaddedFile. 2010-11-20 11:25:01 +00:00
hashlib.py hashlib has two new constant attributes: algorithms_guaranteed and 2010-09-06 08:30:23 +00:00
heapq.py Remove coding cookie from heapq.py. 2010-07-04 19:23:49 +00:00
hmac.py speed hack can be implemented more cleanly by directly calling __new__ 2010-08-21 02:45:15 +00:00
imaplib.py Issue #10429: IMAP.starttls() stored the capabilities as bytes objects, 2010-11-16 17:55:26 +00:00
imghdr.py Merged revisions 65713 via svnmerge from 2008-08-16 16:48:16 +00:00
inspect.py Issue 10220: switch to using string constants rather than integers for inspect.getgeneratorstate() return values and make debugging friendly str() and repr() for generator states a requirement in the test suite 2010-11-30 06:36:04 +00:00
io.py add myself 2009-05-01 20:45:43 +00:00
keyword.py Use a context manager for some file objects. 2010-09-03 19:52:03 +00:00
linecache.py Issue #10335: Add tokenize.open(), detect the file encoding using 2010-11-09 01:08:59 +00:00
locale.py Merged revisions 80512 via svnmerge from 2010-04-27 02:45:53 +00:00
macpath.py Issue #767645: Set os.path.supports_unicode_filenames to True on Mac OS X 2010-09-13 19:41:36 +00:00
macurl2path.py Fixing Issue6711 - macurl2path has typos that raise AttributeError 2009-08-16 06:43:50 +00:00
mailbox.py Fix test_mailbox by supporting context manager protocol for get_file() returns. 2010-10-30 14:33:28 +00:00
mailcap.py Issue #8496: make mailcap.lookup() always return a list, rather than an iterator. 2010-04-22 13:30:10 +00:00
mimetypes.py Fix r85774 editor fail. Had a dangling try and incorrect indent. 2010-10-21 14:45:01 +00:00
modulefinder.py Silence ResourceWarnings in modulefinder by using file context managers. 2010-10-30 00:26:48 +00:00
netrc.py #10464: fix netrc handling of lines with embedded '#" characters. 2010-12-02 02:58:07 +00:00
nntplib.py Make usenetrc False by default (the old behaviour of having it True by 2010-11-09 18:58:42 +00:00
ntpath.py Issue #5117: Case normalization was needed on ntpath.relpath(). And 2010-10-18 12:13:18 +00:00
nturl2path.py Fix for issue1153027, making Py3k changes similar to fix in issue918368. 2009-05-05 18:41:13 +00:00
numbers.py Issue 4998: restore utility of __slots__ on Fraction. 2009-02-12 17:58:36 +00:00
opcode.py #4617: Previously it was illegal to delete a name from the local 2010-09-10 21:39:53 +00:00
optparse.py Merged revisions 71540,71563 via svnmerge from 2009-04-27 16:41:41 +00:00
os.py Issue 9299 Add exist_ok parameter to os.makedirs to suppress 'File exists' exception. Patch by Ray Allen. 2010-12-02 07:05:56 +00:00
os2emxpath.py #9018: os.path.normcase() now raises a TypeError if the argument is not str or bytes. 2010-06-25 10:56:11 +00:00
pdb.py Use booleans where applicable. 2010-11-29 20:19:15 +00:00
pickle.py Issue #9410: Various optimizations to the pickle module, leading to 2010-09-09 18:33:21 +00:00
pickletools.py Syntax cleanup. 2010-07-28 16:39:41 +00:00
pipes.py Merged revisions 80030,80067,80069,80080-80081,80084,80432-80433,80465-80470,81059,81065-81067 via svnmerge from 2010-05-19 20:57:08 +00:00
pkgutil.py Have pkgutil properly close files. 2010-10-29 22:36:53 +00:00
platform.py Use a context manager for some file objects. 2010-09-03 19:52:03 +00:00
plistlib.py Merged revisions 80279 via svnmerge from 2010-04-20 21:00:34 +00:00
poplib.py fix issue #8807: adds a context parameter to POP3_SSL class. 2010-08-17 16:09:53 +00:00
posixpath.py Issue #5117: Case normalization was needed on ntpath.relpath(). And 2010-10-18 12:13:18 +00:00
pprint.py Have pprint() respect the order in an OrderedDict. 2010-09-09 12:31:00 +00:00
profile.py #9428: fix running scripts from profile/cProfile with their own name and the right namespace. Same fix as for trace.py in #1690103. 2010-08-02 12:20:23 +00:00
pstats.py Fix issue10377 - Output from pstats - it is just secs (i.e, wallclock time) and not CPU time. 2010-11-20 17:02:50 +00:00
pty.py Issue #3782: os.write() must not accept unicode strings 2008-09-15 23:02:56 +00:00
py_compile.py Issue #10335: Add tokenize.open(), detect the file encoding using 2010-11-09 01:08:59 +00:00
pyclbr.py Merged revisions 62425-62429,62434-62436,62441,62444,62446-62448,62450-62455,62463,62465-62466,62469,62474,62476-62478,62480,62485,62492,62497-62498,62500,62507,62513-62514,62516,62521,62531,62535,62545-62546,62548-62551,62553-62559,62569,62574,62577,62593,62595,62604-62606,62608,62616,62626-62627,62636,62638,62644-62645,62647-62648,62651-62653,62656,62661,62663,62680,62686-62687,62696,62699-62703,62711 via svnmerge from 2008-05-04 22:42:01 +00:00
pydoc.py #9724: add nonlocal to pydoc topics. 2010-11-20 19:54:36 +00:00
queue.py Issue 10110: Let Queue.put recognize a full queue when the maxsize parameter has been reduced. 2010-10-31 17:57:52 +00:00
quopri.py convert shebang lines: python -> python3 2010-03-11 22:53:45 +00:00
random.py Improve variable name (don't shadow a builtin). 2010-09-08 19:27:59 +00:00
re.py Adopt more descriptive attribute names as suggested on python-dev. 2010-09-04 22:46:06 +00:00
reprlib.py Fix bootstrap issues when building without threads 2010-11-05 19:58:28 +00:00
rlcompleter.py Merged revisions 65168 via svnmerge from 2008-07-21 16:32:10 +00:00
runpy.py account for the module __dict__ being cleared when its module is 2010-10-13 01:04:36 +00:00
sched.py Merged revisions 72932 via svnmerge from 2009-05-26 07:51:03 +00:00
shelve.py Merged revisions 78141-78142 via svnmerge from 2010-02-11 02:42:19 +00:00
shlex.py Recode modules from latin-1 to utf-8 2010-10-27 18:52:48 +00:00
shutil.py Merged revisions 80837 via svnmerge from 2010-05-05 22:43:04 +00:00
site.py Issue #6612: Fix site and sysconfig to catch os.getcwd() error, eg. if the 2010-10-12 22:23:23 +00:00
smtpd.py Fix issue 6706: adds new handle_accepted() method to asyncore.dispatcher 2010-10-04 21:08:36 +00:00
smtplib.py #10321: Add support for sending binary DATA and Message objects to smtplib 2010-11-08 17:15:13 +00:00
sndhdr.py Issue #9243: Fix sndhdr module and add unit tests, contributed by James Lee. 2010-07-13 23:04:56 +00:00
socket.py Issue #10093: ResourceWarnings are now issued when files and sockets are 2010-10-29 10:38:18 +00:00
socketserver.py Merged revisions 80484 via svnmerge from 2010-04-25 22:01:43 +00:00
sre_compile.py Issue #8990: array.fromstring() and array.tostring() get renamed to 2010-09-01 20:29:34 +00:00
sre_constants.py #2834: Change re module semantics, so that str and bytes mixing is forbidden, 2008-08-19 17:56:33 +00:00
sre_parse.py #6509: fix re.sub to work properly when the pattern, the string, and the replacement were all bytes. Patch by Antoine Pitrou. 2010-03-06 15:24:08 +00:00
ssl.py Issue #5639: Add a *server_hostname* argument to SSLContext.wrap_socket 2010-10-22 18:19:07 +00:00
stat.py Merged revisions 61644,61646-61647,61649-61652,61656-61658,61663,61665,61667 via svnmerge from 2008-03-20 10:49:03 +00:00
string.py #9418: first step of moving private string methods to _string module. 2010-10-14 07:04:07 +00:00
stringprep.py
struct.py Issue #8973: Add __all__ to struct module, so that help(struct) correctly 2010-06-12 15:17:02 +00:00
subprocess.py Fix #10554. Added context manager support to Popen objects. 2010-12-03 02:46:02 +00:00
sunau.py Issue #10265: Close file objects explicitly in sunau. Patch by Brian Brazil. 2010-10-31 21:27:04 +00:00
symbol.py convert shebang lines: python -> python3 2010-03-11 22:53:45 +00:00
symtable.py simplify this expression 2009-06-28 19:30:36 +00:00
sysconfig.py sys.abiflags may not be defined on all platforms. 2010-11-25 01:34:47 +00:00
tabnanny.py Issue #10335: Add tokenize.open(), detect the file encoding using 2010-11-09 01:08:59 +00:00
tarfile.py Issue #10184: Touch directories only once when extracting a tarfile. 2010-11-01 21:39:13 +00:00
telnetlib.py #7761: fix telnetlib.interact failures on Windows. 2010-10-26 12:42:24 +00:00
tempfile.py Simplify code 2010-11-09 03:43:58 +00:00
textwrap.py Merged revisions 74779-74786,74793,74795,74811,74860-74861,74863,74876,74886,74896,74901,74903,74908,74912,74930,74933,74943,74946,74952-74955,75015,75019,75032,75068,75076,75095,75098,75102,75129,75139,75230 via svnmerge from 2009-10-04 14:49:41 +00:00
this.py
threading.py Issue 10260 2010-11-18 12:46:39 +00:00
timeit.py #4810: document "--" option separator in timeit help. 2010-08-01 08:52:32 +00:00
token.py Issue #10386: Added __all__ to token module; this simplifies importing 2010-11-11 14:07:41 +00:00
tokenize.py Issue #10386: Added __all__ to token module; this simplifies importing 2010-11-11 14:07:41 +00:00
trace.py Issue #10371: Deprecated undocumented functions in the trace module. 2010-11-20 18:21:07 +00:00
traceback.py Issue #4486: When an exception has an explicit cause, do not print its implicit context too. 2009-11-28 16:12:28 +00:00
tty.py
turtle.py Issue #6878: Fixed return type of tkinter methods documented to return lists. 2010-11-22 19:40:51 +00:00
types.py Merged revisions 62194,62197-62198,62204-62205,62214,62219-62221,62227,62229-62231,62233-62235,62237-62239 via svnmerge from 2008-04-09 08:37:03 +00:00
uu.py Issue #10266: uu.decode didn't close in_file explicitly when it was given 2010-10-31 16:04:14 +00:00
uuid.py Shift the pipe-using code into an else block, then close the pipe 2010-11-05 14:48:35 +00:00
warnings.py Add a new warning gategory, ResourceWarning, as discussed on python-dev. It is silent by default, 2010-10-24 15:11:22 +00:00
wave.py #10198: fix duplicate header when writeframes() is called with an empty string. 2010-10-30 08:29:28 +00:00
weakref.py Issue #7105: Make WeakKeyDictionary and WeakValueDictionary robust against 2010-01-08 17:54:23 +00:00
webbrowser.py Remove conditional import of 'ic', that module 2010-05-30 15:46:48 +00:00
wsgiref.egg-info
xdrlib.py
zipfile.py zipfile: remove remaining ResourceWarnings 2010-11-23 00:15:02 +00:00