Mirror of https://github.com/python/cpython.git, synced 2025-12-31 12:33:28 +00:00

The unknown-8bit trick was designed to deal with unknown bytes in an ASCII message, and it works fine for that. However, I also tried to extend it to handle bytes that can't be decoded using the charset specified in an encoded word, and there it fails because there can be other non-ASCII characters that were *successfully* decoded. The fix is simple: do the unknown-8bit encoding using the utf-8 codec. This is especially appropriate since anyone trying to do recovery on an unknown byte string will probably attempt utf-8 first.

| | | |
|---|---|---|
| data | ||
| __init__.py | ||
| __main__.py | ||
| test__encoded_words.py | ||
| test__header_value_parser.py | ||
| test_asian_codecs.py | ||
| test_contentmanager.py | ||
| test_defect_handling.py | ||
| test_email.py | ||
| test_generator.py | ||
| test_headerregistry.py | ||
| test_inversion.py | ||
| test_message.py | ||
| test_parser.py | ||
| test_pickleable.py | ||
| test_policy.py | ||
| test_utils.py | ||
| torture_test.py | ||
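
The failure mode described in the commit message can be reproduced with Python's standard `surrogateescape` error handler alone. The following is a minimal sketch, not the email package's actual code: the sample byte strings and variable names are made up for illustration, and the declared charset of the encoded word is assumed to be utf-8 so that the round trip is easy to check.

```python
# Illustrative sketch only: the byte strings and names below are hypothetical.

# Case 1: unknown bytes in an otherwise ASCII message.
# Each undecodable byte becomes a lone surrogate (0xff -> U+DCFF),
# and the ascii codec can turn it back into the original byte.
raw_ascii = b"abc\xff"
text = raw_ascii.decode("ascii", "surrogateescape")
assert text.encode("ascii", "surrogateescape") == raw_ascii

# Case 2: a payload whose declared charset (assumed utf-8 here) decodes
# some non-ASCII bytes successfully but leaves one byte undecodable.
raw = b"caf\xc3\xa9 \xff"
mixed = raw.decode("utf-8", "surrogateescape")  # real 'é' plus one escaped byte

# The ascii codec can restore the escaped byte but not the real 'é',
# so re-encoding through ascii fails.
try:
    mixed.encode("ascii", "surrogateescape")
    ascii_ok = True
except UnicodeEncodeError:
    ascii_ok = False

# The utf-8 codec encodes 'é' normally and restores the escaped byte.
recovered = mixed.encode("utf-8", "surrogateescape")
```

Here `ascii_ok` ends up `False` while `recovered` equals the original `raw` bytes. With a declared charset other than utf-8, the utf-8 re-encoding would still succeed, but the result would not necessarily match the original byte string.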