cpython

mirror of https://github.com/python/cpython.git synced 2025-12-08 06:10:17 +00:00

Author	SHA1	Message	Date
Pablo Galindo Salgado	572c780aa8	gh-138122: Implement frame caching in RemoteUnwinder to reduce memory reads (#142137 ) This PR implements frame caching in the RemoteUnwinder class to significantly reduce memory reads when profiling remote processes with deep call stacks. When cache_frames=True, the unwinder stores the frame chain from each sample and reuses unchanged portions in subsequent samples. Since most profiling samples capture similar call stacks (especially the parent frames), this optimization avoids repeatedly reading the same frame data from the target process. The implementation adds a last_profiled_frame field to the thread state that tracks where the previous sample stopped. On the next sample, if the current frame chain reaches this marker, the cached frames from that point onward are reused instead of being re-read from remote memory. The sampling profiler now enables frame caching by default.	2025-12-06 22:37:34 +00:00
Ivo Bellin Salarin	9d707d8a64	gh-68552: fix defects policy (#138579 ) Extend defect handling via policy to a couple of missed defects. --------- Co-authored-by: Martin Panter <vadmium@users.noreply.github.com> Co-authored-by: Ivo Bellin Salarin <ivo@nilleb.com>	2025-12-06 16:54:29 -05:00
Pablo Galindo Salgado	ed4f78a4b3	gh-142236: Fix incorrect keyword suggestions for syntax errors (#142328 ) The keyword typo suggestion mechanism in traceback would incorrectly suggest replacements when the extracted source code was merely incomplete rather than containing an actual typo. For example, when a missing comma caused a syntax error, the system would suggest replacing 'print' with 'not' because the incomplete code snippet happened to pass validation. The fix adds a validation step that first checks whether the original extracted code raises a SyntaxError. If the code compiles successfully or is simply incomplete (compile_command returns None), the function returns early since there is no way to verify that a keyword replacement would actually fix the problem.	2025-12-06 21:09:35 +00:00
Paresh Joshi	07eff899d8	gh-142006: Fix HeaderWriteError in email.policy.default caused by extra newline (#142008 ) RDM: This fixes a subtle folding error that showed up when a token exactly filled a line and was followed by whitespace and a token with no folding whitespace that was longer than a line. In this particular circumstance the whitespace after the first token got pushed on to the next line, and then stolen to go in front of the next unfoldable token...leaving a completely empty line in the line buffer. That line got turned in to a newline, which is RFC illegal, and the newish security check caught it. The fix is to just delete that empty line from the buffer. Co-authored-by: blurb-it[bot] <43283697+blurb-it[bot]@users.noreply.github.com>	2025-12-06 15:59:35 -05:00
Sanyam Khurana	100e316e53	gh-69113: Fix doctest to report line numbers for __test__ strings (#141624 ) Enhanced the _find_lineno method in doctest to correctly identify and report line numbers for doctests defined in __test__ dictionaries when formatted as triple-quoted strings. Finds a non-blank line in the test string and matches it in the source file, verifying subsequent lines also match to handle duplicate lines. Previously, doctest would report "line None" for __test__ dictionary strings, making it difficult to debug failing tests. Co-authored-by: Jurjen N.E. Bos <jneb@users.sourceforge.net> Co-authored-by: R. David Murray <rdmurray@bitdance.com>	2025-12-06 15:47:08 -05:00
ivonastojanovic	c91c373ef6	gh-140677 Improve heatmap colors (#142241 ) Co-authored-by: Pablo Galindo Salgado <pablogsal@gmail.com>	2025-12-06 20:27:16 +00:00
Kaisheng Xu	14715e3a64	gh-105836: Fix `asyncio.run_coroutine_threadsafe` leaving underlying cancelled asyncio task running (#141696 ) Co-authored-by: Kumar Aditya <kumaraditya@python.org>	2025-12-06 19:33:25 +00:00
Savannah Ostrowski	56a442d0d8	GH-141565: Add async code awareness to Tachyon (#141533 ) Co-authored-by: Pablo Galindo Salgado <pablogsal@gmail.com>	2025-12-06 19:31:40 +00:00
Savannah Ostrowski	0ed56ed88f	GH-64532: Include parent's required optional arguments in subparser usage (#142355 )	2025-12-06 18:30:50 +00:00
Serhiy Storchaka	70c27ce94b	gh-142332: Fix usage formatting for positional arguments in mutually exclusive groups in argparse (GH-142333)	2025-12-06 18:03:45 +00:00
Savannah Ostrowski	5be3405e4e	GH-75949: Fix argparse dropping '\|' in mutually exclusive groups on line wrap (#142312 )	2025-12-06 15:12:21 +00:00
Y. Z. Chen	61823a5382	Docs: fix RFC index reference for TLS 1.3 (#142262 )	2025-12-06 14:05:20 +01:00
Stan Ulbrych	dcac498e50	gh-142318: Fix typing `'q'` at interactive help screen exiting Tachyon (#142319 )	2025-12-05 19:36:28 +00:00
Serhiy Storchaka	59f247e43b	gh-115952: Fix a potential virtual memory allocation denial of service in pickle (GH-119204) Loading a small data which does not even involve arbitrary code execution could consume arbitrary large amount of memory. There were three issues: * PUT and LONG_BINPUT with large argument (the C implementation only). Since the memo is implemented in C as a continuous dynamic array, a single opcode can cause its resizing to arbitrary size. Now the sparsity of memo indices is limited. * BINBYTES, BINBYTES8 and BYTEARRAY8 with large argument. They allocated the bytes or bytearray object of the specified size before reading into it. Now they read very large data by chunks. * BINSTRING, BINUNICODE, LONG4, BINUNICODE8 and FRAME with large argument. They read the whole data by calling the read() method of the underlying file object, which usually allocates the bytes object of the specified size before reading into it. Now they read very large data by chunks. Also add comprehensive benchmark suite to measure performance and memory impact of chunked reading optimization in PR #119204. Features: - Normal mode: benchmarks legitimate pickles (time/memory metrics) - Antagonistic mode: tests malicious pickles (DoS protection) - Baseline comparison: side-by-side comparison of two Python builds - Support for truncated data and sparse memo attack vectors Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> Co-authored-by: Gregory P. Smith <greg@krypto.org>	2025-12-05 19:17:01 +02:00
Serhiy Storchaka	100c726d98	Add explanation comments for tests for overlapped ZIP entries (GH-137152)	2025-12-05 18:09:20 +02:00
Serhiy Storchaka	706fdda8b3	gh-141370: Fix undefined behavior when using Py_ABS() (GH-141548) Co-authored-by: Sergey B Kirpichev <skirpichev@gmail.com>	2025-12-05 16:24:35 +02:00
Sanyam Khurana	4238a975d7	gh-48752: Add readline.get_pre_input_hook() function (#141586 ) Add readline.get_pre_input_hook() to retrieve the current pre-input hook. This allows applications to save and restore the hook without overwriting user settings.	2025-12-05 13:18:54 +01:00
Jelle Zijlstra	53ec7c8fc0	gh-142214: Fix two regressions in dataclasses (#142223 )	2025-12-04 20:04:42 -08:00
Ken Jin	b3bf212898	gh-141976: Check stack bounds in JIT optimizer (GH-142201)	2025-12-04 20:28:08 +00:00
Kir Chou	8392095bf9	gh-129483: Make `TestLocalTimeDisambiguation`'s time format locale independent (#142193 ) * Change to update %c to the exact time format. --------- Co-authored-by: Kir Chou <note351@hotmail.com>	2025-12-04 08:32:23 -05:00
Sam Gross	547d8daf78	gh-142218: Fix split table dictionary crash (gh-142229) This fixes a regression introduced in gh-140558. The interpreter would crash if we inserted a non `str` key into a split table that matches an existing key.	2025-12-03 18:37:35 -05:00
Sam Gross	c0c65141b3	gh-140482: Avoid changing terminal settings in test_pty (gh-142202) The previous test_spawn_doesnt_hang test had a few problems: * It would cause ENV CHANGED failures if other tests were running concurrently due to stty changes * Typing while the test was running could cause it to fail	2025-12-03 15:48:44 -05:00
Mark Shannon	62423c9c36	GH-141794: Limit size of generated machine code. (GH-142228) * Factor out bodies of the largest uops, to reduce jit code size. * Factor out common assert, also reducing jit code size. * Limit size of jitted code for a single executor to 1MB.	2025-12-03 17:43:35 +00:00
Petr Viktorin	4172644d78	gh-142206: multiprocessing.resource_tracker: Decode messages using older protocol (GH-142215)	2025-12-03 12:59:14 +00:00
Seth Michael Larson	08d8e18ad8	gh-142145: Remove quadratic behavior in node ID cache clearing (GH-142146) * Remove quadratic behavior in node ID cache clearing Co-authored-by: Jacob Walls <38668450+jacobtylerwalls@users.noreply.github.com> * Add news fragment --------- Co-authored-by: Jacob Walls <38668450+jacobtylerwalls@users.noreply.github.com>	2025-12-02 23:16:37 -08:00
Pablo Galindo Salgado	8801c6dec7	gh-140677 Add heatmap visualization to Tachyon sampling profiler (#140680 ) Co-authored-by: Ivona Stojanovic <stojanovic.i@hotmail.com>	2025-12-02 20:33:40 +00:00
LloydZ	fddc24e4c8	gh-141982: Fix pdb can't set breakpoints on async functions (#141983 ) Co-authored-by: Tian Gao <gaogaotiantian@hotmail.com>	2025-12-01 23:40:02 -08:00
LloydZ	5e58548ebe	gh-59000: Fix pdb breakpoint resolution for class methods when module not imported (#141949 )	2025-12-01 20:41:54 -08:00
László Kiss Kollár	f87eb4d7cd	gh-138122: New Tachyon UI (#142116 ) Co-authored-by: Pablo Galindo Salgado <pablogsal@gmail.com>	2025-12-01 17:34:14 +00:00
Serhiy Storchaka	694922cf40	gh-119342: Fix a potential denial of service in plistlib (GH-119343) Reading a specially prepared small Plist file could cause OOM because file's read(n) preallocates a bytes object for reading the specified amount of data. Now plistlib reads large data by chunks, therefore the upper limit of consumed memory is proportional to the size of the input file.	2025-12-01 17:28:15 +02:00
Serhiy Storchaka	5a4c4a033a	gh-119451: Fix a potential denial of service in http.client (GH-119454) Reading the whole body of the HTTP response could cause OOM if the Content-Length value is too large even if the server does not send a large amount of data. Now the HTTP client reads large data by chunks, therefore the amount of consumed memory is proportional to the amount of sent data.	2025-12-01 17:26:07 +02:00
Stan Ulbrych	d4fa70706c	gh-139707: Add mechanism for distributors to supply error messages for missing stdlib modules (GH-140783)	2025-12-01 14:36:17 +01:00
yihong	056d6c5ed9	gh-141999: Handle KeyboardInterrupt when sampling in the new tachyon profiler (#142000 )	2025-11-30 02:49:13 +00:00
Pablo Galindo Salgado	ea51e745c7	gh-138122: Add thread status statistics to flamegraph profiler (#141900 ) Co-authored-by: ivonastojanovic <80911834+ivonastojanovic@users.noreply.github.com>	2025-11-30 01:42:39 +00:00
Victor Stinner	5e749d3743	Fix multiprocessing queue test_get() (GH-142024) * Replace sleep() with support.sleeping_retry(). * Test get_nowait() first. * Restore previously disabled test. Fix the failure: FAIL: test_get (test.test_multiprocessing_spawn.test_processes.WithProcessesTestQueue.test_get) ---------------------------------------------------------------------- Traceback (most recent call last): File "Lib/test/_test_multiprocessing.py", line 1208, in test_get self.assertEqual(queue_empty(queue), False) ~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^ AssertionError: True != False	2025-11-28 23:00:14 -08:00
Gregory P. Smith	5b1862bdd8	gh-87512: Fix `subprocess` using `timeout=` on Windows blocking with a large `input=` (GH-142058) On Windows, Popen._communicate() previously wrote to stdin synchronously, which could block indefinitely if the subprocess didn't consume input= quickly and the pipe buffer filled up. The timeout= parameter was only checked when joining the reader threads, not during the stdin write. This change moves the Windows stdin writing to a background thread (similar to how stdout/stderr are read in threads), allowing the timeout to be properly enforced. If timeout expires, TimeoutExpired is raised promptly and the writer thread continues in the background. Subsequent calls to communicate() will join the existing writer thread. Adds test_communicate_timeout_large_input to verify that TimeoutExpired is raised promptly when communicate() is called with large input and a timeout, even when the subprocess doesn't consume stdin quickly. This test already passed on POSIX (where select() is used) but failed on Windows where the stdin write blocks without checking the timeout. Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>	2025-11-28 22:07:03 -08:00
Gregory P. Smith	923056b2d4	gh-74389: gh-70560: subprocess.Popen.communicate() now ignores stdin.flush error when closed (GH-142061) gh-70560: gh-74389: subprocess.Popen.communicate() now ignores stdin.flush error when closed with a unittest and news entry.	2025-11-29 05:03:06 +00:00
Gregory P. Smith	cc6bc4c97f	GH-134453: Fix subprocess memoryview input handling on POSIX (GH-134949) Fix inconsistent subprocess.Popen.communicate() behavior between Windows and POSIX when using memoryview objects with non-byte elements as input. On POSIX systems, the code was incorrectly comparing bytes written against element count instead of byte count, causing data truncation for large inputs with non-byte element types. Changes: - Cast memoryview inputs to byte view when input is already a memoryview - Fix progress tracking to use len(input_view) instead of len(self._input) - Add comprehensive test coverage for memoryview inputs 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * old-man-yells-at-ReST * Update 2025-05-30-18-37-44.gh-issue-134453.kxkA-o.rst * assertIsNone review feedback * fix memoryview_nonbytes test to fail without our fix on main, and have a nicer error. Thanks to Peter Bierma @ZeroIntensity for the code review.	2025-11-29 04:25:06 +00:00
Artur Jamro	526d7a8bb4	gh-141473: Fix subprocess.Popen.communicate to send input to stdin upon a subsequent post-timeout call (GH-141477) * gh-141473: Fix subprocess.Popen.communicate to send input to stdin * Docs: Clarify that `input` is one time only on `communicate()` * NEWS entry * Add a regression test. --------- Co-authored-by: Gregory P. Smith <greg@krypto.org>	2025-11-28 18:04:52 -08:00
Stefano Rivera	656a64b37f	gh-141930: Use the regular IO stack to write .pyc files for a better error message on failure (GH-141931) * Use open() to write the bytecode * Convert to unittest style asserts * Tweak news, thanks @vstinner * Tidy * reword NEWS, avoid word "retried"	2025-11-27 19:17:59 +00:00
Miro Hrončok	69f54ce452	gh-140210: Make test_sysconfig.test_parse_makefile_renamed_vars ignore environment variables (#140213 ) The test did not expect it could be run with e.g. CFLAGS set to a custom value.	2025-11-27 10:00:02 -08:00
Alper	bc9e63dd9d	gh-116738: Fix thread-safety issue in re module for free threading (gh-141923) Added atomic operations to `scanner_begin()` and `scanner_end()` to prevent race conditions on the `executing` flag in free-threaded builds. Also added tests for concurrent usage of the `re` module. Without the atomic operations, `test_scanner_concurrent_access()` triggers `assert(self->executing)` failures, or a thread sanitizer run emits errors.	2025-11-26 15:40:45 -05:00
Sergey Miryanov	2ea67caf31	GH-141861: Fix TRACE_RECORD if full (GH-141959)	2025-11-26 14:32:30 +00:00
Itamar Oren	27f62eb711	gh-140011: Delete importdl assertion that prevents importing embedded modules from packages (GH-141605)	2025-11-26 14:12:49 +01:00
Petr Viktorin	226011ba12	gh-139165: Make Py_SIZE, Py_IS_TYPE,Py_ SET_SIZE regular functions in stable ABI (GH-139166) * Make Py_{SIZE,IS_TYPE,SET_SIZE} regular functions in stable ABI Group them together with Py_TYPE & Py_SET_TYPE to cut down on repetitive preprocessor macros. Format repetitive definitions in object.c more concisely. Py_SET_TYPE is still left out of the Limited API.	2025-11-25 14:30:33 +01:00
Krishna Chaitanya	e6174ee981	gh-140911: Ensure that UserString.index() and UserString.rindex() accept UserString as argument (GH-140945)	2025-11-25 15:25:46 +02:00
Paresh Joshi	da1d468bea	gh-141781: Fix pdb.line_prefix binding (#141779 )	2025-11-24 18:45:16 -08:00
Sergey Miryanov	dc62b62252	GH-141861: Fix invalid memory read in the ENTER_EXECUTOR (GH-141921)	2025-11-24 22:07:45 +00:00
SubbaraoGarlapati	369ce2b139	Fix implicit import in `test_monitoring.py` (gh-141795)	2025-11-24 14:48:28 -05:00
Christian Marangi	fee7782650	gh-141907: Better handle support for SHA3 for test_hashlib (GH-141908) * test_hashlib: better handle support for SHA3 It's possible that the SSL library supports only SHA3 algo and doesn't have SHAKE one. The current test wrongly detect this and set both HASH and HASHXOF to None expecting to have the extra SHA3 attributes present but this should only be true for SHAKE algo. To better handle this, move the HASH condition to a dedicated try-expect condition and check if HASHXOF is None in the relevant code effectively checking if SHA3 is supported by the SSL library but SHAKE algo needs to use the sha3module one. Signed-off-by: Christian Marangi <ansuelsmth@gmail.com> * rework the conditional import for all its attrs --------- Signed-off-by: Christian Marangi <ansuelsmth@gmail.com> Co-authored-by: Gregory P. Smith <greg@krypto.org>	2025-11-24 17:35:58 +00:00

1 2 3 4 5 ...

34674 commits