Move stdin writing to a background thread in _communicate_streams_windows
to avoid blocking indefinitely when writing large input to a pipeline
where the subprocess doesn't consume stdin quickly.
This mirrors the fix made to Popen._communicate() for Windows in
commit 5b1862b (gh-87512).
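As a hedged sketch (names here are illustrative, not the actual code),
the change has this shape:

    import subprocess
    import threading

    def _write_stdin_with_timeout(proc, data, timeout):
        # Perform the potentially blocking write on a background thread
        # so the timeout can be enforced while joining.
        def _writer():
            try:
                proc.stdin.write(data)
                proc.stdin.close()
            except BrokenPipeError:
                pass  # a pipeline stage exited before reading everything
        t = threading.Thread(target=_writer, daemon=True)
        t.start()
        t.join(timeout)
        if t.is_alive():
            # The writer keeps running in the background; a later call
            # can join it, mirroring Popen.communicate().
            raise subprocess.TimeoutExpired(proc.args, timeout)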
Add test_pipeline_timeout_large_input to verify that TimeoutExpired
is raised promptly when run_pipeline() is called with large input
and a timeout, even when the first process is slow to consume stdin.
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
Extract the core selector-based I/O loop into a new _communicate_io_posix()
function that is shared by both _communicate_streams_posix() (used by
run_pipeline) and Popen._communicate() (used by Popen.communicate).
The new function:
- Takes a pre-configured selector and output buffers
- Supports resume via input_offset parameter (for Popen timeout retry)
- Returns (new_offset, completed) instead of raising TimeoutExpired
- Does not close streams (caller decides based on use case)
This reduces code duplication and ensures both APIs use the same
well-tested I/O multiplexing logic.
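In outline, the shared loop looks roughly like this (a hedged sketch;
chunk sizes and error handling are simplified):

    import os
    import selectors
    import time

    def _communicate_io_posix(selector, input_view, input_offset,
                              output_buffers, deadline):
        # Drive the pre-registered selector until every stream finishes.
        # Per this commit: returns (new_offset, completed), never raises
        # TimeoutExpired, and never closes the registered streams itself.
        while selector.get_map():
            timeout = None if deadline is None else deadline - time.monotonic()
            if timeout is not None and timeout <= 0:
                return input_offset, False        # caller decides how to react
            for key, events in selector.select(timeout):
                fd = key.fileobj.fileno()
                if events & selectors.EVENT_WRITE:
                    chunk = input_view[input_offset:input_offset + 65536]
                    input_offset += os.write(fd, chunk)
                    if input_offset >= len(input_view):
                        selector.unregister(key.fileobj)  # caller closes stdin
                else:
                    data = os.read(fd, 32768)
                    if data:
                        output_buffers[key.fileobj] += data
                    else:
                        selector.unregister(key.fileobj)  # EOF on this stream
        return input_offset, True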
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
The comment suggested rewriting Popen._communicate() to use
non-blocking I/O on file objects now that Python 3's io module
is used instead of C stdio.
This is unnecessary - the current approach using select() to
detect ready fds followed by os.read()/os.write() is correct
and efficient. The selector already solves "when is data ready?"
so non-blocking mode would add complexity with no benefit.
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
Update the test to write 64KB to stderr from each process (128KB total)
instead of just small status messages. This better tests that the
multiplexed I/O handles concurrent large data on both stdout and stderr
without deadlocking.
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
Remove support for raw file descriptors in _communicate_streams(),
requiring all streams to be file objects. This simplifies both the
Windows and POSIX implementations by removing isinstance() checks
and fd-wrapping logic.
The run_pipeline() function now wraps the stderr pipe's read end
with os.fdopen() immediately after creation.
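For illustration (a hedged sketch; variable names are not from the
actual code):

    import os

    stderr_read_fd, stderr_write_fd = os.pipe()    # shared stderr pipe
    stderr_read = os.fdopen(stderr_read_fd, 'rb')  # file object, not a raw fd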
This change makes _communicate_streams() more compatible with
Popen.communicate() which already uses file objects, enabling
potential future refactoring to share the multiplexed I/O logic.
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
Add three tests that verify the multiplexed I/O implementation
properly handles large data volumes that would otherwise cause
pipe buffer deadlocks:
- test_pipeline_large_data_no_deadlock: 256KB through 2-stage pipeline
- test_pipeline_large_data_three_stages: 128KB through 3-stage pipeline
- test_pipeline_large_data_with_stderr: 64KB with concurrent stderr
These tests would timeout or deadlock without proper multiplexing.
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
Add _communicate_streams() helper function that properly multiplexes
read/write operations to prevent pipe buffer deadlocks. The helper
uses selectors on POSIX and threads on Windows, similar to
Popen.communicate().
This fixes potential deadlocks when large amounts of data flow through
the pipeline and significantly improves performance.
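Schematically (the two platform helpers are the ones named in the
commits above; signatures here are illustrative):

    import sys

    def _communicate_streams(streams, input_data, timeout):
        # selectors-based multiplexing on POSIX, reader/writer threads on
        # Windows, so no single blocking read or write can deadlock the
        # pipeline.
        if sys.platform == 'win32':
            return _communicate_streams_windows(streams, input_data, timeout)
        return _communicate_streams_posix(streams, input_data, timeout)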
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
Add a new run_pipeline() function to the subprocess module that enables
running multiple commands connected via pipes, similar to shell pipelines.
New API:
- run_pipeline(*commands, ...) - Run a pipeline of commands
- PipelineResult - Return type with commands, returncodes, stdout, stderr
- PipelineError - Raised when check=True and any command fails
Features:
- Supports an arbitrary number of commands (minimum 2)
- capture_output, input, timeout, and check parameters like run()
- stdin= connects to first process, stdout= connects to last process
- Text mode support via text=True, encoding, errors
- All processes share a single stderr pipe for simplicity
- "pipefail" semantics: check=True fails if any command fails
Unlike run(), this function does not accept universal_newlines.
Use text=True instead.
Example:
    result = subprocess.run_pipeline(
        ['cat', 'file.txt'],
        ['grep', 'pattern'],
        ['wc', '-l'],
        capture_output=True, text=True
    )
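With check=True, a failing command anywhere in the pipeline raises
PipelineError, per the "pipefail" semantics above (hedged sketch):

    try:
        subprocess.run_pipeline(['false'], ['cat'], check=True)
    except subprocess.PipelineError:
        pass  # 'false' exits nonzero, so check=True raises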
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
On Windows, Popen._communicate() previously wrote to stdin synchronously, which could block indefinitely if the subprocess didn't consume input= quickly and the pipe buffer filled up. The timeout= parameter was only checked when joining the reader threads, not during the stdin write.
This change moves the Windows stdin writing to a background thread (similar to how stdout/stderr are read in threads), allowing the timeout to be properly enforced. If timeout expires, TimeoutExpired is raised promptly and the writer thread continues in the background. Subsequent calls to communicate() will join the existing writer thread.
Adds test_communicate_timeout_large_input to verify that TimeoutExpired is raised promptly when communicate() is called with large input and a timeout, even when the subprocess doesn't consume stdin quickly.
This test already passed on POSIX (where select() is used) but failed on Windows where the stdin write blocks without checking the timeout.
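Roughly the behavior under test (a hedged example, not the actual
test code):

    import subprocess
    import sys

    # A child that never reads stdin, plus more input than a pipe buffer
    # holds, so a synchronous write would block past the deadline.
    proc = subprocess.Popen(
        [sys.executable, '-c', 'import time; time.sleep(30)'],
        stdin=subprocess.PIPE)
    try:
        proc.communicate(input=b'x' * (16 * 1024 * 1024), timeout=2)
    except subprocess.TimeoutExpired:
        proc.kill()
        proc.communicate()  # second call joins the background writer thread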
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
Fix inconsistent subprocess.Popen.communicate() behavior between Windows
and POSIX when using memoryview objects with non-byte elements as input.
On POSIX systems, the code was incorrectly comparing bytes written against
element count instead of byte count, causing data truncation for large
inputs with non-byte element types.
Changes:
- Cast memoryview inputs to byte view when input is already a memoryview
- Fix progress tracking to use len(input_view) instead of len(self._input)
- Add comprehensive test coverage for memoryview inputs
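To illustrate the element-count versus byte-count distinction (hedged
sketch; only the len()/nbytes arithmetic is the point):

    import array

    mv = memoryview(array.array('I', range(1024)))   # multi-byte elements
    assert len(mv) == 1024                   # elements (what the bug compared)
    assert mv.nbytes == 1024 * mv.itemsize   # bytes (what must be tracked)
    input_view = mv.cast('b')                # flat byte view; len() now in bytes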
🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <noreply@anthropic.com>
* old-man-yells-at-ReST
* Update 2025-05-30-18-37-44.gh-issue-134453.kxkA-o.rst
* assertIsNone review feedback
* fix memoryview_nonbytes test to fail without our fix on main, and have a nicer error.
Thanks to Peter Bierma @ZeroIntensity for the code review.
* gh-141473: Fix subprocess.Popen.communicate to send input to stdin
* Docs: Clarify that `input` is one time only on `communicate()`
* NEWS entry
* Add a regression test.
---------
Co-authored-by: Gregory P. Smith <greg@krypto.org>
Removes a copy going from bytearray to bytes.
Co-authored-by: Victor Stinner <vstinner@python.org>
Co-authored-by: Bénédikt Tran <10796600+picnixz@users.noreply.github.com>
Added atomic operations to `scanner_begin()` and `scanner_end()` to prevent
race conditions on the `executing` flag in free-threaded builds. Also added
tests for concurrent usage of the `re` module.
Without the atomic operations, `test_scanner_concurrent_access()` triggers
`assert(self->executing)` failures, or a thread sanitizer run emits errors.
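The access pattern being exercised looks roughly like this (a hedged
sketch; pattern.scanner() is the undocumented API backing the C-level
scanner object):

    import re
    import threading

    scanner = re.compile(r'\w+').scanner('word ' * 10_000)

    def worker():
        # Several threads share one scanner, racing on its executing flag.
        while scanner.search() is not None:
            pass

    threads = [threading.Thread(target=worker) for _ in range(4)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()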
* Make Py_{SIZE,IS_TYPE,SET_SIZE} regular functions in stable ABI
Group them together with Py_TYPE & Py_SET_TYPE to cut down
on repetitive preprocessor macros.
Format repetitive definitions in object.c more concisely.
Py_SET_TYPE is still left out of the Limited API.
* test_hashlib: better handle support for SHA3
It's possible that the SSL library supports only the SHA3 algorithms
and not the SHAKE ones.
The current test wrongly detects this and sets both HASH and HASHXOF to
None, expecting the extra SHA3 attributes to be present, but this should
only be true for the SHAKE algorithms.
To better handle this, move the HASH import into a dedicated try-except
block and check whether HASHXOF is None in the relevant code, effectively
checking whether SHA3 is supported by the SSL library while the SHAKE
algorithms still need to use the sha3module implementation.
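The reworked import then has roughly this shape (a hedged sketch of
the described fix):

    try:
        from _hashlib import HASH
    except ImportError:
        HASH = None
    try:
        from _hashlib import HASHXOF
    except ImportError:
        HASHXOF = None  # no SHAKE in the SSL library; SHA3 may still exist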
Signed-off-by: Christian Marangi <ansuelsmth@gmail.com>
* rework the conditional import for all its attrs
---------
Signed-off-by: Christian Marangi <ansuelsmth@gmail.com>
Co-authored-by: Gregory P. Smith <greg@krypto.org>
Most of the `self.assertTrue(self.called)` checks are flaky because
the worker threads may sometimes finish before the main thread calls
`self.during_threads()`.
Simplify preload regression test using `__main__`
With the fix for gh-126631, `__main__` modules can be preloaded, and the
regression test for gh-135335 can be simplified to just use a
self-contained script rather than requiring a module.
Note this assumes and implicitly tests that `__main__` is preloaded by default.
If we overflowed the global version counter (i.e., after 2**24 calls to
`_PyMonitoring_SetEvents`), we bailed out after setting global monitoring
events but before instrumenting code objects, which led to assertion errors
later on.
Also add a `time.sleep()` to `test_free_threading.test_monitoring` to avoid
overflowing the global version counter.
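At the Python level, calls to `_PyMonitoring_SetEvents` correspond to
sys.monitoring.set_events() (a hedged sketch of what repeatedly bumps
the global version):

    import sys

    mon = sys.monitoring
    mon.use_tool_id(mon.PROFILER_ID, "demo")
    for _ in range(10):  # 2**24 of these overflow the version counter
        mon.set_events(mon.PROFILER_ID, mon.events.CALL)
        mon.set_events(mon.PROFILER_ID, 0)
    mon.free_tool_id(mon.PROFILER_ID)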
Revert (unneeded, already done elsewhere) "gh-98552: flush std streams in the multiprocessing forkserver before fork (#141849)"
This reverts commit 58badb1711.
Added a critical section to protect the states of `ReaderObj` and `WriterObj` in the free-threading build. Without the critical sections, both new free-threading tests were crashing.
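The crashing pattern looks roughly like this (a hedged sketch: one
reader object shared by several threads, so its internal state needs
the critical section):

    import csv
    import io
    import threading

    reader = csv.reader(io.StringIO('a,b,c\n' * 100_000))

    def worker():
        for _ in reader:
            pass

    threads = [threading.Thread(target=worker) for _ in range(4)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()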