cpython

mirror of https://github.com/python/cpython.git synced 2025-10-31 21:51:50 +00:00

Author	SHA1	Message	Date
Miss Islington (bot)	00ee14e814	[3.9] bpo-45820: Fix a segfault when the parser fails without reading any input (GH-29580) (GH-29584) Co-authored-by: Pablo Galindo Salgado <Pablogsal@gmail.com> Co-authored-by: Łukasz Langa <lukasz@langa.pl>	2021-11-18 01:24:43 +01:00
Pablo Galindo Salgado	0ef308a289	bpo-45822: Respect PEP 263's coding cookies in the parser even if flags are not provided (GH-29582) (GH-29585) (cherry picked from commit `da20d7401d`)	2021-11-18 00:18:16 +01:00
Pablo Galindo Salgado	142fcb40b6	bpo-45738: Fix computation of error location for invalid continuation characters in the parser (GH-29550) (GH-29552) (cherry picked from commit `25835c518a`)	2021-11-14 01:47:27 +00:00
Łukasz Langa	88f4ec88e2	[3.9] bpo-45494: Fix parser crash when reporting errors involving invalid continuation characters (GH-28993) (#29071 ) There are two errors that this commit fixes: * The parser was not correctly computing the offset and the string source for E_LINECONT errors due to the incorrect usage of strtok(). * The parser was not correctly unwinding the call stack when a tokenizer exception happened in rules involving optionals ('?', [...]) as we always make them return valid results by using the comma operator. We need to check first if we don't have an error before continuing.. (cherry picked from commit `a106343f63`) Co-authored-by: Pablo Galindo Salgado <Pablogsal@gmail.com> NOTE: unlike the cherry-picked original, this commit points at a crazy location due to a bug in the tokenizer that required a big refactor in 3.10 to fix. We are leaving as-is for 3.9.	2021-10-20 18:51:13 +02:00
Pablo Galindo	0d0a9eaa82	[3.9] bpo-44409: Fix error location in tokenizer errors that happen during initialization (GH-26712). (GH-26723) (cherry picked from commit `507ed6fa1d`) Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2021-06-14 18:07:51 +01:00
Erlend Egeberg Aasland	76d270ec2b	[3.9] bpo-43779: Fix possible refleak involving _PyArena_AddPyObject (GH-25289). (GH-25294) * [3.9] Fix possible refleak involving _PyArena_AddPyObject (GH-25289). (cherry picked from commit `c0e11a3ceb`) Co-authored-by: Erlend Egeberg Aasland <erlend.aasland@innova.no> * Update Parser/pegen/pegen.c Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2021-04-09 18:46:32 +01:00
Miss Islington (bot)	994a519915	bpo-43555: Report the column offset for invalid line continuation character (GH-24939) (#24975 ) (cherry picked from commit `96eeff5162`) Co-authored-by: Pablo Galindo <Pablogsal@gmail.com> Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2021-03-22 19:07:05 +00:00
Pablo Galindo	ddcd57e3ea	[3.9] bpo-42214: Fix check for NOTEQUAL token in the PEG parser for the barry_as_flufl rule (GH-23048) (GH-23051) (cherry picked from commit `06f8c3328d`) Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2020-10-31 00:40:42 +00:00
Lysandros Nikolaou	24a7c298d4	[3.9] bpo-42123: Run the parser two times and only enable invalid rules on the second run (GH-22111) (GH-23011) * Implement running the parser a second time for the errors messages The first parser run is only responsible for detecting whether there is a `SyntaxError` or not. If there isn't the AST gets returned. Otherwise, the parser is run a second time with all the `invalid_*` rules enabled so that all the customized error messages get produced. (cherry picked from commit `bca7014032`)	2020-10-28 02:14:15 +02:00
Miss Skeleton (bot)	0b290dd217	bpo-42150: Avoid buffer overflow in the new parser (GH-22978) (cherry picked from commit `e68c67805e`) Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2020-10-25 16:24:56 -07:00
Pablo Galindo	be17295280	[3.9] bpo-41697: Correctly handle KeywordOrStarred when parsing arguments in the parser (GH-22077) (GH-22079) (cherry picked from commit `315a61f7a9`) Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2020-09-03 16:35:17 +01:00
Pablo Galindo	8de34cdb95	[3.9] bpo-41690: Use a loop to collect args in the parser instead of recursion (GH-22053) (GH-22067) This program can segfault the parser by stack overflow: ``` import ast code = "f(" + ",".join(['a' for _ in range(100000)]) + ")" print("Ready!") ast.parse(code) ``` the reason is that the rule for arguments has a simple recursion when collecting args: args[expr_ty]: [...] \| a=named_expression b=[',' c=args { c }] { [...] }. (cherry picked from commit `4a97b1517a`) Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2020-09-02 21:30:51 +01:00
Pablo Galindo	bc2c0e9a57	[3.9] Validate the AST produced by the parser in debug mode (GH-21643) (GH-21646) This will improve the debug experience if something fails in the produced AST. Previously, errors in the produced AST can be felt much later like in the garbage collector or the compiler, making debugging them much more difficult.. (cherry picked from commit `1332226b32`) Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2020-07-28 00:12:31 +01:00
Miss Islington (bot)	edeaf61b68	bpo-41215: Make assertion in the new parser more strict (GH-21364) (cherry picked from commit `782f44b8fb`) Co-authored-by: Lysandros Nikolaou <lisandrosnik@gmail.com>	2020-07-06 16:35:10 -07:00
Pablo Galindo	54f115dd53	[3.9] bpo-41215: Don't use NULL by default in the PEG parser keyword list (GH-21355) (GH-21356) (cherry picked from commit `39e76c0fb0`) Co-authored-by: Pablo Galindo <pablogsal@gmail.com> Automerge-Triggered-By: @lysnikolaou	2020-07-06 12:29:59 -07:00
Guido van Rossum	2a1ee1d970	[3.9] bpo-35975: Only use cf_feature_version if PyCF_ONLY_AST in cf_flags (#21022 )	2020-06-27 17:34:30 -07:00
Pablo Galindo	dab533d0ee	[3.9] bpo-41076: Pre-feed the parser with the f-string expression location (GH-21054) (GH-21190) This commit changes the parsing of f-string expressions with the new parser. The parser gets pre-fed with the location of the expression itself (not the f-string, which was what we were doing before). This allows us to completely skip the shifting of the AST nodes after the parsing is completed.. (cherry picked from commit `1f0f4abb11`)	2020-06-28 01:15:28 +01:00
Miss Islington (bot)	cb0dc52d37	bpo-41084: Adjust message when an f-string expression causes a SyntaxError (GH-21084) Prefix the error message with `fstring: `, when parsing an f-string expression throws a `SyntaxError`. (cherry picked from commit `2e0a920e9e`) Co-authored-by: Lysandros Nikolaou <lisandrosnik@gmail.com>	2020-06-27 12:43:49 -07:00
Miss Islington (bot)	c9f83c173b	bpo-40958: Avoid 'possible loss of data' warning on Windows (GH-20970) (cherry picked from commit `861efc6e8f`) Co-authored-by: Lysandros Nikolaou <lisandrosnik@gmail.com>	2020-06-20 10:35:03 -07:00
Lysandros Nikolaou	a5442b26f4	[3.9] bpo-40334: Produce better error messages on invalid targets (GH-20106) (GH-20973) * bpo-40334: Produce better error messages on invalid targets (GH-20106) The following error messages get produced: - `cannot delete ...` for invalid `del` targets - `... is an illegal 'for' target` for invalid targets in for statements - `... is an illegal 'with' target` for invalid targets in with statements Additionally, a few `cut`s were added in various places before the invocation of the `invalid_*` rule, in order to speed things up. Co-authored-by: Pablo Galindo <Pablogsal@gmail.com> (cherry picked from commit `01ece63d42`)	2020-06-19 01:03:58 +01:00
Miss Islington (bot)	7795ae8f05	bpo-40958: Avoid buffer overflow in the parser when indexing the current line (GH-20875) (GH-20919) (cherry picked from commit `51c5896b62`) Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2020-06-16 18:36:59 +01:00
Pablo Galindo	30b59fd7cf	[3.9] Improve readability and style in parser files (GH-20884) (GH-20885) (cherry picked from commit `fb61c42`) Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2020-06-15 15:08:00 +01:00
Miss Islington (bot)	8df4f3942f	bpo-40903: Handle multiple '=' in invalid assignment rules in the PEG parser (GH-20697) Automerge-Triggered-By: @pablogsal (cherry picked from commit `9f495908c5`) Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2020-06-08 02:22:06 -07:00
Miss Islington (bot)	15fec5627a	bpo-40880: Fix invalid read in newline_in_string in pegen.c (GH-20666) * bpo-40880: Fix invalid read in newline_in_string in pegen.c * Update Parser/pegen/pegen.c Co-authored-by: Lysandros Nikolaou <lisandrosnik@gmail.com> * Add NEWS entry Co-authored-by: Lysandros Nikolaou <lisandrosnik@gmail.com> (cherry picked from commit `2e6593db00`) Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2020-06-05 17:13:14 -07:00
Lysandros Nikolaou	c011d1b5be	[3.9] Backport GH-20440: Set p->error_indicator in more places (GH-20457)	2020-05-27 21:20:43 +01:00
Lysandros Nikolaou	1bfe659ee5	[3.9] Backport GH-20370 and GH-20436: Soft keywords (GH-20458)	2020-05-27 21:20:07 +01:00
Miss Islington (bot)	82da2c3eb4	bpo-40750: Support -d flag in the new parser (GH-20340) (cherry picked from commit `800a35c623`) Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2020-05-25 10:58:03 -07:00
Miss Islington (bot)	11fb605cb8	Use Py_ssize_t for the column number in the PEG support code (GH-20341) (cherry picked from commit `b23d7adfdf`) Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2020-05-23 22:20:44 -07:00
Miss Islington (bot)	55c8923524	bpo-40334: Produce better error messages for non-parenthesized genexps (GH-20153) The error message, generated for a non-parenthesized generator expression in function calls, was still the generic `invalid syntax`, when the generator expression wasn't appearing as the first argument in the call. With this patch, even on input like `f(a, b, c for c in d, e)`, the correct error message gets produced. (cherry picked from commit `ae14583302`) Co-authored-by: Lysandros Nikolaou <lisandrosnik@gmail.com>	2020-05-21 18:14:55 -07:00
Lysandros Nikolaou	75b863aa97	bpo-40334: Reproduce error message for type comments on bare '*' in the new parser (GH-20151)	2020-05-18 20:14:47 +01:00
Pablo Galindo	16ab07063c	bpo-40334: Correctly identify invalid target in assignment errors (GH-20076) Co-authored-by: Lysandros Nikolaou <lisandrosnik@gmail.com>	2020-05-15 02:04:52 +01:00
Pablo Galindo	bcc3036095	bpo-40619: Correctly handle error lines in programs without file mode (GH-20090)	2020-05-14 21:11:48 +01:00
Lysandros Nikolaou	a15c9b3a05	bpo-40334: Always show the caret on SyntaxErrors (GH-20050) This commit fixes SyntaxError locations when the caret is not displayed, by doing the following: - `col_number` always gets set to the location of the offending node/expr. When no caret is to be displayed, this gets achieved by setting the object holding the error line to None. - Introduce a new function `_PyPegen_raise_error_known_location`, which can be called, when an arbitrary `lineno`/`col_offset` needs to be passed. This function then gets used in the grammar (through some new macros and inline functions) so that SyntaxError locations of the new parser match that of the old.	2020-05-13 20:36:27 +01:00
Serhiy Storchaka	74ea6b5a75	bpo-40593: Improve syntax errors for invalid characters in source code. (GH-20033)	2020-05-12 12:42:04 +03:00
Pablo Galindo	5b956ca42d	bpo-40585: Normalize errors messages in codeop when comparing them (GH-20030) With the new parser, the error message contains always the trailing newlines, causing the comparison of the repr of the error messages in codeop to fail. This commit makes the new parser mirror the old parser's behaviour regarding trailing newlines.	2020-05-11 01:41:26 +01:00
Lysandros Nikolaou	2f37c355ab	bpo-40334: Fix error location upon parsing an invalid string literal (GH-19962) When parsing a string with an invalid escape, the old parser used to point to the beginning of the invalid string. This commit changes the new parser to match that behaviour, since it's currently pointing to the end of the string (or to be more precise, to the beginning of the next token).	2020-05-07 11:37:51 +01:00
Lysandros Nikolaou	846d8b28ab	bpo-40246: Revert reporting of invalid string prefixes (GH-19888) Due to backwards compatibility concerns regarding keywords immediately followed by a string without whitespace between them (like in `bg="#d00" if clear else"#fca"`) will fail to parse, commit `41d5b94af4` has to be reverted.	2020-05-04 12:32:18 +01:00
Shantanu	c3f001461d	bpo-40491: Fix typo in syntax error for numeric literals (GH-19893)	2020-05-04 11:13:30 +03:00
Lysandros Nikolaou	7f06af684a	bpo-40334: Set error_indicator in _PyPegen_raise_error (GH-19887) Due to PyErr_Occurred not being called at the beginning of each rule, we need to set the error indicator, so that rules do not get expanded after an exception has been thrown	2020-05-04 01:20:09 +01:00
Guido van Rossum	d9d6eadf00	Ensure that tok->type_comments is set on every path (GH-19828)	2020-05-01 17:42:32 +01:00
Batuhan Taskaya	76c1b4d5c5	bpo-40334: Improve column offsets for thrown syntax errors by Pegen (GH-19782)	2020-05-01 14:13:43 +01:00
Lysandros Nikolaou	3e0a6f37df	bpo-40334: Add support for feature_version in new PEG parser (GH-19827) `ast.parse` and `compile` support a `feature_version` parameter that tells the parser to parse the input string, as if it were written in an older Python version. The `feature_version` is propagated to the tokenizer, which uses it to handle the three different stages of support for `async` and `await`. Additionally, it disallows the following at parser level: - The '@' operator in < 3.5 - Async functions in < 3.5 - Async comprehensions in < 3.6 - Underscores in numeric literals in < 3.6 - Await expression in < 3.5 - Variable annotations in < 3.6 - Async for-loops in < 3.5 - Async with-statements in < 3.5 - F-strings in < 3.6 Closes we-like-parsers/cpython#124.	2020-04-30 20:27:52 -07:00
Guido van Rossum	c001c09e90	bpo-40334: Support type comments (GH-19780) This implements full support for # type: <type> comments, # type: ignore <stuff> comments, and the func_type parsing mode for ast.parse() and compile(). Closes https://github.com/we-like-parsers/cpython/issues/95. (For now, you need to use the master branch of mypy, since another issue unique to 3.9 had to be fixed there, and there's no mypy release yet.) The only thing missing is `feature_version=N`, which is being tracked in https://github.com/we-like-parsers/cpython/issues/124.	2020-04-30 12:12:19 -07:00
Pablo Galindo	4db245ee9d	bpo-40334: refactor and cleanup for the PEG generators (GH-19775)	2020-04-29 10:42:21 +01:00
Lysandros Nikolaou	6d65087655	bpo-40334: Disallow invalid single statements in the new parser (GH-19774) After parsing is done in single statement mode, the tokenizer buffer has to be checked for additional lines and a `SyntaxError` must be raised, in case there are any. Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2020-04-29 02:42:27 +01:00
Pablo Galindo	2208134918	bpo-40334: Explicitly cast to int in pegen.c to fix a compiler warning (GH-19779)	2020-04-29 02:04:06 +01:00
Lysandros Nikolaou	d55133f49f	bpo-40334: Catch E_EOF error, when the tokenizer returns ERRORTOKEN (GH-19743) An E_EOF error was only being caught after the parser exited before this commit. There are some cases though, where the tokenizer returns ERRORTOKEN and has set an E_EOF error (like when EOF directly follows a line continuation character) which weren't correctly handled before.	2020-04-28 01:23:35 +01:00
Pablo Galindo	b94dbd7ac3	bpo-40334: Support PyPARSE_DONT_IMPLY_DEDENT in the new parser (GH-19736)	2020-04-27 18:35:58 +01:00
Pablo Galindo	2b74c835a7	bpo-40334: Support CO_FUTURE_BARRY_AS_BDFL in the new parser (GH-19721) This commit also allows to pass flags to the new parser in all interfaces and fixes a bug in the parser generator that was causing to inline rules with actions, making them disappear.	2020-04-27 18:02:07 +01:00
Pablo Galindo	9f27dd3e16	Use Py_ssize_t instead of ssize_t (GH-19685)	2020-04-24 01:13:33 +01:00

1 2

54 commits