mirror of
				https://github.com/python/cpython.git
				synced 2025-10-24 18:33:49 +00:00 
			
		
		
		
	
		
			
	
	
		
			180 lines
		
	
	
	
		
			5.8 KiB
		
	
	
	
		
			Text
		
	
	
	
	
	
		
		
			
		
	
	
			180 lines
		
	
	
	
		
			5.8 KiB
		
	
	
	
		
			Text
		
	
	
	
	
	
|   | Description of exception handling in Python 3.11 | ||
|  | ------------------------------------------------ | ||
|  | 
 | ||
|  | Python 3.11 uses what is known as "zero-cost" exception handling. | ||
|  | Prior to 3.11, exceptions were handled by a runtime stack of "blocks". | ||
|  | 
 | ||
|  | In zero-cost exception handling, the cost of supporting exceptions is minimized. | ||
|  | In the common case (where no exception is raised) the cost is reduced | ||
|  | to zero (or close to zero). | ||
|  | The cost of raising an exception is increased, but not by much. | ||
|  | 
 | ||
|  | The following code: | ||
|  | 
 | ||
|  | def f(): | ||
|  |     try: | ||
|  |         g(0) | ||
|  |     except: | ||
|  |         return "fail" | ||
|  | 
 | ||
|  | compiles as follows in 3.10: | ||
|  | 
 | ||
|  |   2           0 SETUP_FINALLY            7 (to 16) | ||
|  | 
 | ||
|  |   3           2 LOAD_GLOBAL              0 (g) | ||
|  |               4 LOAD_CONST               1 (0) | ||
|  |               6 CALL_FUNCTION            1 | ||
|  |               8 POP_TOP | ||
|  |              10 POP_BLOCK | ||
|  |              12 LOAD_CONST               0 (None) | ||
|  |              14 RETURN_VALUE | ||
|  | 
 | ||
|  |   4     >>   16 POP_TOP | ||
|  |              18 POP_TOP | ||
|  |              20 POP_TOP | ||
|  | 
 | ||
|  |   5          22 POP_EXCEPT | ||
|  |              24 LOAD_CONST               3 ('fail') | ||
|  |              26 RETURN_VALUE | ||
|  | 
 | ||
|  | Note the explicit instructions to push and pop from the "block" stack: | ||
|  | SETUP_FINALLY and POP_BLOCK. | ||
|  | 
 | ||
|  | In 3.11, the SETUP_FINALLY and POP_BLOCK are eliminated, replaced with | ||
|  | a table to determine where to jump to when an exception is raised. | ||
|  | 
 | ||
|  |   2           0 NOP | ||
|  | 
 | ||
|  |   3           2 LOAD_GLOBAL              0 (g) | ||
|  |               4 LOAD_CONST               1 (0) | ||
|  |               6 CALL_FUNCTION            1 | ||
|  |               8 POP_TOP | ||
|  |              10 LOAD_CONST               0 (None) | ||
|  |              12 RETURN_VALUE | ||
|  |         >>   14 PUSH_EXC_INFO | ||
|  | 
 | ||
|  |   4          16 POP_TOP | ||
|  |              18 POP_TOP | ||
|  |              20 POP_TOP | ||
|  | 
 | ||
|  |   5          22 POP_EXCEPT | ||
|  |              24 LOAD_CONST               2 ('fail') | ||
|  |              26 RETURN_VALUE | ||
|  |         >>   28 POP_EXCEPT_AND_RERAISE | ||
|  | ExceptionTable: | ||
|  |   2 to 8 -> 14 [0] | ||
|  |   14 to 20 -> 28 [3] lasti | ||
|  | 
 | ||
|  | (Note this code is from an early 3.11 alpha, the NOP may well have be removed before release). | ||
|  | 
 | ||
|  | If an instruction raises an exception then its offset is used to find the target to jump to. | ||
|  | For example, the CALL_FUNCTION at offset 6, falls into the range 2 to 8. | ||
|  | So, if g() raises an exception, then control jumps to offset 14. | ||
|  | 
 | ||
|  | 
 | ||
|  | Unwinding | ||
|  | --------- | ||
|  | 
 | ||
|  | When an exception is raised, the current instruction offset is used to find following: | ||
|  | target to jump to, stack depth, and 'lasti', which determines whether the instruction | ||
|  | offset of the raising instruction should be pushed. | ||
|  | 
 | ||
|  | This information is stored in the exception table, described below. | ||
|  | 
 | ||
|  | If there is no relevant entry, the exception bubbles up to the caller. | ||
|  | 
 | ||
|  | If there is an entry, then: | ||
|  |  1. pop values from the stack until it matches the stack depth for the handler, | ||
|  |  2. if 'lasti' is true, then push the offset that the exception was raised at. | ||
|  |  3. push the exception to the stack as three values: traceback, value, type, | ||
|  |  4. jump to the target offset and resume execution. | ||
|  | 
 | ||
|  | 
 | ||
|  | Format of the exception table | ||
|  | ----------------------------- | ||
|  | 
 | ||
|  | Conceptually, the exception table consists of a sequence of 5-tuples: | ||
|  |     1. start-offset (inclusive) | ||
|  |     2. end-offset (exclusive) | ||
|  |     3. target | ||
|  |     4. stack-depth | ||
|  |     5. push-lasti (boolean) | ||
|  | 
 | ||
|  | All offsets and lengths are in instructions, not bytes. | ||
|  | 
 | ||
|  | We want the format to be compact, but quickly searchable. | ||
|  | For it to be compact, it needs to have variable sized entries so that we can store common (small) offsets compactly, but handle large offsets if needed. | ||
|  | For it to be searchable quickly, we need to support binary search giving us log(n) performance in all cases. | ||
|  | Binary search typically assumes fixed size entries, but that is not necesary, as long as we can identify the start of an entry. | ||
|  | 
 | ||
|  | It is worth noting that the size (end-start) is always smaller than the end, so we encode the entries as: | ||
|  |     start, size, target, depth, push-lasti | ||
|  | 
 | ||
|  | Also, sizes are limited to 2**30 as the code length cannot exceed 2**31 and each instruction takes 2 bytes. | ||
|  | It also happens that depth is generally quite small. | ||
|  | 
 | ||
|  | So, we need to encode: | ||
|  |     start (up to 30 bits) | ||
|  |     size (up to 30 bits) | ||
|  |     target (up to 30 bits) | ||
|  |     depth (up to ~8 bits) | ||
|  |     lasti (1 bit) | ||
|  | 
 | ||
|  | We need a marker for the start of the entry, so the first byte of entry will have the most significant bit set. | ||
|  | Since the most significant bit is reserved for marking the start of an entry, we have 7 bits per byte to encode offsets. | ||
|  | Encoding uses a standard varint encoding, but with only 7 bits instead of the usual 8. | ||
|  | The 8 bits of a bit are (msb left) SXdddddd where S is the start bit. X is the extend bit meaning that the next byte is required to extend the offset. | ||
|  | 
 | ||
|  | In addition, we will combine depth and lasti into a single value, ((depth<<1)+lasti), before encoding. | ||
|  | 
 | ||
|  | For example, the exception entry: | ||
|  |     start:  20 | ||
|  |     end:    28 | ||
|  |     target: 100 | ||
|  |     depth:  3 | ||
|  |     lasti:  False | ||
|  | 
 | ||
|  | is encoded first by converting to the more compact four value form: | ||
|  |     start:         20 | ||
|  |     size:          8 | ||
|  |     target:        100 | ||
|  |   depth<<1+lasti:  6 | ||
|  | 
 | ||
|  | which is then encoded as: | ||
|  |     148 (MSB + 20 for start) | ||
|  |     8   (size) | ||
|  |     65  (Extend bit + 1) | ||
|  |     36  (Remainder of target, 100 == (1<<6)+36) | ||
|  |     6 | ||
|  | 
 | ||
|  | for a total of five bytes. | ||
|  | 
 | ||
|  | 
 | ||
|  | 
 | ||
|  | Script to parse the exception table | ||
|  | ----------------------------------- | ||
|  | 
 | ||
|  | def parse_varint(iterator): | ||
|  |     b = next(iterator) | ||
|  |     val = b & 63 | ||
|  |     while b&64: | ||
|  |         val <<= 6 | ||
|  |         b = next(iterator) | ||
|  |         val |= b&63 | ||
|  |     return val | ||
|  | 
 | ||
|  | def parse_exception_table(code): | ||
|  |     iterator = iter(code.co_exceptiontable) | ||
|  |     try: | ||
|  |         while True: | ||
|  |             start = parse_varint(iterator)*2 | ||
|  |             length = parse_varint(iterator)*2 | ||
|  |             end = start + length - 2 # Present as inclusive, not exclusive | ||
|  |             target = parse_varint(iterator)*2 | ||
|  |             dl = parse_varint(iterator) | ||
|  |             depth = dl >> 1 | ||
|  |             lasti = bool(dl&1) | ||
|  |             yield start, end, target, depth, lasti | ||
|  |     except StopIteration: | ||
|  |         return |