Instead of doing a naive O(n^2) liveness-detection loop, track
register-allocated values with a bitmap.
This cuts validation time from 20% to 1.4% of runtime on the same game
as the last commit.
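
A minimal sketch of the idea, assuming register slots are identified by
a small index (the types and names here are hypothetical, not the
actual ones):

    // Liveness of register-allocated values as a single bitmask: each
    // check/update is O(1) instead of a rescan of all values.
    struct RegLiveness {
        live: u32, // bit i set => register slot i holds a live value
    }

    impl RegLiveness {
        fn mark_live(&mut self, slot: u8) {
            self.live |= 1u32 << slot;
        }
        fn mark_dead(&mut self, slot: u8) {
            self.live &= !(1u32 << slot);
        }
        fn is_live(&self, slot: u8) -> bool {
            self.live & (1u32 << slot) != 0
        }
    }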
This, along with moving the sources and destination out of the config
object, means we no longer have to double-dereference to reach them on
each instruction, leading to a ~15% perf improvement on dispatch.
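
Roughly the shape of the layout change, with hypothetical field names;
before, reaching an operand cost two pointer chases (instruction ->
config -> operand):

    // Before (hypothetical): hot operands live behind a second indirection.
    struct Config {
        sources: [u16; 2],
        destination: u16,
        // ...cold, rarely-touched settings
    }
    struct InstrBefore {
        config: Box<Config>, // instr -> config -> sources/destination
    }

    // After: hot fields hoisted into the instruction itself, one deref.
    struct InstrAfter {
        sources: [u16; 2],
        destination: u16,
        config: Box<Config>, // cold data only
    }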
Find an upper bound on the space needed at validation time so we can
allocate it all at once when entering the frame.
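
A sketch of how the bound could be computed, assuming validation
already walks every instruction (Op and the names below are
illustrative):

    // Hypothetical per-op stack effect, known at validation time.
    struct Op {
        pops: usize,
        pushes: usize,
    }

    // Worst-case operand-stack height for a function body.
    fn max_stack_height(ops: &[Op]) -> usize {
        let (mut height, mut max) = (0usize, 0usize);
        for op in ops {
            height = height.saturating_sub(op.pops) + op.pushes;
            max = max.max(height);
        }
        max
    }

    // On frame entry, one reservation replaces per-push growth checks:
    //     values.reserve(func.max_stack);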
Also drop labels all at once instead of popping them off one at a time,
now that we're using a Vector.
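
With a Vector this is a single truncate instead of a pop loop
(label_stack and target_depth are illustrative names; the label payload
is a placeholder):

    // Unwind the label stack to a branch target in one step.
    fn unwind_labels(label_stack: &mut Vec<u32>, target_depth: usize) {
        // Before: while label_stack.len() > target_depth { label_stack.pop(); }
        label_stack.truncate(target_depth);
    }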
Calls still pass their values on the stack, but registers are now
allowed to cross a call boundary.
This is a very significant (>50%) improvement on the small-call
microbenchmarks on my machine.
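
One way to read this, as a hedged sketch only: arguments still travel
via the value stack, but the caller's register file is saved in the
frame and restored on return, rather than being flushed to the value
stack around every call (all names are hypothetical):

    struct Frame {
        saved_regs: [u64; 8], // caller's register file, restored on return
    }

    fn do_call(regs: &mut [u64; 8], frames: &mut Vec<Frame>) {
        // Arguments were already pushed on the value stack by the caller.
        frames.push(Frame { saved_regs: *regs });
        *regs = [0; 8]; // callee starts with a fresh register file
    }

    fn do_return(regs: &mut [u64; 8], frames: &mut Vec<Frame>) {
        if let Some(frame) = frames.pop() {
            *regs = frame.saved_regs; // register contents survive the call
        }
    }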
This commit adds a register allocator, with 8 available "register"
slots.
In testing with various random blobs, this moves anywhere from 30% to
74% of value accesses into predefined slots and gives a ~20% perf
increase end-to-end.
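
A minimal sketch of one possible allocation policy; pinning the
most-accessed locals to the eight slots is an assumption for
illustration, not necessarily the heuristic used here:

    const NUM_REG_SLOTS: usize = 8;

    // Hypothetical pass: given per-local access counts gathered at
    // validation time, pin the hottest locals to register slots.
    fn allocate_registers(access_counts: &[u32]) -> Vec<Option<u8>> {
        let mut order: Vec<usize> = (0..access_counts.len()).collect();
        order.sort_by_key(|&i| std::cmp::Reverse(access_counts[i]));

        let mut assignment = vec![None; access_counts.len()];
        for (slot, &local) in order.iter().take(NUM_REG_SLOTS).enumerate() {
            assignment[local] = Some(slot as u8); // local -> register slot
        }
        assignment
    }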
To actually make this usable, a few structural changes were also made:
- we no longer do one instruction per interpret call
- trapping is an (unlikely) exit condition
- the label and frame stacks are replaced with unrolled linked lists
  whose nodes each cache a large batch of entries (see the sketch
  below), as we only need to touch the last element and push/pop is
  very frequent.
The average wasm function rarely exceeds these per-node bounds for
labels (32 nested control structures), and 8 frames is just enough to
get through most initialization code/start sections without allocating
anything.
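
A hedged sketch of that list shape; the node capacity matches the label
bound above (32 per node, and analogously 8 for frames), while
everything else is illustrative:

    // Each node caches a batch of entries, so push/pop normally just
    // bumps an index in the tail node; a new node is allocated only on
    // overflow, which typical wasm functions never hit.
    const LABELS_PER_NODE: usize = 32;

    struct LabelNode {
        entries: [u32; LABELS_PER_NODE], // placeholder label payload
        len: usize,
        prev: Option<Box<LabelNode>>,
    }

    struct LabelStack {
        tail: Box<LabelNode>,
    }

    impl LabelStack {
        fn push(&mut self, label: u32) {
            if self.tail.len == LABELS_PER_NODE {
                // Rare: chain a fresh node once the current one is full.
                let full = std::mem::replace(
                    &mut self.tail,
                    Box::new(LabelNode {
                        entries: [0; LABELS_PER_NODE],
                        len: 0,
                        prev: None,
                    }),
                );
                self.tail.prev = Some(full);
            }
            let n = self.tail.len;
            self.tail.entries[n] = label;
            self.tail.len = n + 1;
        }

        fn pop(&mut self) -> Option<u32> {
            if self.tail.len == 0 {
                // Walk back into the previous (always full) node, if any.
                let prev = self.tail.prev.take()?;
                self.tail = prev;
            }
            self.tail.len -= 1;
            Some(self.tail.entries[self.tail.len])
        }
    }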