ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2026-04-18 18:00:31 +00:00

Author	SHA1	Message	Date
Andreas Kling	6cdfbd01a6	LibJS: Add alternative source-to-bytecode pipeline in Rust Implement a complete Rust reimplementation of the LibJS frontend: lexer, parser, AST, scope collector, and bytecode code generator. The Rust pipeline is built via Corrosion (CMake-Cargo bridge) and linked into LibJS as a static library. It is gated behind a build flag (ENABLE_RUST, on by default except on Windows) and two runtime environment variables: - LIBJS_CPP: Use the C++ pipeline instead of Rust - LIBJS_COMPARE_PIPELINES=1: Run both pipelines in lockstep, aborting on any difference in AST or bytecode generated. The C++ side communicates with Rust through a C FFI layer (RustIntegration.cpp/h) that passes source text to Rust and receives a populated Executable back via a BytecodeFactory interface.	2026-02-24 09:39:42 +01:00
Andreas Kling	7281091fdb	LibJS: Make bytecode generation infallible Remove CodeGenerationError and make all bytecode generation functions return their results directly instead of wrapping them in CodeGenerationErrorOr. For the few remaining sites where codegen encounters an unimplemented or unexpected AST node, we now use a new emit_todo() helper that emits a NewTypeError + Throw sequence at compile time (preserving the runtime behavior) and then switches to a dead basic block so subsequent codegen for the same function can continue without issue. This allows us to remove error handling from all callers of the bytecode compiler, simplifying the code significantly.	2026-02-12 11:37:43 +01:00
Andreas Kling	e308e73120	LibJS: Move SharedFunctionInstanceData creation out of FunctionNode Add static factory methods create_for_function_node() on SharedFunctionInstanceData and update all callers to use them instead of FunctionNode::ensure_shared_data(). This removes the GC::Root<SharedFunctionInstanceData> cache from FunctionNode, eliminating the coupling between the RefCounted AST and GC-managed runtime objects. The cache was effectively dead code: hoisted declarations use m_functions_to_initialize directly, and function expressions always create fresh instances during codegen.	2026-02-11 23:57:41 +01:00
Andreas Kling	09a11a1a5c	LibJS: Drop AST after first compilation on SourceTextModule Now that initialize_environment() uses pre-computed data and execute_module() caches its executable / uses TLA shared data, we can drop the AST reference after it's no longer needed. For TLA modules, the AST is dropped immediately after constructing the SharedFunctionInstanceData (which takes its own ref). For non-TLA modules, the AST is dropped after the first bytecode compilation. Also remove the m_default_export field (replaced by the pre-computed m_default_export_binding_name) and extract default export info in parse() instead of the constructor.	2026-02-11 23:57:41 +01:00
Andreas Kling	ef943505e2	LibJS: Cache compiled executable and pre-create TLA shared data For non-TLA modules, cache the compiled Bytecode::Executable on the SourceTextModule so we only compile once. For TLA modules, pre-create the SharedFunctionInstanceData for the async wrapper function at construction time. The wrapper function's first invocation will compile it to bytecode and then automatically drop the AST via clear_compile_inputs().	2026-02-11 23:57:41 +01:00
Andreas Kling	fa07fd9951	LibJS: Pre-compute module declaration metadata from AST Extract the data needed by initialize_environment() from the AST at construction time, following the pattern already used by Script for global declaration instantiation. This pre-computes var declared names, lexical bindings (with function indices), functions to initialize (with SharedFunctionInstanceData), and the default export binding name.	2026-02-11 23:57:41 +01:00
Andreas Kling	4d92c4d71a	LibJS: Skip initializing constant slots in ExecutionContext Every function call allocates an ExecutionContext with a trailing array of Values for registers, locals, constants, and arguments. Previously, the constructor would initialize all slots to js_special_empty_value(), but constant slots were then immediately overwritten by the interpreter copying in values from the Executable before execution began. To eliminate this redundant initialization, we rearrange the layout from [registers | constants | locals] to [registers | locals | constants]. This groups registers and locals together at the front, allowing us to initialize only those slots while leaving constant slots uninitialized until they're populated with their actual values. This reduces the per-call initialization cost from O(registers + locals + constants) to O(registers + locals). Also tightens up the types involved (size_t -> u32) and adds VERIFYs to guard against overflow when computing the combined slot counts, and to ensure the total fits within the 29-bit operand index field.	2026-01-19 10:48:12 +01:00
Timothy Flynn	b060d5dcdf	LibJS: Fix typo in ResolveExport assertion This is an editorial change in the ECMA-262 spec. See: `29c5b17`	2025-12-03 12:08:40 +01:00
Andreas Kling	0dacc94edd	LibJS: Have JS::Lexer take a JS::SourceCode as input This moves the responsibility of setting up a SourceCode object to the users of JS::Lexer. This means Lexer and Parser are free to use string views into the SourceCode internally while working. It also means Lexer no longer has to think about anything other than UTF-16 (or ASCII) inputs. So the unit test for parsing various invalid UTF-8 sequences is deleted here.	2025-11-09 12:14:03 +01:00
Andreas Kling	5706831328	LibJS: Make run_executable() return simple ThrowCompletionOr<Value> We don't need to return two values; running an executable only ever produces a throw completion, or a normal completion, i.e. a Value. This necessitated a few minor changes, such as adding a way to check if a JS::Cell is a GeneratorResult.	2025-10-31 08:56:02 +01:00
Andreas Kling	9dae1acc31	LibJS: Pass ExecutionContext to Interpreter::run_executable() This avoids having to get it from the VM's context stack, since most callers already have it on hand.	2025-10-29 21:20:10 +01:00
Andreas Kling	fdb85a330e	LibJS: Stop tracking whether execution context is strict mode or not This was only used for basic testing, and forced us to plumb this flag in a bunch of places.	2025-10-29 21:20:10 +01:00
Timothy Flynn	b955c9b2a9	LibJS: Port the Identifier AST (and related) nodes to UTF-16 This eliminates quite a lot of UTF-8 / UTF-16 churn.	2025-08-13 09:56:13 -04:00
Timothy Flynn	0efa98a57a	LibJS+LibWeb+WebContent: Port JS::PropertyKey to UTF-16 This has quite a lot of fallout. But the majority of it is just type or UDL substitution, where the changes just fall through to other function calls. By changing property key storage to UTF-16, the main affected areas are: * NativeFunction names must now be UTF-16 * Bytecode identifiers must now be UTF-16 * Module/binding names must now be UTF-16	2025-08-05 07:07:15 -04:00
Timothy Flynn	3867a192a1	LibJS: Update spec steps / links for the import-assertions proposal This proposal has reached stage 4 and been merged into the main ECMA-262 spec. See: `4e3450e`	2025-04-29 07:33:08 -04:00
Aliaksandr Kalenik	a329868c1b	LibJS: Allocate ExecutionContext memory using alloca() when possible This should be faster than heap allocation. However, heap allocation is still necessary in some cases, such as with generators and async functions.	2025-04-24 10:30:52 +02:00
Aliaksandr Kalenik	c6cd03d7ca	LibJS+LibWeb: Join arguments into vector of registers+constants+locals This is better because: - Better data locality - Allocate vector for registers+constants+locals+arguments in one go instead of allocating two vectors separately	2025-04-24 10:30:52 +02:00
Aliaksandr Kalenik	80a8040794	LibJS+LibWeb: Calculate count of regs+consts+locals before EC allocation This is a preparation step before joining arguments vector into vector of registers+constants+locals.	2025-04-24 10:30:52 +02:00
Andreas Kling	4209b18b88	LibJS: Add ECMAScriptFunctionObject::create_from_function_node() helper This gives us a shared entry point for every situation where we instantiate a function based on a FunctionNode from the AST.	2025-04-08 18:52:35 +02:00
Andreas Kling	3cf50539ec	LibJS: Make Value() default-construct the undefined value The special empty value (that we use for array holes, Optional<Value> when empty and a few other placeholder/sentinel tasks) still exists, but you now create one via JS::js_special_empty_value() and check for it with Value::is_special_empty_value(). The main idea here is to make it very unlikely to accidentally create an unexpected special empty value.	2025-04-05 11:20:26 +02:00
Andreas Kling	de424d6879	LibJS: Make Completion.[[Value]] non-optional Instead, just use js_undefined() whenever the [[Value]] field is unused. This avoids a whole bunch of presence checks.	2025-04-05 11:20:26 +02:00
Andreas Kling	7477002e46	LibJS: Keep parsed function parameters in a shared data structure Instead of making a copy of the Vector<FunctionParameter> from the AST every time we instantiate an ECMAScriptFunctionObject, we now keep the parameters in a ref-counted FunctionParameters object. This reduces memory usage, and also allows us to cache the bytecode executables for default parameter expressions without recompiling them for every instantiation. :^)	2025-03-27 15:00:43 +00:00
Andreas Kling	46a5710238	LibJS: Use FlyString in PropertyKey instead of DeprecatedFlyString This required dealing with substantial fallout.	2025-03-24 22:27:17 +00:00
Timothy Flynn	85b424464a	AK+Everywhere: Rename `verify_cast` to `as` Follow-up to `fc20e61e72`.	2025-01-21 11:34:06 -05:00
Timothy Flynn	5ea0aa5f08	LibJS: Bring the explicit resource management implementation up to date While we don't yet have a working `using` implementation with our byte code, we can still keep our DisposableStack implementation up to date. The changes brought in here are all editorial, and set us up to start an AsyncDisposableStack implementation.	2025-01-17 20:46:32 +01:00
Shannon Booth	d48a0aaa55	LibJS: Remove unneeded FIXMEs for suspending an execution context From what I understand, the suspension steps are not required now, or in the future for our implementation, or any other. The intent is already implemented in the spec pushing on another execution context to the stack and leaving the running execution context as-is. The resume steps are a slightly different story as there is some subtle behavior which the spec is trying to convey where some custom logic may need to be done when one execution context changes from one to another. It may be worth implementing those steps at a later point in time so that this behavior is a bit easier to follow in those cases. To make the situation more confusing - from what I can gather from the spec, not all cases that the spec mentions resume actually means anything normative. Resume is only _actually_ needed in a limited set of locations. For now, let's just remove the unneeded FIXMEs that indicate that there is something to be done for the suspension steps, as there is not, and leave the resume steps as is.	2025-01-02 11:30:04 +01:00
Timothy Flynn	27478ec7d4	Everywhere: Run clang-format The following command was used to clang-format these files: clang-format-19 -i $(find . -not \( -path "./\.*" -prune \) -not \( -path "./Build/*" -prune \) -not \( -path "./Toolchain/*" -prune \) -type f -name "*.cpp" -o -name "*.mm" -o -name "*.h")	2024-12-28 05:39:32 -08:00
Luke Wilde	6319dedbcd	LibJS: Perform TLA async function construction in the module context Previously it was only pushing the module context for the call to capture the module execution context. This is incorrect, as the capture occurs upon function construction. This resulted in it capturing the execution context that execute_module was called from, instead of the newly created module_context. `f87041bf3a/Libraries/LibJS/Runtime/ECMAScriptFunctionObject.cpp (L92)` This can be demonstrated with the following setup: index.html: ```html <script> var foo = 1; </script> <script type="module"> import {test} from "./scriptA.mjs"; </script> ``` scriptA.mjs: ```js function foo() { return {a: "b"}; } export let test = await foo(); ``` Before this fix, this would throw: ``` [TypeError] 1 is not a function (evaluated from 'foo') at module code with top-level await at module code with top-level await at <unknown> at <unknown> ``` Fixes #2245.	2024-11-15 18:52:22 +01:00
Shannon Booth	f87041bf3a	LibGC+Everywhere: Factor out a LibGC from LibJS Resulting in a massive rename across almost everywhere! Alongside the namespace change, we now have the following names: * JS::NonnullGCPtr -> GC::Ref * JS::GCPtr -> GC::Ptr * JS::HeapFunction -> GC::Function * JS::CellImpl -> GC::Cell * JS::Handle -> GC::Root	2024-11-15 14:49:20 +01:00
Shannon Booth	1e54003cb1	LibJS+LibWeb: Rename Heap::allocate_without_realm to Heap::allocate Now that the heap has no knowledge about a JavaScript realm and is purely for managing the memory of the heap, it does not make sense to name this function to say that it is a non-realm variant.	2024-11-13 16:51:44 -05:00
Timothy Flynn	93712b24bf	Everywhere: Hoist the Libraries folder to the top-level	2024-11-10 12:50:45 +01:00

31 commits
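
The slot layout change described in commit 4d92c4d71a can be pictured with a small standalone sketch. This is not LibJS code: `SlotCounts`, `Frame`, `make_frame`, `copy_constants`, and the two marker values are hypothetical names invented for illustration, plain `int` stands in for `Value`, and the arguments region mentioned in the commit is left out. The exact form of the 29-bit bound is also an assumption. The sketch only shows why putting constants last lets the constructor touch just the `registers + locals` prefix.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical stand-ins; these are not LibJS types.
struct SlotCounts {
    std::uint32_t registers;
    std::uint32_t locals;
    std::uint32_t constants;
};

struct Frame {
    std::vector<int> slots;     // models the trailing array of Values
    std::uint32_t empty_writes; // slots written by the "constructor"
};

constexpr int empty_marker = -1;     // models the special empty value
constexpr int untouched_marker = -2; // models memory the constructor never wrote

// Assumption: every slot index must fit in a 29-bit operand field.
constexpr std::uint64_t max_slot_count = 1ull << 29;

// Sum in 64 bits so the check itself cannot overflow.
inline std::uint32_t total_slots(SlotCounts c)
{
    std::uint64_t total = std::uint64_t(c.registers) + c.locals + c.constants;
    assert(total <= max_slot_count);
    return static_cast<std::uint32_t>(total);
}

// Layout: [registers | locals | constants]. Only the prefix is initialized.
inline Frame make_frame(SlotCounts c)
{
    Frame frame { std::vector<int>(total_slots(c), untouched_marker), 0 };
    std::uint32_t prefix = c.registers + c.locals;
    for (std::uint32_t i = 0; i < prefix; ++i) {
        frame.slots[i] = empty_marker;
        ++frame.empty_writes;
    }
    return frame;
}

// Models the interpreter copying constants in before execution begins.
inline void copy_constants(Frame& frame, SlotCounts c, std::vector<int> const& constants)
{
    assert(constants.size() == c.constants);
    std::uint32_t base = c.registers + c.locals;
    for (std::uint32_t i = 0; i < c.constants; ++i)
        frame.slots[base + i] = constants[i];
}
```

With 2 registers, 3 locals, and 4 constants, `make_frame` writes 5 slots rather than 9, and the constant region starts at index 5. In the older `[registers | constants | locals]` order the slots needing initialization were split around the constants, so a single prefix loop would not have covered them.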