ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2026-04-19 02:10:26 +00:00

Author	SHA1	Message	Date
Andreas Kling	31606fddd3	LibJS: Add Mov2/Mov3 instructions to reduce dispatch overhead Add Mov2 and Mov3 bytecode instructions that perform 2 or 3 register moves in a single dispatch. A peephole optimization pass during bytecode assembly merges consecutive Mov instructions within each basic block into these combined instructions. When merging, identical Movs are deduplicated (e.g. two identical Movs become a single Mov, not a Mov2). This optimization is implemented in both the C++ and Rust codegen pipelines. The goal is to reduce the per-instruction dispatch overhead, which is significant compared to the actual cost of moving a value. This isn't fancy or elegant, but provides a real speed-up on many workloads. As an example, Kraken/imaging-desaturate.js improves by ~1.07x on my laptop.	2026-03-11 17:04:32 +01:00
InvalidUsernameException	bb762fb43b	LibJS: Do not assume arguments cannot be clobbered `copy_if_needed_to_preserve_evaluation_order` was introduced in `c372a084a2`. At that point function arguments still needed to be copied into registers with a special `GetArgument` instruction. Later, in `3f04d18ef7` this was changed and arguments were made their own operand type that can be accessed directly instead. Similar to locals, arguments can also be overwritten due to evaluation order in various scenarios. However, the function was never updated to account for that. Rectify that here. With this change, https://volkswagen.de no longer gets blanked shortly after initial load and the unhandled JS exception spam on that site is gone too.	2026-03-08 15:01:07 +01:00
InvalidUsernameException	34b7cb6e55	LibJS: Explicitly handle all operand types when determining clobbering The last time a new operand type was added, the effects from that on the function changed in this commit were seemingly not properly considered, introducing a bug. To avoid such errors in the future, rewrite the code to produce a compile-time error if new operand types are added. No functional changes yet, the actual bugfix will be in a followup-commit.	2026-03-08 15:01:07 +01:00
Andreas Kling	54a1a66112	LibJS: Store cache pointers directly in bytecode instructions Instead of storing a u32 index into a cache vector and looking up the cache at runtime through a chain of dependent loads (load Executable, load vector data pointer, multiply index, add), store the actual cache pointer as a u64 directly in the instruction stream. A fixup pass (Executable::fixup_cache_pointers()) runs after Executable construction in both the Rust and C++ pipelines, walking the bytecode and replacing each index with the corresponding pointer. The cache pointer type is encoded in Bytecode.def (e.g. PropertyLookupCache, GlobalVariableCache*) so the fixup switch is auto-generated by the Python Op code generator, making it impossible to forget updating the fixup when adding new cached instructions. This eliminates 3-4 dependent loads on every inline cache access in both the C++ interpreter and the assembly interpreter.	2026-03-08 10:27:13 +01:00
Andreas Kling	56e09695e0	LibJS: Consolidate Put bytecode instructions and reduce code bloat Replace 20 separate Put instructions (5 PutKinds x 4 forms) with 4 unified instructions (PutById, PutByIdWithThis, PutByValue, PutByValueWithThis), each carrying a PutKind field at runtime instead of being a separate opcode. This reduces the number of handler entry points in the dispatch loop and eliminates template instantiations of put_by_property_key and put_by_value that were being duplicated 5x each when inlined by LTO.	2026-03-04 18:53:12 +01:00
Andreas Kling	176a618fce	LibJS: Don't emit dead code after Throw for invalid LHS expressions When the left-hand side of an assignment, update, or for-in loop is invalid (e.g. `foo() = "bar"`), the bytecode generator emits a Throw instruction. Previously, it would also create a dead basic block after the Throw, resulting in unreachable instructions in the output. Fix this by returning early from the relevant codegen paths after emitting the Throw, and by guarding for-in/for-of body generation with an is_current_block_terminated() check.	2026-03-01 21:20:54 +01:00
Andreas Kling	234203ed9b	LibJS: Ensure deterministic ordering in scope analysis and codegen The scope collector uses HashMaps for identifier groups and variables, which means their iteration order is non-deterministic. This causes local variable indices and function declaration instantiation (FDI) bytecode to vary between runs. Fix this by sorting identifier group keys alphabetically before assigning local variable indices, and sorting vars_to_initialize by name before emitting FDI bytecode. Also make register allocation deterministic by always picking the lowest-numbered free register instead of whichever one happens to be at the end of the free list. This is preparation for bringing in a new source->bytecode pipeline written in Rust. Checking for regressions is significantly easier if we can expect identical output from both pipelines.	2026-02-24 09:39:42 +01:00
Andreas Kling	cd2576c031	LibJS: Mark block-scoped function declaration locals as initialized When emitting block declaration instantiation, we were not calling set_local_initialized() after writing block-scoped function declarations to local variables via Mov. This caused unnecessary ThrowIfTDZ checks to be emitted when those locals were later read. Block-scoped function declarations are always initialized at block entry (via NewFunction + Mov), so TDZ checks for them are redundant.	2026-02-19 02:45:37 +01:00
Andreas Kling	47e552e8fd	LibJS: Consolidate TDZ check emission into Generator helper Move the duplicated ThrowIfTDZ emission logic from three places in ASTCodegen.cpp into a single Generator::emit_tdz_check_if_needed() helper. This handles both argument TDZ (which requires a Mov to empty first) and lexically-declared variable TDZ uniformly. This avoids emitting some unnecessary ThrowIfTDZ instructions.	2026-02-17 20:44:57 +01:00
Andreas Kling	19bf3f9479	LibJS: Use a forward cursor for source map lookup during compilation The find_source_record lambda was doing a reverse linear scan through the entire source map for every instruction emitted, resulting in quadratic behavior. This was catastrophic for large scripts like Octane/mandreel.js, where compile() dominated the profile at ~30s. Since both source map entries and instruction iteration are ordered by offset, replace the per-instruction scan with a forward cursor that advances in lockstep with instruction emission.	2026-02-16 20:41:02 +01:00
Andreas Kling	1d145eec72	LibJS: Fix phantom source map entries from assembly-time optimizations The compile() function was adding source map entries for all instructions in a block upfront, before processing assembly-time optimizations (Jump-to-next-block elision, Jump-to-Return/End inlining, JumpIf-to-JumpTrue/JumpFalse conversion). When a Jump was skipped, its phantom source map entry remained at the offset where the next block's first instruction would be placed, causing binary_search to find the wrong source location for error messages. Fix by building source map entries inline with instruction emission, ensuring only actually-emitted instructions get entries. For blocks with duplicate source map entries at the same offset (from rewind in fuse_compare_and_jump), the last entry is used.	2026-02-15 23:21:46 +01:00
Andreas Kling	2dca137d9e	LibJS: Handle ThisExpression in expression_identifier() Add ThisExpression handling to the expression_identifier() helper used for base_identifier in bytecode instructions. This makes PutById and GetById emit base_identifier:this when the base is a this expression.	2026-02-15 23:21:46 +01:00
Andreas Kling	49f2f1e7cd	LibJS: Skip unnecessary Mov in emit_load_from_reference for reads When MemberExpression::generate_bytecode calls emit_load_from_reference, it only uses the loaded_value and discards the reference operands. For computed member expressions (e.g. a[0]), this was generating an unnecessary Mov to save the property register for potential store-back. Add a ReferenceMode parameter to emit_load_from_reference. When LoadOnly is passed, the computed property path skips the register save and Mov.	2026-02-15 23:21:46 +01:00
Andreas Kling	c0f38c82d8	LibJS: Fix evaluation order in array destructuring assignment Per AssignmentRestElement and AssignmentElement in the specification, the DestructuringAssignmentTarget reference must be evaluated before iterating or stepping the iterator. We were doing it in the wrong order, which caused observable differences when the target evaluation has side effects, and could lead to infinite loops when the iterator never completes. Add Generator::emit_evaluate_reference() to evaluate a member expression's base and property into ReferenceOperands without performing a load or store, then use the pre-evaluated reference for the store after iteration completes.	2026-02-15 23:21:46 +01:00
Andreas Kling	af57184627	LibJS: Fix scoping of function declarations with destructured params When a function has parameter expressions (e.g. destructured params with defaults), CreateVariableEnvironment creates a separate variable environment for function declarations and sets it as the current lexical environment at runtime. However, the bytecode generator's m_lexical_environment_register_stack was not updated to reflect this, so subsequent CreateLexicalEnvironment ops would parent themselves to the old (pre-variable-environment) lexical environment, skipping the variable environment entirely. This meant function declarations hoisted into the variable environment were invisible to closures created in the function body. Fix this by capturing the new lexical environment into a register after CreateVariableEnvironment and pushing it onto the environment register stack. This fixes a problem where https://tumblr.com/ wouldn't load the feed.	2026-02-12 16:59:47 +01:00
Andreas Kling	322ad1363e	LibJS: Throw ReferenceError for invalid assignment targets like foo()=x CallExpression is accepted as an assignment target for web compatibility (Annex B), but must throw ReferenceError at runtime. We were incorrectly throwing TypeError with a TODO message. Replace emit_todo() calls in three codegen paths (simple assignment, compound assignment/update, and for-in/of) with proper ReferenceError using the "Invalid left-hand side in assignment" message, matching the behavior of V8 and JSC.	2026-02-12 11:37:43 +01:00
Andreas Kling	7281091fdb	LibJS: Make bytecode generation infallible Remove CodeGenerationError and make all bytecode generation functions return their results directly instead of wrapping them in CodeGenerationErrorOr. For the few remaining sites where codegen encounters an unimplemented or unexpected AST node, we now use a new emit_todo() helper that emits a NewTypeError + Throw sequence at compile time (preserving the runtime behavior) and then switches to a dead basic block so subsequent codegen for the same function can continue without issue. This allows us to remove error handling from all callers of the bytecode compiler, simplifying the code significantly.	2026-02-12 11:37:43 +01:00
Andreas Kling	20f50d1f71	LibJS: Convert builtin validation codegen errors to VERIFY These checks validate engine-internal usage of builtin abstract operations (arity, argument types, known operation names), not user JS code. Replace CodeGenerationError returns with VERIFY() assertions: - Spread argument check becomes VERIFY(!argument.is_spread) - Arity checks become VERIFY(arguments.size() == N) - StringLiteral type checks become VERIFY(message) - Unknown operation/constant fallthroughs become VERIFY_NOT_REACHED()	2026-02-12 11:37:43 +01:00
Andreas Kling	de1b6d4f07	LibJS: Convert unreachable codegen error sites to VERIFY Replace CodeGenerationError returns with VERIFY_NOT_REACHED() or VERIFY() at sites that are provably unreachable: - Non-computed member expression fallbacks in emit_load_from_reference, emit_store_to_reference, and emit_delete_reference (member expression properties are always computed, identifier, or private identifier) - Two non-computed member expression fallbacks in AssignmentExpression - Default case in compound assignment switch (all 15 AssignmentOp values are handled) - BindingPattern Empty/Expression name+alias pair (computed property names always require an alias) - Two assignment+destructuring combinations in for-in/of body evaluation (is_destructuring is only set for VariableDeclaration lhs, which always has VarBinding or LexicalBinding kind, never Assignment)	2026-02-12 11:37:43 +01:00
Andreas Kling	e308e73120	LibJS: Move SharedFunctionInstanceData creation out of FunctionNode Add static factory methods create_for_function_node() on SharedFunctionInstanceData and update all callers to use them instead of FunctionNode::ensure_shared_data(). This removes the GC::Root<SharedFunctionInstanceData> cache from FunctionNode, eliminating the coupling between the RefCounted AST and GC-managed runtime objects. The cache was effectively dead code: hoisted declarations use m_functions_to_initialize directly, and function expressions always create fresh instances during codegen.	2026-02-11 23:57:41 +01:00
Andreas Kling	4c7a349b62	LibJS: Remove #include <AST.h> from SharedFunctionInstanceData.h Extract FunctionParsingInsights into its own header and introduce FunctionLocal as a standalone mirror of Identifier::Local. This allows SharedFunctionInstanceData.h to avoid pulling in the full AST type hierarchy, reducing transitive include bloat. The AST.h include is kept in SharedFunctionInstanceData.cpp where it's needed for the constructor that accesses AST node types.	2026-02-11 23:57:41 +01:00
Andreas Kling	712d3fc54f	LibJS: Pre-compute ScopeNode queries in SharedFunctionInstanceData Pre-compute the data that emit_function_declaration_instantiation previously obtained by querying ScopeNode methods at codegen time: - m_has_scope_body: whether ecmascript_code is a ScopeNode - m_has_non_local_lexical_declarations: from ScopeNode query - m_lexical_bindings: non-local lexically-scoped identifier names and their constant-declaration status After this change, emit_function_declaration_instantiation no longer casts m_ecmascript_code to ScopeNode or calls any ScopeNode methods.	2026-02-11 23:57:41 +01:00
Andreas Kling	d36521a698	LibJS: Replace m_functions_to_initialize with pre-created data Replace Vector<FunctionDeclaration const&> with a FunctionToInitialize struct that stores a pre-created SharedFunctionInstanceData, function name, and local index. The SharedFunctionInstanceData for each hoisted function is created eagerly during the parent's construction, removing the need to reference FunctionDeclaration AST nodes after construction.	2026-02-11 23:57:41 +01:00
Andreas Kling	7cc392551b	LibJS: Replace VariableNameToInitialize with value-type VarBinding Replace VariableNameToInitialize (which holds Identifier const&) with a VarBinding struct that stores pre-extracted values: name, local index, parameter_binding, and function_name. This removes a reference to AST Identifier nodes from SharedFunctionInstanceData, allowing the AST to be freed after compilation.	2026-02-11 23:57:41 +01:00
Andreas Kling	6decb93dd7	LibJS: Populate ClassBlueprint during codegen Build a ClassBlueprint from ClassExpression elements at codegen time: - Methods/getters/setters: register SharedFunctionInstanceData from the method's FunctionExpression - Field initializers with literal values (numbers, booleans, null, strings, negated numbers): store the value directly, avoiding function creation entirely - Field initializers with non-literal values: wrap in ClassFieldInitializerStatement and create SharedFunctionInstanceData - Static initializers: create SharedFunctionInstanceData from the function body - Constructor: register SharedFunctionInstanceData from the constructor's FunctionExpression Add public accessors to ClassMethod::function() and StaticInitializer::function_body() for codegen access. The blueprint is registered but not yet used by NewClass (dual path). No behavioral change.	2026-02-11 23:57:41 +01:00
Andreas Kling	6b0003b057	LibJS: Pre-create SharedFunctionInstanceData in NewFunction Replace the FunctionNode const& stored on the NewFunction bytecode instruction with an index into a table of pre-created SharedFunctionInstanceData objects on the Executable. During bytecode compilation, we now eagerly create SharedFunctionInstanceData for each function that will be instantiated by NewFunction, and store it on both the FunctionNode (for caching) and the Executable (for GC tracing). At runtime, NewFunction simply looks up the SharedFunctionInstanceData by index and calls create_from_function_data() directly, bypassing the AST entirely. This removes one of the main reasons the AST had to stay alive after compilation. The instantiate_ordinary_function_expression() helper in Interpreter.cpp is removed as its non-trivial code path (creating a scope for named function expressions) was dead code -- it was only called when !has_name(), so the has_own_name branch never executed.	2026-02-11 23:57:41 +01:00
Andreas Kling	658ba1d023	LibJS: Clear compile-only data from SharedFunctionInstanceData After successful bytecode compilation, the m_functions_to_initialize and m_var_names_to_initialize_binding vectors are no longer needed as they are only consumed by emit_function_declaration_instantiation() during code generation. Add clear_compile_inputs() to release these vectors post-compile, and call it from both ECMAScriptFunctionObject::get_stack_frame_size() and NativeJavaScriptBackedFunction::bytecode_executable() after their respective lazy compilation succeeds. Also add a pre-compile assertion in Generator::generate_from_function() to verify we never try to compile the same function data twice, and a VERIFY in ECMAScriptFunctionObject::ecmascript_code() to guard against null dereference.	2026-02-11 23:57:41 +01:00
Andreas Kling	933eee8284	LibJS: Throw ReferenceError for delete super[...] at codegen time delete super.x and delete super[expr] always throw a ReferenceError per spec. Instead of deferring this to runtime via DeleteByIdWithThis and DeleteByValueWithThis instructions, emit the throw directly during bytecode generation. Remove the now-unused DeleteByIdWithThis and DeleteByValueWithThis instructions, and add a NewReferenceError instruction.	2026-02-11 14:29:36 +01:00
Andreas Kling	479b89aa6d	LibJS: Fix UpdateEmpty completion value semantics for loops/switch/if When a loop or switch body produces an abrupt completion (break or continue) with an empty value, the ES spec requires UpdateEmpty to replace the empty value with the last non-empty completion value V. The bytecode compiler was failing to do this because it only updated the completion register after body codegen, guarded by !is_current_block_terminated(). When break/continue terminated the block, the update was skipped. Fix this with three changes: 1. Introduce a CompletionRegisterScope that tells ScopeNode::generate_bytecode to eagerly emit Mov instructions into the completion register after each value-producing statement. This ensures the register is up to date before any break or continue fires. 2. Give IfStatement its own CompletionRegisterScope (initialized to undefined) during branch evaluation. This models the spec's UpdateEmpty(stmtCompletion, undefined) for if-statements: when break/continue fires inside an if-branch, the scoped jump propagation sees that the if's completion register differs from the loop's and emits a Mov, correctly replacing the eagerly written value with undefined. Without this, code like { 3; if (true) { break; } else { } } would incorrectly carry the value 3 instead of undefined through the break. 3. Capture loop body results and emit a fallback Mov for non-ScopeNode bodies (e.g. bare expression statements like do x=1; while(false)) that don't participate in the eager CompletionRegisterScope update mechanism. For labelled break/continue that cross loop boundaries, the jump codegen now propagates the inner completion register to the target scope's completion register before emitting the jump. Also fix ForStatement to use a proper completion register (previously it returned the body result directly, which was wrong for empty bodies and break-with-no-value cases).	2026-02-11 14:29:36 +01:00
pwespi	6c471c5ef7	LibJS: Do not allow reassignment to local const variable	2026-02-09 21:06:46 +01:00
Andreas Kling	720fd567b1	LibJS: Collapse handler/finalizer into single exception handler target After replacing the runtime unwind context stack with explicit completion records for try/finally dispatch, the distinction between "handler" (catch) and "finalizer" (finally) in the exception handler table is no longer meaningful at runtime. handle_exception() checked handler first, then finalizer, but they did the exact same thing (set the PC). When both were present, the finalizer was dead code. Collapse both fields into a single handler_offset (now non-optional, since an entry always has a target), remove the finalizer concept from BasicBlock, UnwindContext, and ExceptionHandlers, and simplify handle_exception() to a direct assignment.	2026-02-09 16:35:39 +01:00
Andreas Kling	cbca493b28	LibJS: Remove BlockBoundaryType::Unwind With LeaveUnwindContext gone, the Unwind boundary type has no purpose. Remove it from the enum and all start/end boundary calls.	2026-02-09 16:35:39 +01:00
Andreas Kling	5abe40874a	LibJS: Remove LeaveUnwindContext opcode LeaveUnwindContext popped the runtime unwind context stack. With the stack being removed, all emission sites become dead code. Remove the opcode and all its emissions.	2026-02-09 16:35:39 +01:00
Andreas Kling	7f89158d20	LibJS: Replace implicit environment stack with explicit registers Replace the saved_lexical_environments stack in ExecutionContextRareData with explicit register-based environment tracking. Environments are now stored in registers and restored via SetLexicalEnvironment, making the environment flow visible in bytecode. Key changes: - Add GetLexicalEnvironment and SetLexicalEnvironment opcodes - CreateLexicalEnvironment takes explicit parent and dst operands - EnterObjectEnvironment stores new environment in a dst register - NewClass takes an explicit class_environment operand - Remove LeaveLexicalEnvironment opcode (instead: SetLexicalEnvironment) - Remove saved_lexical_environments from ExecutionContextRareData - Use a reserved register for the saved lexical environment to avoid dominance issues with lazily-emitted GetLexicalEnvironment	2026-02-09 16:35:39 +01:00
Andreas Kling	a439dc8490	LibJS: Use explicit completion records for try/finally dispatch Each finally scope gets two registers (completion_type and completion_value) that form an explicit completion record. Every path into the finally body sets these before jumping, and a dispatch chain after the finally body routes to the correct continuation. This replaces the old implicit protocol that relied on the exception register, a saved_return_value register, and a scheduled_jump field on ExecutionContext, allowing us to remove: - 5 opcodes (ContinuePendingUnwind, ScheduleJump, LeaveFinally, RestoreScheduledJump, PrepareYield) - 1 reserved register (saved_return_value) - 2 ExecutionContext fields (scheduled_jump, previously_scheduled_jumps)	2026-02-09 08:51:12 +01:00
Andreas Kling	7997267942	LibJS: Remove outdated FIXME comments about ToPropertyKey ordering The FIXME comments suggested that ToPropertyKey was called at the wrong time for computed super property access. However, extensive testing shows that both Ladybird and V8 implement the correct ordering according to the ECMA262 specification. Remove the outdated FIXME comments and add comprehensive test coverage for super property computed keys with Symbol.toPrimitive to prevent regressions.	2026-02-09 01:23:48 +01:00
dosisod	2c3077b878	LibJS: Dead code elimination for always truthy/falsey conditions This improves and expands the ability to do dead code elimination on conditions which are always truthy or falsey. The following cases are now optimized: * `if (true){}` -> Only emit `if` block, ignore `else` * `if (false){}` -> Only emit `else if`/`else` block * `while (false){}` -> Ignore `while` loop entirely * `for (x;false;){}` -> Only emit `x` (if it exists), skip `for` block * Ternary -> Directly return left/right hand side if condition is const	2026-01-31 18:22:40 +01:00
Andreas Kling	81bee185e6	LibJS: Replace source map HashMap with sorted Vector Bytecode source map entries are always added in order of increasing bytecode offset, and lookups only happen during error handling (a cold path). This makes a sorted vector with binary search a better fit than a hash map. This change reduces memory overhead and speeds up bytecode generation by avoiding hash table operations during compilation. Lookups remain fast via binary search, and since source_range_at() is only called when generating stack traces, the O(log n) lookup is acceptable.	2026-01-26 19:37:42 +01:00
dosisod	ac8cc6d24b	LibJS: Constant fold `LogicalExpression` Logical expressions like `true || false` are now constant folded. This also allows for dead code elimination if we know the right-hand side of the expression will never be evaluated (such as `false && f()` or `true || f()`). In the test suites, the values are now being constant folded at compile time. To ensure that the actual evaluation logic is being called properly, I had to duplicate the tests and call them via a function so the compiler would not optimize the evaluation logic away. This also demotes `NaN` and `Infinity` identifiers to `nan` and `inf` double literals, which will further help with const folding.	2026-01-22 08:47:18 +01:00
Andreas Kling	4d92c4d71a	LibJS: Skip initializing constant slots in ExecutionContext Every function call allocates an ExecutionContext with a trailing array of Values for registers, locals, constants, and arguments. Previously, the constructor would initialize all slots to js_special_empty_value(), but constant slots were then immediately overwritten by the interpreter copying in values from the Executable before execution began. To eliminate this redundant initialization, we rearrange the layout from [registers | constants | locals] to [registers | locals | constants]. This groups registers and locals together at the front, allowing us to initialize only those slots while leaving constant slots uninitialized until they're populated with their actual values. This reduces the per-call initialization cost from O(registers + locals + constants) to O(registers + locals). Also tightens up the types involved (size_t -> u32) and adds VERIFYs to guard against overflow when computing the combined slot counts, and to ensure the total fits within the 29-bit operand index field.	2026-01-19 10:48:12 +01:00
Andreas Kling	505fe0a977	LibJS: Add shape caching for object literal instantiation When a function creates object literals with simple property names, we now cache the resulting shape after the first instantiation. On subsequent calls, we create the object with the cached shape directly and write property values at their known offsets. This avoids repeated shape transitions and property offset lookups for a common JavaScript pattern. The optimization uses two new bytecode instructions: - CacheObjectShape: Captures the final shape after object construction - InitObjectLiteralProperty: Writes properties using cached offsets Only "simple" object literals are optimized (string literal keys with simple value expressions). Complex cases like computed properties, getters/setters, and spread elements use the existing slow path. 3.4x speedup on a microbenchmark that repeatedly instantiates an object literal with 26 properties. Small progressions on various benchmarks.	2026-01-10 00:56:51 +01:00
Luke Wilde	c4c9ac08ad	LibJS: Follow the spec more closely for tagged template literals This resolves a FIXME in its code generation, particularly for: - Caching the template object - Setting the correct property attributes - Freezing the resulting objects This allows archive.org to load, which uses the Lit library. The Lit library caches these template objects to determine if a template has changed, allowing it to determine to do a full template rerender or only partially update the rendering. Before, we would always cause a full rerender on update because we didn't return the same template object. This caused issues with archive.org's code, I believe particularly with its router library, where we would constantly detach and reattach nodes unexpectedly, ending up with the page content not being attached to the router's custom element.	2026-01-06 23:25:36 +01:00
Andreas Kling	ece0b72e3c	LibJS: Don't set [[HomeObject]] for non-method object properties This fixes an issue where we'd incorrectly retain objects via the [[HomeObject]] slot. This common pattern was affected: Object.defineProperty(o, "foo", { get: function() { return 123; } }); Above, the object literal would get assigned to the [[HomeObject]] slot even though "get" is not a "method" per the spec. This frees about 30,000 objects on my x.com home feed.	2025-12-17 12:50:17 -06:00
Andreas Kling	a62daf2a88	LibJS: Remove redundant PutByNumericId instructions These were helpful when PropertyKey instantiation happened in the interpreter, but now that we've moved it to bytecode generation time, we can use the basic PutById instructions instead.	2025-12-11 14:34:45 -06:00
Andreas Kling	bad16dc0e0	LibJS: Cache fully-formed PropertyKeys in Executable Instead of creating PropertyKeys on the fly during interpreter execution, we now store fully-formed ones in the Executable. This avoids a whole bunch of busywork in property access instructions and substantially reduces code size bloat.	2025-12-11 14:34:45 -06:00
Luke Wilde	0eceee0a05	LibJS: Replace Array.fromAsync with a native JavaScript implementation This allows us to use the bytecode implementation of await, which correctly suspends execution contexts and handles completion injections. This gains us 4 test262 tests around mutating Array.fromAsync's iterable whilst it's suspended as well. This is also one step towards removing spin_until, which the non-bytecode implementation of await uses. ``` Duration: -5.98s Summary: Diff Tests: +4 ✅ -4 ❌ Diff Tests: [...]/Array/fromAsync/asyncitems-array-add-to-singleton.js ❌ -> ✅ [...]/Array/fromAsync/asyncitems-array-add.js ❌ -> ✅ [...]/Array/fromAsync/asyncitems-array-mutate.js ❌ -> ✅ [...]/Array/fromAsync/asyncitems-array-remove.js ❌ -> ✅ ```	2025-11-30 11:54:54 +01:00
Luke Wilde	a63b0cfaba	LibJS: Introduce NativeJavaScriptBackedFunction This hosts the ability to compile and run JavaScript to implement native functions. This is particularly useful for any native function that is not a normal function, for example async functions such as Array.fromAsync, which require yielding. These functions are not allowed to observe anything from outside their environment. Any global identifiers will instead be assumed to be a reference to an abstract operation or a constant. The generator will inject the appropriate bytecode if the name of the global identifier matches a known name. Anything else will cause a code generation error.	2025-11-30 11:54:54 +01:00
Luke Wilde	354888640d	LibJS/Bytecode: Make compilation use SharedFunctionInstanceData instead All the data we need for compilation is in SharedFunctionInstanceData, so we shouldn't depend on ECMAScriptFunctionObject. Allows NativeJavaScriptBackedFunction to compile bytecode.	2025-11-30 11:54:54 +01:00
Andreas Kling	003589db2d	LibJS: Generate C++ bytecode instruction classes from a definition file This commit adds a new Bytecode.def file that describes all the LibJS bytecode instructions. From this, we are able to generate the full declarations for all C++ bytecode instruction classes, as well as their serialization code. Note that some of the bytecode compiler was updated since instructions no longer have default constructor arguments. The big immediate benefit here is that we lose a couple thousand lines of hand-written C++ code. Going forward, this also allows us to do more tooling for the bytecode VM, now that we have an authoritative description of its instructions. Key things to know about: - Instructions can inherit from one another. At the moment, everything simply inherits from the base "Instruction". - @terminator means the instruction terminates a basic block. - @nothrow means the instruction cannot throw. This affects how the interpreter interacts with it. - Variable-length instructions are automatically supported. Just put an array of something as the last field of the instruction. - The m_length field is magical. If present, it will be populated with the full length of the instruction. This is used for variable-length instructions.	2025-11-21 09:46:03 +01:00
Luke Wilde	d0ef1aad2d	LibJS/Bytecode: Merge adjacent exception handlers For example, this: ``` Exception handlers: from 678 to 698 handler 658 finalizer 0 from 698 to 6f8 handler 658 finalizer 0 from 6f8 to 708 handler 658 finalizer 0 from 708 to 750 handler 658 finalizer 0 from 750 to 788 handler 658 finalizer 0 from 788 to 7a0 handler 658 finalizer 0 from 7a0 to 7a8 handler 658 finalizer 0 ``` Becomes: ``` Exception handlers: from 678 to 7a8 handler 658 finalizer 0 ```	2025-11-07 09:57:06 +01:00
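
The handler merge described in commit d0ef1aad2d can be sketched roughly as follows. This is an illustration only, not the LibJS implementation: `HandlerRange` and `merge_adjacent_handlers` are hypothetical names, the fields simply mirror the columns of the "Exception handlers" dump, and the input is assumed to be sorted by start offset.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for one row of the "Exception handlers" dump;
// not the actual LibJS type.
struct HandlerRange {
    std::uint32_t start;     // "from" offset as printed in the dump
    std::uint32_t end;       // "to" offset as printed in the dump
    std::uint32_t handler;   // "handler" offset
    std::uint32_t finalizer; // "finalizer" offset
};

// Merges neighbouring ranges that touch (previous end == next start) and
// share the same handler and finalizer. Assumes the input is sorted by
// start offset; ranges that do not touch are left as separate entries.
std::vector<HandlerRange> merge_adjacent_handlers(std::vector<HandlerRange> const& ranges)
{
    std::vector<HandlerRange> merged;
    for (auto const& range : ranges) {
        if (!merged.empty()) {
            auto& last = merged.back();
            if (last.end == range.start
                && last.handler == range.handler
                && last.finalizer == range.finalizer) {
                last.end = range.end; // extend the previous range in place
                continue;
            }
        }
        merged.push_back(range);
    }
    return merged;
}
```

Fed the seven ranges from the commit message (0x678 through 0x7a8, all with handler 0x658 and finalizer 0), this sketch returns a single range from 0x678 to 0x7a8.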

85 commits