ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2026-04-19 02:10:26 +00:00

Author	SHA1	Message	Date
Andreas Kling	47e552e8fd	LibJS: Consolidate TDZ check emission into Generator helper Move the duplicated ThrowIfTDZ emission logic from three places in ASTCodegen.cpp into a single Generator::emit_tdz_check_if_needed() helper. This handles both argument TDZ (which requires a Mov to empty first) and lexically-declared variable TDZ uniformly. This avoids emitting some unnecessary ThrowIfTDZ instructions.	2026-02-17 20:44:57 +01:00
Andreas Kling	49f2f1e7cd	LibJS: Skip unnecessary Mov in emit_load_from_reference for reads When MemberExpression::generate_bytecode calls emit_load_from_reference, it only uses the loaded_value and discards the reference operands. For computed member expressions (e.g. a[0]), this was generating an unnecessary Mov to save the property register for potential store-back. Add a ReferenceMode parameter to emit_load_from_reference. When LoadOnly is passed, the computed property path skips the register save and Mov.	2026-02-15 23:21:46 +01:00
Andreas Kling	c0f38c82d8	LibJS: Fix evaluation order in array destructuring assignment Per AssignmentRestElement and AssignmentElement in the specification, the DestructuringAssignmentTarget reference must be evaluated before iterating or stepping the iterator. We were doing it in the wrong order, which caused observable differences when the target evaluation has side effects, and could lead to infinite loops when the iterator never completes. Add Generator::emit_evaluate_reference() to evaluate a member expression's base and property into ReferenceOperands without performing a load or store, then use the pre-evaluated reference for the store after iteration completes.	2026-02-15 23:21:46 +01:00
Andreas Kling	7281091fdb	LibJS: Make bytecode generation infallible Remove CodeGenerationError and make all bytecode generation functions return their results directly instead of wrapping them in CodeGenerationErrorOr. For the few remaining sites where codegen encounters an unimplemented or unexpected AST node, we now use a new emit_todo() helper that emits a NewTypeError + Throw sequence at compile time (preserving the runtime behavior) and then switches to a dead basic block so subsequent codegen for the same function can continue without issue. This allows us to remove error handling from all callers of the bytecode compiler, simplifying the code significantly.	2026-02-12 11:37:43 +01:00
Andreas Kling	4c7a349b62	LibJS: Remove #include <AST.h> from SharedFunctionInstanceData.h Extract FunctionParsingInsights into its own header and introduce FunctionLocal as a standalone mirror of Identifier::Local. This allows SharedFunctionInstanceData.h to avoid pulling in the full AST type hierarchy, reducing transitive include bloat. The AST.h include is kept in SharedFunctionInstanceData.cpp where it's needed for the constructor that accesses AST node types.	2026-02-11 23:57:41 +01:00
Andreas Kling	6decb93dd7	LibJS: Populate ClassBlueprint during codegen Build a ClassBlueprint from ClassExpression elements at codegen time: - Methods/getters/setters: register SharedFunctionInstanceData from the method's FunctionExpression - Field initializers with literal values (numbers, booleans, null, strings, negated numbers): store the value directly, avoiding function creation entirely - Field initializers with non-literal values: wrap in ClassFieldInitializerStatement and create SharedFunctionInstanceData - Static initializers: create SharedFunctionInstanceData from the function body - Constructor: register SharedFunctionInstanceData from the constructor's FunctionExpression Add public accessors to ClassMethod::function() and StaticInitializer::function_body() for codegen access. The blueprint is registered but not yet used by NewClass (dual path). No behavioral change.	2026-02-11 23:57:41 +01:00
Andreas Kling	6b0003b057	LibJS: Pre-create SharedFunctionInstanceData in NewFunction Replace the FunctionNode const& stored on the NewFunction bytecode instruction with an index into a table of pre-created SharedFunctionInstanceData objects on the Executable. During bytecode compilation, we now eagerly create SharedFunctionInstanceData for each function that will be instantiated by NewFunction, and store it on both the FunctionNode (for caching) and the Executable (for GC tracing). At runtime, NewFunction simply looks up the SharedFunctionInstanceData by index and calls create_from_function_data() directly, bypassing the AST entirely. This removes one of the main reasons the AST had to stay alive after compilation. The instantiate_ordinary_function_expression() helper in Interpreter.cpp is removed as its non-trivial code path (creating a scope for named function expressions) was dead code -- it was only called when !has_name(), so the has_own_name branch never executed.	2026-02-11 23:57:41 +01:00
Andreas Kling	479b89aa6d	LibJS: Fix UpdateEmpty completion value semantics for loops/switch/if When a loop or switch body produces an abrupt completion (break or continue) with an empty value, the ES spec requires UpdateEmpty to replace the empty value with the last non-empty completion value V. The bytecode compiler was failing to do this because it only updated the completion register after body codegen, guarded by !is_current_block_terminated(). When break/continue terminated the block, the update was skipped. Fix this with three changes: 1. Introduce a CompletionRegisterScope that tells ScopeNode::generate_bytecode to eagerly emit Mov instructions into the completion register after each value-producing statement. This ensures the register is up to date before any break or continue fires. 2. Give IfStatement its own CompletionRegisterScope (initialized to undefined) during branch evaluation. This models the spec's UpdateEmpty(stmtCompletion, undefined) for if-statements: when break/continue fires inside an if-branch, the scoped jump propagation sees that the if's completion register differs from the loop's and emits a Mov, correctly replacing the eagerly written value with undefined. Without this, code like { 3; if (true) { break; } else { } } would incorrectly carry the value 3 instead of undefined through the break. 3. Capture loop body results and emit a fallback Mov for non-ScopeNode bodies (e.g. bare expression statements like do x=1; while(false)) that don't participate in the eager CompletionRegisterScope update mechanism. For labelled break/continue that cross loop boundaries, the jump codegen now propagates the inner completion register to the target scope's completion register before emitting the jump. Also fix ForStatement to use a proper completion register (previously it returned the body result directly, which was wrong for empty bodies and break-with-no-value cases).	2026-02-11 14:29:36 +01:00
Andreas Kling	720fd567b1	LibJS: Collapse handler/finalizer into single exception handler target After replacing the runtime unwind context stack with explicit completion records for try/finally dispatch, the distinction between "handler" (catch) and "finalizer" (finally) in the exception handler table is no longer meaningful at runtime. handle_exception() checked handler first, then finalizer, but they did the exact same thing (set the PC). When both were present, the finalizer was dead code. Collapse both fields into a single handler_offset (now non-optional, since an entry always has a target), remove the finalizer concept from BasicBlock, UnwindContext, and ExceptionHandlers, and simplify handle_exception() to a direct assignment.	2026-02-09 16:35:39 +01:00
Andreas Kling	cbca493b28	LibJS: Remove BlockBoundaryType::Unwind With LeaveUnwindContext gone, the Unwind boundary type has no purpose. Remove it from the enum and all start/end boundary calls.	2026-02-09 16:35:39 +01:00
Andreas Kling	5abe40874a	LibJS: Remove LeaveUnwindContext opcode LeaveUnwindContext popped the runtime unwind context stack. With the stack being removed, all emission sites become dead code. Remove the opcode and all its emissions.	2026-02-09 16:35:39 +01:00
Andreas Kling	7f89158d20	LibJS: Replace implicit environment stack with explicit registers Replace the saved_lexical_environments stack in ExecutionContextRareData with explicit register-based environment tracking. Environments are now stored in registers and restored via SetLexicalEnvironment, making the environment flow visible in bytecode. Key changes: - Add GetLexicalEnvironment and SetLexicalEnvironment opcodes - CreateLexicalEnvironment takes explicit parent and dst operands - EnterObjectEnvironment stores new environment in a dst register - NewClass takes an explicit class_environment operand - Remove LeaveLexicalEnvironment opcode (instead: SetLexicalEnvironment) - Remove saved_lexical_environments from ExecutionContextRareData - Use a reserved register for the saved lexical environment to avoid dominance issues with lazily-emitted GetLexicalEnvironment	2026-02-09 16:35:39 +01:00
Andreas Kling	a439dc8490	LibJS: Use explicit completion records for try/finally dispatch Each finally scope gets two registers (completion_type and completion_value) that form an explicit completion record. Every path into the finally body sets these before jumping, and a dispatch chain after the finally body routes to the correct continuation. This replaces the old implicit protocol that relied on the exception register, a saved_return_value register, and a scheduled_jump field on ExecutionContext, allowing us to remove: - 5 opcodes (ContinuePendingUnwind, ScheduleJump, LeaveFinally, RestoreScheduledJump, PrepareYield) - 1 reserved register (saved_return_value) - 2 ExecutionContext fields (scheduled_jump, previously_scheduled_jumps)	2026-02-09 08:51:12 +01:00
Andreas Kling	d488f9f12f	LibJS: Narrow bytecode source map offsets from size_t to u32 Add VERIFY guards to catch bytecode programs that exceed u32::max bytes and narrow the bytecode_offset parameter in add_source_map_entry() to u32. This is a preparatory change for optimizing source map storage.	2026-01-26 19:37:42 +01:00
dosisod	ac8cc6d24b	LibJS: Constant fold `LogicalExpression` Logical expressions like `true || false` are now constant folded. This also allows for dead code elimination if we know the right-hand side of the expression will never be evaluated (such as `false && f()` or `true || f()`). In the test suites, the values are now being constant folded at compile time. To ensure that the actual evaluation logic is being called properly, I had to duplicate the tests and call them via a function so the compiler would not optimize the evaluation logic away. This also demotes `NaN` and `Infinity` identifiers to `nan` and `inf` double literals, which will further help with const folding.	2026-01-22 08:47:18 +01:00
Andreas Kling	505fe0a977	LibJS: Add shape caching for object literal instantiation When a function creates object literals with simple property names, we now cache the resulting shape after the first instantiation. On subsequent calls, we create the object with the cached shape directly and write property values at their known offsets. This avoids repeated shape transitions and property offset lookups for a common JavaScript pattern. The optimization uses two new bytecode instructions: - CacheObjectShape: Captures the final shape after object construction - InitObjectLiteralProperty: Writes properties using cached offsets Only "simple" object literals are optimized (string literal keys with simple value expressions). Complex cases like computed properties, getters/setters, and spread elements use the existing slow path. 3.4x speedup on a microbenchmark that repeatedly instantiates an object literal with 26 properties. Small progressions on various benchmarks.	2026-01-10 00:56:51 +01:00
Luke Wilde	c4c9ac08ad	LibJS: Follow the spec more closely for tagged template literals This resolves a FIXME in its code generation, particularly for: - Caching the template object - Setting the correct property attributes - Freezing the resulting objects This allows archive.org to load, which uses the Lit library. The Lit library caches these template objects to determine if a template has changed, allowing it to determine to do a full template rerender or only partially update the rendering. Before, we would always cause a full rerender on update because we didn't return the same template object. This caused issues with archive.org's code, I believe particularly with its router library, where we would constantly detach and reattach nodes unexpectedly, ending up with the page content not being attached to the router's custom element.	2026-01-06 23:25:36 +01:00
Andreas Kling	ece0b72e3c	LibJS: Don't set [[HomeObject]] for non-method object properties This fixes an issue where we'd incorrectly retain objects via the [[HomeObject]] slot. This common pattern was affected: Object.defineProperty(o, "foo", { get: function() { return 123; } }); Above, the object literal would get assigned to the [[HomeObject]] slot even though "get" is not a "method" per the spec. This frees about 30,000 objects on my x.com home feed.	2025-12-17 12:50:17 -06:00
Andreas Kling	bad16dc0e0	LibJS: Cache fully-formed PropertyKeys in Executable Instead of creating PropertyKeys on the fly during interpreter execution, we now store fully-formed ones in the Executable. This avoids a whole bunch of busywork in property access instructions and substantially reduces code size bloat.	2025-12-11 14:34:45 -06:00
Luke Wilde	a63b0cfaba	LibJS: Introduce NativeJavaScriptBackedFunction This hosts the ability to compile and run JavaScript to implement native functions. This is particularly useful for any native function that is not a normal function, for example async functions such as Array.fromAsync, which require yielding. These functions are not allowed to observe anything from outside their environment. Any global identifiers will instead be assumed to be a reference to an abstract operation or a constant. The generator will inject the appropriate bytecode if the name of the global identifier matches a known name. Anything else will cause a code generation error.	2025-11-30 11:54:54 +01:00
Luke Wilde	354888640d	LibJS/Bytecode: Make compilation use SharedFunctionInstanceData instead All the data we need for compilation is in SharedFunctionInstanceData, so we shouldn't depend on ECMAScriptFunctionObject. Allows NativeJavaScriptBackedFunction to compile bytecode.	2025-11-30 11:54:54 +01:00
Andreas Kling	003589db2d	LibJS: Generate C++ bytecode instruction classes from a definition file This commit adds a new Bytecode.def file that describes all the LibJS bytecode instructions. From this, we are able to generate the full declarations for all C++ bytecode instruction classes, as well as their serialization code. Note that some of the bytecode compiler was updated since instructions no longer have default constructor arguments. The big immediate benefit here is that we lose a couple thousand lines of hand-written C++ code. Going forward, this also allows us to do more tooling for the bytecode VM, now that we have an authoritative description of its instructions. Key things to know about: - Instructions can inherit from one another. At the moment, everything simply inherits from the base "Instruction". - @terminator means the instruction terminates a basic block. - @nothrow means the instruction cannot throw. This affects how the interpreter interacts with it. - Variable-length instructions are automatically supported. Just put an array of something as the last field of the instruction. - The m_length field is magical. If present, it will be populated with the full length of the instruction. This is used for variable-length instructions.	2025-11-21 09:46:03 +01:00
Andreas Kling	fb05063dde	LibJS: Let bytecode instructions know whether they are in strict mode This commit puts the strict mode flag in the header of every bytecode instruction. This allows us to check for strict mode without looking at the currently running execution context.	2025-10-29 21:20:10 +01:00
Andreas Kling	e7a3c4dbad	LibJS: Rename Bytecode::Op::PropertyKind => Bytecode::PutKind This is only used to specify how a property is being added to an object by Put* instructions, so let's call it PutKind. Also add an enumeration X macro for it to prepare for upcoming specializations.	2025-10-11 20:08:58 +02:00
Andreas Kling	46c6176235	LibJS: Cache bytecode constant strings with their Utf16String as key	2025-10-05 21:44:06 +02:00
Aliaksandr Kalenik	e81833423b	LibJS: Add PutByNumericId and change PutById to be string key only Previously, PutById constructed a PropertyKey from the identifier, which coerced numeric-like strings to numbers. This moves that decision to bytecode generation: the bytecode generator now emits PutByNumericId for numeric keys and PutById for string keys. This removes per-execution parsing from the interpreter. 1.4x speedup on the following microbenchmark: ```js const o = {}; for (let i = 0; i < 10_000_000; i++) { o.a = 1; o.b = 2; o.c = 3; } ```	2025-09-13 20:02:28 +02:00
Timothy Flynn	70db474cf0	LibJS+LibWeb: Port interned bytecode strings to UTF-16 This was almost a no-op, except we intern JS exception messages. So the bulk of this patch is porting exception messages to UTF-16.	2025-08-14 10:27:08 +02:00
Timothy Flynn	cf61171864	LibJS: Port remaining bytecode identifiers to UTF-16	2025-08-14 10:27:08 +02:00
Timothy Flynn	0efa98a57a	LibJS+LibWeb+WebContent: Port JS::PropertyKey to UTF-16 This has quite a lot of fall out. But the majority of it is just type or UDL substitution, where the changes just fall through to other function calls. By changing property key storage to UTF-16, the main affected areas are: * NativeFunction names must now be UTF-16 * Bytecode identifiers must now be UTF-16 * Module/binding names must now be UTF-16	2025-08-05 07:07:15 -04:00
ayeteadoe	539a675802	LibJS: Revert Enable EXPLICIT_SYMBOL_EXPORT This reverts commit `c14173f651`. We should only annotate the minimum number of symbols that external consumers actually use, so I am starting from scratch to do that	2025-07-22 11:51:29 -04:00
ayeteadoe	c14173f651	LibJS: Enable EXPLICIT_SYMBOL_EXPORT	2025-06-30 10:50:36 -06:00
Julien Le Bras	3ba6d129df	LibJS: Cache string constants in Generator::add_constant This mirrors the existing caching logic for int32 constants. Avoids duplication of string constants in m_constants which could result in stack overflows for large scripts with a lot of similar strings.	2025-06-01 18:25:59 +02:00
Shannon Booth	f2fb86abea	LibJS: Always emit value in emit_named_evaluation_if_anonymous_function There does not appear to be any case that we need to return an OptionalNone{}.	2025-05-23 03:25:55 +02:00
Aliaksandr Kalenik	db480b1f0c	LibJS: Preserve information about local variables declaration kind This is required for upcoming change where we want to emit ThrowIfTDZ for assignment expressions only for lexical declarations.	2025-05-06 12:06:23 +02:00
Aliaksandr Kalenik	2d732b2251	LibJS: Skip allocating locals for arguments that are allowed to be local This allows us to get rid of instructions that move arguments to locals and allocate a smaller JS::Value vector in ExecutionContext by reusing slots that were already allocated for arguments. With this change, for the following function: ```js function f(x, y) { return x + y; } ``` we now produce the following bytecode: ``` [ 0] 0: Add dst:reg6, lhs:arg0, rhs:arg1 [ 10] Return value:reg6 ``` instead of: ``` [ 0] 0: GetArgument 0, dst:x~1 [ 10] GetArgument 1, dst:y~0 [ 20] Add dst:reg6, lhs:x~1, rhs:y~0 [ 30] Return value:reg6 ```	2025-04-26 11:02:29 +02:00
Andreas Kling	3cf50539ec	LibJS: Make Value() default-construct the undefined value The special empty value (that we use for array holes, Optional<Value> when empty and a few other placeholder/sentinel tasks) still exists, but you now create one via JS::js_special_empty_value() and check for it with Value::is_special_empty_value(). The main idea here is to make it very unlikely to accidentally create an unexpected special empty value.	2025-04-05 11:20:26 +02:00
Andreas Kling	3169747989	LibJS: Emit PutById instead of PutByValue when key is string literal Basically convert o["foo"]=x into o.foo=x when emitting bytecode. These are effectively the same thing, and the latter format opts into using an inline cache for the property lookups.	2025-04-03 18:47:38 +02:00
Andreas Kling	4426c50a18	LibJS: Emit GetById instead of GetByValue when key is string literal Basically convert o["foo"] into o.foo when emitting bytecode. These are effectively the same thing, and the latter format opts into using an inline cache for the property lookups.	2025-04-03 18:47:38 +02:00
Lucien Fiorini	5707076b9e	LibJS: Optimize away Mov instructions when the source is the destination	2025-03-28 11:21:12 +00:00
Lucien Fiorini	6b6e13e28c	LibJS: Avoid emptying the return value register in try/finally This works because at the end of the finally chunk, a ContinuePendingUnwind is generated which copies the saved return value register into the return value register. In cases where ContinuePendingUnwind is not generated, such as when there is a break statement in the finally block, the function will return undefined, which is consistent with V8 and SpiderMonkey.	2025-03-27 12:18:30 +00:00
Andreas Kling	46a5710238	LibJS: Use FlyString in PropertyKey instead of DeprecatedFlyString This required dealing with substantial fallout.	2025-03-24 22:27:17 +00:00
Andreas Kling	3bfb0534be	LibGC: Rename MarkedVector => RootVector Let's try to make it a bit more clear that this is a Vector of GC roots.	2024-12-26 19:10:44 +01:00
Shannon Booth	f87041bf3a	LibGC+Everywhere: Factor out a LibGC from LibJS Resulting in a massive rename across almost everywhere! Alongside the namespace change, we now have the following names: * JS::NonnullGCPtr -> GC::Ref * JS::GCPtr -> GC::Ptr * JS::HeapFunction -> GC::Function * JS::CellImpl -> GC::Cell * JS::Handle -> GC::Root	2024-11-15 14:49:20 +01:00
Timothy Flynn	93712b24bf	Everywhere: Hoist the Libraries folder to the top-level	2024-11-10 12:50:45 +01:00
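
Commit c4c9ac08ad above describes caching and freezing the template object for tagged template literals. The snippet below is a minimal sketch of the script-visible behavior that the commit message refers to, written as plain JavaScript; it does not show LibJS internals, and the names `tag` and `render` are hypothetical helpers introduced only for illustration.

```js
// Illustration only: script-level behavior, not LibJS code.
// `tag` and `render` are hypothetical names.
function tag(strings) {
  // Return the template object itself so callers can compare identities.
  return strings;
}

function render(value) {
  // One tagged template call site, evaluated on every call to render().
  return tag`count: ${value}!`;
}

const first = render(1);
const second = render(2);

// The same call site should hand the tag function the same template object,
// even though the substitution value differs between calls.
const sameObject = first === second;

// Both the template object and its `raw` array should be frozen.
const frozen = Object.isFrozen(first) && Object.isFrozen(first.raw);

console.log(sameObject, frozen);
```

Libraries that key a cache on the template object (the commit message names Lit) depend on this identity being stable across calls, which is why returning a fresh object each time forces a full rerender.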

44 commits
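
Commit 479b89aa6d in the log above deals with UpdateEmpty completion values for loops and if-statements. As a rough illustration of what that means at the script level, the sketch below uses direct `eval`, which returns the completion value of the evaluated code. The first case is an assumption-free restatement of the rule in the commit message (a bare `break` keeps the last non-empty value); the second is the `{ 3; if (true) { break; } else { } }` case quoted in the message, wrapped in a `do`/`while` loop chosen here only so the `break` has a target. The variable names are hypothetical.

```js
// Illustration only: observing statement completion values via direct eval.
// A bare `break` has an empty completion value, so the loop body's last
// non-empty value (3) is carried out of the loop.
const afterValue = eval("1; do { 3; break; } while (false)");

// Here the `break` sits inside an if-statement. The if-statement replaces
// the empty value with undefined, so 3 must not leak through.
const insideIf = eval(
  "1; do { 3; if (true) { break; } else { } } while (false)"
);

console.log(afterValue, insideIf);
```

Under the semantics described in the commit message, `afterValue` should be `3` and `insideIf` should be `undefined`.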