ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2026-04-19 02:10:26 +00:00

Author	SHA1	Message	Date
Andreas Kling	6cdfbd01a6	LibJS: Add alternative source-to-bytecode pipeline in Rust Implement a complete Rust reimplementation of the LibJS frontend: lexer, parser, AST, scope collector, and bytecode code generator. The Rust pipeline is built via Corrosion (CMake-Cargo bridge) and linked into LibJS as a static library. It is gated behind a build flag (ENABLE_RUST, on by default except on Windows) and two runtime environment variables: - LIBJS_CPP: Use the C++ pipeline instead of Rust - LIBJS_COMPARE_PIPELINES=1: Run both pipelines in lockstep, aborting on any difference in AST or bytecode generated. The C++ side communicates with Rust through a C FFI layer (RustIntegration.cpp/h) that passes source text to Rust and receives a populated Executable back via a BytecodeFactory interface.	2026-02-24 09:39:42 +01:00
Andreas Kling	7281091fdb	LibJS: Make bytecode generation infallible Remove CodeGenerationError and make all bytecode generation functions return their results directly instead of wrapping them in CodeGenerationErrorOr. For the few remaining sites where codegen encounters an unimplemented or unexpected AST node, we now use a new emit_todo() helper that emits a NewTypeError + Throw sequence at compile time (preserving the runtime behavior) and then switches to a dead basic block so subsequent codegen for the same function can continue without issue. This allows us to remove error handling from all callers of the bytecode compiler, simplifying the code significantly.	2026-02-12 11:37:43 +01:00
Andreas Kling	e308e73120	LibJS: Move SharedFunctionInstanceData creation out of FunctionNode Add static factory methods create_for_function_node() on SharedFunctionInstanceData and update all callers to use them instead of FunctionNode::ensure_shared_data(). This removes the GC::Root<SharedFunctionInstanceData> cache from FunctionNode, eliminating the coupling between the RefCounted AST and GC-managed runtime objects. The cache was effectively dead code: hoisted declarations use m_functions_to_initialize directly, and function expressions always create fresh instances during codegen.	2026-02-11 23:57:41 +01:00
Andreas Kling	680487fa9a	LibJS: Rewrite eval declaration instantiation using metadata Change eval_declaration_instantiation to take EvalDeclarationData& instead of Program const&. The function body now iterates pre-computed name lists instead of walking the AST. Both callers (perform_eval and perform_shadow_realm_eval) now build EvalDeclarationData before calling eval_declaration_instantiation. This decouples the runtime declaration-instantiation API from AST types, matching the pattern already used by Script for global declaration instantiation.	2026-02-11 23:57:41 +01:00
Andreas Kling	87b9795d75	LibJS: Pre-compute eval declaration instantiation data Add EvalDeclarationData struct that holds pre-computed metadata extracted from the Program AST: var names, functions to initialize, declared function names, var scoped names, AnnexB candidates, and lexical bindings. This mirrors the pattern used by Script for global declaration instantiation, and prepares for decoupling eval_declaration_instantiation from the AST.	2026-02-11 23:57:41 +01:00
Andreas Kling	9ea5aa93f8	LibJS: Pre-store formal parameter runtime data Replace the runtime uses of formal_parameters() with pre-computed data: - m_formal_parameter_count stores the parameter count - m_parameter_names_for_mapped_arguments stores ordered parameter names for simple parameter lists (used by create_mapped_arguments_object) Change create_mapped_arguments_object to take Span<Utf16FlyString> instead of NonnullRefPtr<FunctionParameters const>. Remove virtual formal_parameters() from FunctionObject as it is no longer needed.	2026-02-11 23:57:41 +01:00
Andreas Kling	4d92c4d71a	LibJS: Skip initializing constant slots in ExecutionContext Every function call allocates an ExecutionContext with a trailing array of Values for registers, locals, constants, and arguments. Previously, the constructor would initialize all slots to js_special_empty_value(), but constant slots were then immediately overwritten by the interpreter copying in values from the Executable before execution began. To eliminate this redundant initialization, we rearrange the layout from [registers \| constants \| locals] to [registers \| locals \| constants]. This groups registers and locals together at the front, allowing us to initialize only those slots while leaving constant slots uninitialized until they're populated with their actual values. This reduces the per-call initialization cost from O(registers + locals + constants) to O(registers + locals). Also tightens up the types involved (size_t -> u32) and adds VERIFYs to guard against overflow when computing the combined slot counts, and to ensure the total fits within the 29-bit operand index field.	2026-01-19 10:48:12 +01:00
Andreas Kling	5214e30182	LibJS: Shrink FunctionEnvironment by reordering members a bit	2025-12-21 12:08:41 -06:00
Andreas Kling	72b95d79d2	LibJS: Shrink DeclarativeEnvironment by shrinking DisposeCapability	2025-12-21 12:08:41 -06:00
Andreas Kling	e6521d8ead	LibJS: Mark ArgumentsObject as non-interfering if parameter list empty If the parameter list is empty, there are no mappings to worry about. This allows various internal property access optimizations. 1.17x speedup on Octane/raytrace.js	2025-12-10 17:40:57 -06:00
Andreas Kling	cb23d65625	LibJS: Pass JS::Value directly to string formatting functions We don't need to call .to_string_without_side_effects() when passing a JS::Value in for string formatting. The Formatter will do it for us.	2025-12-09 21:44:13 -06:00
Luke Wilde	a63b0cfaba	LibJS: Introduce NativeJavaScriptBackedFunction This hosts the ability to compile and run JavaScript to implement native functions. This is particularly useful for any native function that is not a normal function, for example async functions such as Array.fromAsync, which require yielding. These functions are not allowed to observe anything from outside their environment. Any global identifiers will instead be assumed to be a reference to an abstract operation or a constant. The generator will inject the appropriate bytecode if the name of the global identifier matches a known name. Anything else will cause a code generation error.	2025-11-30 11:54:54 +01:00
Psychpsyo	100f37995f	Everywhere: Clean up AD-HOC and FIXME comments without colons	2025-11-13 15:56:04 +01:00
Andreas Kling	0dacc94edd	LibJS: Have JS::Lexer take a JS::SourceCode as input This moves the responsibility of setting up a SourceCode object to the users of JS::Lexer. This means Lexer and Parser are free to use string views into the SourceCode internally while working. It also means Lexer no longer has to think about anything other than UTF-16 (or ASCII) inputs. So the unit test for parsing various invalid UTF-8 sequences is deleted here.	2025-11-09 12:14:03 +01:00
Andreas Kling	5706831328	LibJS: Make run_executable() return simple ThrowCompletionOr<Value> We don't need to return two values; running an executable only ever produces a throw completion, or a normal completion, i.e a Value. This necessitated a few minor changes, such as adding a way to check if a JS::Cell is a GeneratorResult.	2025-10-31 08:56:02 +01:00
Andreas Kling	9dae1acc31	LibJS: Pass ExecutionContext to Interpreter::run_executable() This avoids having to get it from the VM's context stack, since most callers already have it on hand.	2025-10-29 21:20:10 +01:00
Andreas Kling	fdb85a330e	LibJS: Stop tracking whether execution context is strict mode or not This was only used for basic testing, and forced us to plumb this flag flag in a bunch of places.	2025-10-29 21:20:10 +01:00
Andreas Kling	fb05063dde	LibJS: Let bytecode instructions know whether they are in strict mode This commits puts the strict mode flag in the header of every bytecode instruction. This allows us to check for strict mode without looking at the currently running execution context.	2025-10-29 21:20:10 +01:00
Feng Yu	61c36e2865	LibJS: Sync additional Import Attributes spec changes Some steps were not updated with tc39/ecma262#3057. This patch syncs the remaining changes.	2025-10-22 10:58:19 +02:00
Andreas Kling	d065171791	LibJS: Use property lookup caches for some of our hot C++ gets We can use caching in a million more places. This is just me running JS benchmarks and looking at which get() call sites were hot and putting caches there. Lots of nice speedups all over the place, some examples: 1.19x speedup on Octane/raytrace.js 1.13x speedup on Octane/earley-boyer.js 1.12x speedup on Kraken/ai-astar.js 1.10x speedup on Octane/box2d.js 1.08x speedup on Octane/gbemu.js 1.05x speedup on Octane/regexp.js	2025-10-14 15:47:38 +02:00
Timothy Flynn	a4991143e0	LibJS: Update spec links and steps for the U8Array base64/hex proposal This proposal reached stage 4 and was merged into ECMA-262. See: `3dfa316`	2025-10-03 09:03:40 +02:00
Aliaksandr Kalenik	a54215c07d	LibJS: Make `internal_define_own_property()` save added property offset ...in `PropertyDescriptor`. This is required for the upcoming change that needs to know offset of newly added properties to set up inline caching.	2025-09-17 12:44:44 +02:00
Andreas Kling	e5b07858a2	LibJS: Allocate Call{Construct,DirectEval,Builtin) contexts up front We already do this for normal Call contexts, so this is just continuing to propagate the same pattern to other instructions. Fixes #6026	2025-08-31 15:24:37 +02:00
Timothy Flynn	70db474cf0	LibJS+LibWeb: Port interned bytecode strings to UTF-16 This was almost a no-op, except we intern JS exception messages. So the bulk of this patch is porting exception messages to UTF-16.	2025-08-14 10:27:08 +02:00
Timothy Flynn	b955c9b2a9	LibJS: Port the Identifier AST (and related) nodes to UTF-16 This eliminates quite a lot of UTF-8 / UTF-16 churn.	2025-08-13 09:56:13 -04:00
Timothy Flynn	0efa98a57a	LibJS+LibWeb+WebContent: Port JS::PropertyKey to UTF-16 This has quite a lot of fall out. But the majority of it is just type or UDL substitution, where the changes just fall through to other function calls. By changing property key storage to UTF-16, the main affected areas are: * NativeFunction names must now be UTF-16 * Bytecode identifiers must now be UTF-16 * Module/binding names must now be UTF-16	2025-08-05 07:07:15 -04:00
Timothy Flynn	a43cb15e81	LibJS+LibWeb: Replace JS::Utf16String with AK::Utf16String	2025-07-18 12:45:38 -04:00
Timothy Flynn	2803d66d87	AK: Support UTF-16 string formatting The underlying storage used during string formatting is StringBuilder. To support UTF-16 strings, this patch allows callers to specify a mode during StringBuilder construction. The default mode is UTF-8, for which StringBuilder remains unchanged. In UTF-16 mode, we treat the StringBuilder's internal ByteBuffer as a series of u16 code units. Appending a single character will append 2 bytes for that character (cast to a char16_t). Appending a StringView will transcode the string to UTF-16. Utf16String also gains the same memory optimization that we added for String, where we hand-off the underlying buffer to Utf16String to avoid having to re-allocate. In the future, we may want to further optimize for ASCII strings. For example, we could defer committing to the u16-esque storage until we see a non-ASCII code point.	2025-07-18 12:45:38 -04:00
Timothy Flynn	fe676585f5	AK: Add a UTF-16 string with optimized short- and ASCII-string storage This is a strictly UTF-16 string with some optimizations for ASCII. * If created from a short UTF-8 or UTF-16 string that is also ASCII, then the string is stored in an inlined byte buffer. * If created with a long UTF-8 or UTF-16 string that is also ASCII, then the string is stored in an outlined char buffer. * If created with a short or long UTF-8 or UTF-16 string that is not ASCII, then the string is stored in an outlined char16 buffer. We do not store short non-ASCII text in the inlined buffer to avoid confusion with operations such as `length_in_code_units` and `code_unit_at`. For example, "😀" would be stored as 4 UTF-8 bytes in short string form. But we still want `length_in_code_units` to be 2, and `code_unit_at(0)` to be 0xD83D.	2025-07-18 12:45:38 -04:00
Luke Wilde	3d43462ccd	LibJS: Implement the Dynamic Code Brand Checks stage 3 proposal This is an active proposal at stage 3 of the TC39 proposal process. See: https://tc39.es/proposal-dynamic-code-brand-checks/ See: https://github.com/tc39/proposal-dynamic-code-brand-checks This proposal essentially adds support for the TrustedScript type from the Trusted Types specification to eval and Function. This in turn pipes support for the type into the CSP hook to check if the CSP allows dynamic code compilation. However, it currently doesn't support ShadowRealms, so the implementation here is a close approximation, using PerformEval as the basis. See: https://github.com/tc39/proposal-dynamic-code-brand-checks/issues/19 This is required to support the new function signature for the CSP hook, and will allow us to slot in Trusted Types support in the future.	2025-07-09 15:52:54 -06:00
Timothy Flynn	62d9a84b8d	AK+Everywhere: Replace custom number parsers with fast_float Our floating point number parser was based on the fast_float library: https://github.com/fastfloat/fast_float However, our implementation only supports 8-bit characters. To support UTF-16, we will need to be able to convert char16_t-based strings to numbers as well. This works out-of-the-box with fast_float. We can also use fast_float for integer parsing.	2025-07-03 09:51:56 -04:00
Timothy Flynn	9fc3e72db2	AK+Everywhere: Allow lonely UTF-16 surrogates by default By definition, the web allows lonely surrogates by default. Let's have our string APIs reflect this, so we don't have to pass an allow option all over the place.	2025-07-03 09:51:56 -04:00
Timothy Flynn	86b1c78c1a	AK+Everywhere: Prepare Utf16View for integration with a UTF-16 string To prepare for an upcoming Utf16String, this migrates Utf16View to store its data as a char16_t. Most function definitions are moved inline and made constexpr. This also adds a UDL to construct a Utf16View from a string literal: auto string = u"hello"sv; This let's us remove the NTTP Utf16View constructor, as we have found that such constructors bloat binary size quite a bit.	2025-07-03 09:51:56 -04:00
Andreas Kling	a0864dbb26	LibJS: Make mapped arguments objects way less allocation-happy By following the spec to the letter, our mapped arguments objects ended up with many extra GC allocations: - 1 extra Object for the internal [[ParameterMap]]. - 2 extra NativeFunctions for each mapped parameter accessor. - 1 extra Accessor to hold the aforementioned NativeFunctions. This patch removes all those allocations and lets ArgumentsObject model the desired behavior in custom C++ instead of using script primitives. 1.06x speedup on Speedometer's TodoMVC-jQuery.	2025-05-11 14:00:40 +02:00
Timothy Flynn	3867a192a1	LibJS: Update spec steps / links for the import-assertions proposal This proposal has reached stage 4 and been merged into the main ECMA-262 spec. See: `4e3450e`	2025-04-29 07:33:08 -04:00
Andreas Kling	a05be67e4a	LibJS: Let invokers (callers) of [[Call]] allocate ExecutionContext Instead of letting every [[Call]] implementation allocate an ExecutionContext, we now make that a responsibility of the caller. The main point of this exercise is to allow the Call instruction to write function arguments directly into the callee ExecutionContext instead of copying them later. This makes function calls significantly faster: - 10-20% faster on micro-benchmarks (depending on argument count) - 4% speedup on Kraken - 2% speedup on Octane - 5% speedup on JetStream	2025-04-28 01:23:56 +02:00
Andrew Kaster	9bae24cc4a	LibJS: Add and use ValidateNonRevokedProxy AO This refactor is from two editorial changes to the spec from a while back. `44d1cae2b2` `21ffeee869`	2025-04-24 10:37:39 +02:00
Aliaksandr Kalenik	a329868c1b	LibJS: Allocate ExecutionContext memory using alloca() when possible This should be faster than heap allocation. However, heap allocation is still necessary in some cases, such as with generators and async functions.	2025-04-24 10:30:52 +02:00
Aliaksandr Kalenik	c6cd03d7ca	LibJS+LibWeb: Join arguments into vector of registers+constants+locals This is better because: - Better data locality - Allocate vector for registers+constants+locals+arguments in one go instead of allocating two vectors separately	2025-04-24 10:30:52 +02:00
Aliaksandr Kalenik	80a8040794	LibJS+LibWeb: Calculate count of regs+consts+locals before EC allocation This is a preparation step before joining arguments vector into vector of registers+constants+locals.	2025-04-24 10:30:52 +02:00
Andreas Kling	669b1131ad	LibJS: Streamline CreateMappedArgumentsObject [[ParameterMap]] creation Instead of using the more generic define_native_accessor() here, we poke directly at indexed property storage for the parameter map. We can also construct the NativeFunction objects directly, without giving them names like "get 0" etc, since these are not observable by userspace. Finally, by using default property attributes (not observable anyway), we get simple indexed storage instead of generic (hash map) storage.	2025-04-20 18:43:11 +02:00
Andreas Kling	e0b32b1863	LibJS: Use premade shape when creating mapped arguments objects Knocks out a 0.4% profile item on Speedometer 3.	2025-04-19 01:14:02 +02:00
Andreas Kling	e8c351505e	LibJS: Use premade shape when creating unmapped arguments objects Takes Speedometer2.1/EmberJS-Debug-TodoMVC from ~4500ms to ~4000ms on my M3 MacBook Pro.	2025-04-15 13:08:27 +02:00
Andreas Kling	2a9b6f1d97	LibJS: Move computation out of the ECMAScriptFunctionObject constructor We were doing way too much computation every time an ESFO was instantiated. This was particularly sad, since the results of these computations were identical every time! This patch adds a new SharedFunctionInstanceData object that gets shared between all instances of an ESFO instantiated from some kind of AST FunctionNode. ~5% speedup on Speedometer 2.1 :^)	2025-04-08 18:52:35 +02:00
Andreas Kling	4209b18b88	LibJS: Add ECMAScriptFunctionObject::create_from_function_node() helper This gives us a shared entry point for every situation where we instantiate a function based on a FunctionNode from the AST.	2025-04-08 18:52:35 +02:00
Andreas Kling	3cf50539ec	LibJS: Make Value() default-construct the undefined value The special empty value (that we use for array holes, Optional<Value> when empty and a few other other placeholder/sentinel tasks) still exists, but you now create one via JS::js_special_empty_value() and check for it with Value::is_special_empty_value(). The main idea here is to make it very unlikely to accidentally create an unexpected special empty value.	2025-04-05 11:20:26 +02:00
Andreas Kling	de424d6879	LibJS: Make Completion.[[Value]] non-optional Instead, just use js_undefined() whenever the [[Value]] field is unused. This avoids a whole bunch of presence checks.	2025-04-05 11:20:26 +02:00
Andreas Kling	c71772126f	LibJS: Remove ByteString internals from PrimitiveString PrimitiveString is now internally either UTF-8, UTF-16, or both. We no longer convert them to/from ByteString anywhere, nor does VM have a ByteString cache.	2025-03-28 12:31:40 -04:00
Andreas Kling	7477002e46	LibJS: Keep parsed function parameters in a shared data structure Instead of making a copy of the Vector<FunctionParameter> from the AST every time we instantiate an ECMAScriptFunctionObject, we now keep the parameters in a ref-counted FunctionParameters object. This reduces memory usage, and also allows us to cache the bytecode executables for default parameter expressions without recompiling them for every instantiation. :^)	2025-03-27 15:00:43 +00:00
Andreas Kling	46a5710238	LibJS: Use FlyString in PropertyKey instead of DeprecatedFlyString This required dealing with substantial fallout.	2025-03-24 22:27:17 +00:00

1 2

63 commits