ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2026-04-19 02:10:26 +00:00

Author	SHA1	Message	Date
Andreas Kling	6cdfbd01a6	LibJS: Add alternative source-to-bytecode pipeline in Rust Implement a complete Rust reimplementation of the LibJS frontend: lexer, parser, AST, scope collector, and bytecode code generator. The Rust pipeline is built via Corrosion (CMake-Cargo bridge) and linked into LibJS as a static library. It is gated behind a build flag (ENABLE_RUST, on by default except on Windows) and two runtime environment variables: - LIBJS_CPP: Use the C++ pipeline instead of Rust - LIBJS_COMPARE_PIPELINES=1: Run both pipelines in lockstep, aborting on any difference in AST or bytecode generated. The C++ side communicates with Rust through a C FFI layer (RustIntegration.cpp/h) that passes source text to Rust and receives a populated Executable back via a BytecodeFactory interface.	2026-02-24 09:39:42 +01:00
Andreas Kling	7e85bb251f	LibJS: Keep SourceCode alive in SharedFunctionInstanceData SharedFunctionInstanceData::m_source_text is a Utf16View into the SourceCode that the function was parsed from. Previously, the SourceCode was kept alive transitively through the AST nodes. As we move towards dropping AST nodes after compilation, we need to ensure the SourceCode outlives the view. Add a RefPtr<SourceCode> to SharedFunctionInstanceData so m_source_text remains valid even after the AST is gone. This is needed for Function.prototype.toString which accesses source_text() at any point during the function's lifetime.	2026-02-11 23:57:41 +01:00
Andreas Kling	e308e73120	LibJS: Move SharedFunctionInstanceData creation out of FunctionNode Add static factory methods create_for_function_node() on SharedFunctionInstanceData and update all callers to use them instead of FunctionNode::ensure_shared_data(). This removes the GC::Root<SharedFunctionInstanceData> cache from FunctionNode, eliminating the coupling between the RefCounted AST and GC-managed runtime objects. The cache was effectively dead code: hoisted declarations use m_functions_to_initialize directly, and function expressions always create fresh instances during codegen.	2026-02-11 23:57:41 +01:00
Andreas Kling	4c7a349b62	LibJS: Remove #include <AST.h> from SharedFunctionInstanceData.h Extract FunctionParsingInsights into its own header and introduce FunctionLocal as a standalone mirror of Identifier::Local. This allows SharedFunctionInstanceData.h to avoid pulling in the full AST type hierarchy, reducing transitive include bloat. The AST.h include is kept in SharedFunctionInstanceData.cpp where it's needed for the constructor that accesses AST node types.	2026-02-11 23:57:41 +01:00
Andreas Kling	5e577bec6a	LibJS: Clear m_formal_parameters and m_ecmascript_code after compilation After bytecode compilation, the formal parameters and ecmascript code AST nodes are no longer needed at runtime. Clear these RefPtrs in clear_compile_inputs() to allow the AST subtrees to be freed.	2026-02-11 23:57:41 +01:00
Andreas Kling	9ea5aa93f8	LibJS: Pre-store formal parameter runtime data Replace the runtime uses of formal_parameters() with pre-computed data: - m_formal_parameter_count stores the parameter count - m_parameter_names_for_mapped_arguments stores ordered parameter names for simple parameter lists (used by create_mapped_arguments_object) Change create_mapped_arguments_object to take Span<Utf16FlyString> instead of NonnullRefPtr<FunctionParameters const>. Remove virtual formal_parameters() from FunctionObject as it is no longer needed.	2026-02-11 23:57:41 +01:00
Andreas Kling	712d3fc54f	LibJS: Pre-compute ScopeNode queries in SharedFunctionInstanceData Pre-compute the data that emit_function_declaration_instantiation previously obtained by querying ScopeNode methods at codegen time: - m_has_scope_body: whether ecmascript_code is a ScopeNode - m_has_non_local_lexical_declarations: from ScopeNode query - m_lexical_bindings: non-local lexically-scoped identifier names and their constant-declaration status After this change, emit_function_declaration_instantiation no longer casts m_ecmascript_code to ScopeNode or calls any ScopeNode methods.	2026-02-11 23:57:41 +01:00
Andreas Kling	d36521a698	LibJS: Replace m_functions_to_initialize with pre-created data Replace Vector<FunctionDeclaration const&> with a FunctionToInitialize struct that stores a pre-created SharedFunctionInstanceData, function name, and local index. The SharedFunctionInstanceData for each hoisted function is created eagerly during the parent's construction, removing the need to reference FunctionDeclaration AST nodes after construction.	2026-02-11 23:57:41 +01:00
Andreas Kling	7cc392551b	LibJS: Replace VariableNameToInitialize with value-type VarBinding Replace VariableNameToInitialize (which holds Identifier const&) with a VarBinding struct that stores pre-extracted values: name, local index, parameter_binding, and function_name. This removes a reference to AST Identifier nodes from SharedFunctionInstanceData, allowing the AST to be freed after compilation.	2026-02-11 23:57:41 +01:00
Andreas Kling	658ba1d023	LibJS: Clear compile-only data from SharedFunctionInstanceData After successful bytecode compilation, the m_functions_to_initialize and m_var_names_to_initialize_binding vectors are no longer needed as they are only consumed by emit_function_declaration_instantiation() during code generation. Add clear_compile_inputs() to release these vectors post-compile, and call it from both ECMAScriptFunctionObject::get_stack_frame_size() and NativeJavaScriptBackedFunction::bytecode_executable() after their respective lazy compilation succeeds. Also add a pre-compile assertion in Generator::generate_from_function() to verify we never try to compile the same function data twice, and a VERIFY in ECMAScriptFunctionObject::ecmascript_code() to guard against null dereference.	2026-02-11 23:57:41 +01:00
Andreas Kling	88d715fc68	LibJS: Eliminate HashMap operations in SFID by caching parser data Cache necessary data during parsing to eliminate HashMap operations in SharedFunctionInstanceData construction. Before: 2 HashMap copies + N HashMap insertions with hash computations After: Direct vector iteration with no hashing Build FunctionScopeData for function scopes in the parser containing: - functions_to_initialize: deduplicated var-scoped function decls - vars_to_initialize: var decls with is_parameter/is_function_name - var_names: HashTable for AnnexB extension checks - Pre-computed counts for environment size calculation - Flags for "arguments" handling Add ScopeNode::ensure_function_scope_data() to compute the data on-demand for edge cases that don't go through normal parser flow (synthetic class constructors, static initializers, module wrappers). Use this cached data directly in SFID with zero HashMap operations.	2026-01-25 23:08:36 +01:00
Jelle Raaijmakers	ae20ecf857	AK+Everywhere: Add Vector::contains(predicate) and use it No functional changes.	2026-01-08 15:27:30 +00:00
Andreas Kling	a9cc425cde	LibJS+LibWeb: Add missing GC marking visits This adds visit_edges(Cell::Visitor&) methods to various helper structs that contain GC pointers, and makes sure they are called from owning GC-heap-allocated objects as needed. These were found by our Clang plugin after expanding its capabilities. The added rules will be enforced by CI going forward.	2026-01-07 12:48:58 +01:00
Andreas Kling	63eccc5640	LibJS: Don't make extra copies of every JS function's source code Instead, let functions have a view into the AST's SourceCode object's underlying string data. The source string is kept alive by the AST, so it's fine to have views into it as long as the AST exists. Reduces memory footprint on my x.com home feed by 65 MiB.	2025-12-21 10:06:04 -06:00
Luke Wilde	2d15326f5d	LibJS: Moved SharedFunctionInstanceData into separate files This will allow it to be reused by NativeJavaScriptBackedFunction.	2025-11-30 11:54:54 +01:00

15 commits