ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2026-06-24 18:30:23 +00:00

Author	SHA1	Message	Date
Shannon Booth	13bae036d8	LibWeb: Do not refer to removed C++ DAFSA code generator	2026-06-05 20:23:24 +02:00
Sam Atkins	bb4f8a6621	LibWeb: Track parser-created style sheets Create parser-blocking style sheets when parser-created `<style>` elements are popped from the stack of open elements, and ignore dynamic style updates while those elements are still open in the parser. Make the shared style-element script-blocking predicate describe the active style sheet instance. Stale script-blocking entries are removed when that style sheet is replaced or removed.	2026-06-04 16:39:54 +01:00
Vanand Gasparyan	f3a3488cda	Rust: Set import granularity to Item By default, `rustfmt` persists the import granularity. In practice, most Rust code has import granularity "Module" due to LSP's actions. "Item" gets rid of import groupings and achieves cleaner diffs and better conflict resolution. Better greppability is a positive side effect. Note: it's an unstable rustfmt feature. `cargo +nightly fmt` must be used instead of `cargo fmt`.	2026-05-28 06:52:18 +02:00
Andreas Kling	411c6654e8	LibWeb: Add Rust preload scanner Add a Rust scanner that walks pending HTML parser input and emits base href updates or speculative fetch candidates. Keep URL parsing and fetch issuance in C++ for now, where the Document and request objects live. Allow the scan callback to stop iteration so the C++ speculative parser can preserve its stop hook once it is wired up. Expose a shared Attribute helper for resolving interned local names and use it from the Rust parser and preload scanner instead of repeating the same lookup pattern. Cover link rel handling, preload destination filtering, crossorigin mapping, and template/foreign-content skipping with Rust unit tests.	2026-05-18 00:23:52 +02:00
Andreas Kling	29784ea397	LibWeb: Remove the C++ HTML tree builder Delete the old C++ tree-construction implementation and helper classes that became unused once the Rust parser is unconditional. Remove the C++ stack of open elements, active formatting elements, speculative mock element, and tree-builder-only token storage. Keep the C++ parser entry points that still own LibWeb DOM integration, encoding detection, tokenizer bridging, incremental parsing, and the speculative parser support used by resource discovery.	2026-05-17 15:35:56 +02:00
Andreas Kling	f49335f210	LibWeb: Align declarative shadow root parsing Teach the Rust parser to recognize declarative shadow root templates and pass the parsed mode, slot assignment, clonable, serializable, and focus-delegation flags to the C++ DOM host. Expose shadowRootSlotAssignment reflection with the spec-defined named missing and invalid value defaults, and extend the ShadowDOM text test coverage for the reflected property and parser-created shadow roots.	2026-05-17 15:35:56 +02:00
Andreas Kling	54879bc916	LibWeb: Complete Rust HTML tree construction Finish the Rust implementation of the spec tree-construction algorithms needed by the LibWeb test suite. Add the remaining table modes, foster parenting, scope helpers, adoption agency handling, ruby/list/form and select cases, frameset state, foreign-content edge cases, and parser host callbacks. Preserve behavior that depends on the C++ DOM integration, including parser-created custom element reactions, fragment quirks mode, arbitrary fragment namespaces, template fragment mode, fragment form ownership, MathML annotation-xml boundaries, contextual fragment scripts, parser script source positions, document.close() parser state, void-element insertion, and duplicate attribute tracking. Add focused tests for the parser edge cases that are easy to regress at the boundary between the Rust tree builder and the C++ DOM host.	2026-05-17 15:35:56 +02:00
Andreas Kling	de12062515	LibWeb: Wire Rust parser scripts and fragments Preserve Rust parser state across tokenizer runs and stop cleanly when a parser-blocking script has to execute. Thread the pending script back through the existing C++ parser entry point so document.write(), input insertion points, and script bookkeeping continue to use the normal LibWeb machinery. Add the fragment parser setup needed by innerHTML and contextual fragment parsing, including context elements, form ownership, tokenizer state selection, text coalescing, and foreign-content integration.	2026-05-17 15:35:56 +02:00
Andreas Kling	2e9875770e	LibWeb: Add initial Rust HTML tree construction Implement the first Rust tree builder pass around the tokenizer and the LibWeb DOM host hooks. Cover the document setup, insertion-mode dispatch, ordinary body insertion, basic table handling, active formatting element reconstruction, and foreign-content routing. Leave the C++ parser available at runtime so the new path can be tested against the old implementation while the remaining tree-construction algorithms are filled in.	2026-05-17 15:35:56 +02:00
Andreas Kling	09296315c2	LibWeb: Add Rust HTML parser host plumbing Add the C++ and Rust scaffolding that lets the tree builder live in Rust while the DOM remains owned by LibWeb. Keep the exported surface small: Rust stores parser state, and C++ provides node creation, insertion, script, template, and GC hooks. Route dump-html-tree through the selectable parser backend so the new implementation can be exercised beside the existing parser while it is being brought up.	2026-05-17 15:35:56 +02:00
Andreas Kling	5b63cb5f37	LibWeb: Avoid unsafe tokenizer state conversion Replace the FFI tokenizer state transmute with an explicit conversion from the incoming numeric value. The old code range-checked against the last state before transmuting, which matched today's contiguous enum but left the conversion dependent on that layout detail. Returning early for unknown values keeps the FFI boundary tolerant while removing a source of possible invalid enum discriminants.	2026-05-17 15:35:56 +02:00
Andreas Kling	171e3adf01	LibWeb: Replace the HTML tokenizer with Rust Replace the C++ HTML tokenizer with a Rust implementation behind the existing HTMLTokenizer API. Keep the parser-facing integration points for streaming input, insertion points, document.write(), EOF insertion, parser aborts, speculative parser input, and last start tag tracking. The generated FFI handle stays an implementation detail of HTMLTokenizer, so callers keep a single tokenizer class. Preserve duplicate attributes through FFI so C++ token normalization can record the duplicate-attribute signal used by CSP nonce checks. Keep bulk tag-name and attribute scans capped at the active insertion point so streamed parser input is spliced at the right offset. Use generated DAFSA tables for named character references and intern common tag and attribute names to reduce FFI marshalling overhead. This also fixes attribute name source positions, nested old insertion points, and aborted fast-path handling. TestHTMLTokenizer covers duplicate attributes and insertion points in fast tag-name, attribute-name, and quoted-value scans. A CSP text test covers duplicate nonce attributes on parser-created script elements. The tokenizer dump fixtures still match, TestHTMLTokenizer passes, and the full release test-web run passes with 6981 tests and 226 skipped.	2026-05-15 21:01:40 +02:00

12 commits