ladybird/Libraries/LibHTTP/HTTP.cpp
Timothy Flynn 9375660b64 LibHTTP+LibWeb+RequestServer: Move Fetch's HTTP header infra to LibHTTP
The end goal here is for LibHTTP to be the home of our RFC 9111 (HTTP
caching) implementation. We currently have one implementation in LibWeb
for our in-memory cache and another in RequestServer for our disk cache.

The implementations both largely revolve around interacting with HTTP
headers. But in LibWeb, we are using Fetch's header infra, and in RS we
are using are home-grown header infra from LibHTTP.

So to give these a common denominator, this patch replaces the LibHTTP
implementation with Fetch's infra. Our existing LibHTTP implementation
was not particularly compliant with any spec, so this at least gives us
a standards-based common implementation.

This migration also required moving a handful of other Fetch AOs over
to LibHTTP. (It turns out these AOs were all from the Fetch/Infra/HTTP
folder, so perhaps it makes sense for LibHTTP to be the implementation
of that entire set of facilities.)
2025-11-27 14:57:29 +01:00

77 lines
2.3 KiB
C++

/*
* Copyright (c) 2022, Luke Wilde <lukew@serenityos.org>
*
* SPDX-License-Identifier: BSD-2-Clause
*/
#include <AK/GenericLexer.h>
#include <AK/StringBuilder.h>
#include <LibHTTP/HTTP.h>
namespace HTTP {
// https://fetch.spec.whatwg.org/#collect-an-http-quoted-string
String collect_an_http_quoted_string(GenericLexer& lexer, HttpQuotedStringExtractValue extract_value)
{
// 1. Let positionStart be position.
auto position_start = lexer.tell();
// 2. Let value be the empty string.
StringBuilder value;
// 3. Assert: the code point at position within input is U+0022 (").
VERIFY(lexer.peek() == '"');
// 4. Advance position by 1.
lexer.ignore(1);
// 5. While true:
while (true) {
// 1. Append the result of collecting a sequence of code points that are not U+0022 (") or U+005C (\) from
// input, given position, to value.
auto value_part = lexer.consume_until([](char ch) {
return ch == '"' || ch == '\\';
});
value.append(value_part);
// 2. If position is past the end of input, then break.
if (lexer.is_eof())
break;
// 3. Let quoteOrBackslash be the code point at position within input.
// 4. Advance position by 1.
char quote_or_backslash = lexer.consume();
// 5. If quoteOrBackslash is U+005C (\), then:
if (quote_or_backslash == '\\') {
// 1. If position is past the end of input, then append U+005C (\) to value and break.
if (lexer.is_eof()) {
value.append('\\');
break;
}
// 2. Append the code point at position within input to value.
// 3. Advance position by 1.
value.append(lexer.consume());
}
// 6. Otherwise:
else {
// 1. Assert: quoteOrBackslash is U+0022 (").
VERIFY(quote_or_backslash == '"');
// 2. Break.
break;
}
}
// 6. If the extract-value flag is set, then return value.
if (extract_value == HttpQuotedStringExtractValue::Yes)
return MUST(value.to_string());
// 7. Return the code points from positionStart to position, inclusive, within input.
return MUST(String::from_utf8(lexer.input().substring_view(position_start, lexer.tell() - position_start)));
}
}