ladybird/Libraries/LibCompress/Deflate.h

/*
 * Copyright (c) 2020, the SerenityOS developers.
 * Copyright (c) 2021, Idan Horowitz <idan.horowitz@serenityos.org>
 * Copyright (c) 2025, Altomani Gianluca <altomanigianluca@gmail.com>
 *
 * SPDX-License-Identifier: BSD-2-Clause
 */

#pragma once

#include <AK/BitStream.h>
#include <AK/Stream.h>
#include <LibCompress/Zlib.h>

namespace Compress {

class CanonicalCode {
public:
    CanonicalCode() = default;
    ErrorOr<u32> read_symbol(LittleEndianInputBitStream&) const;
    ErrorOr<void> write_symbol(LittleEndianOutputBitStream&, u32) const;

    static CanonicalCode const& fixed_literal_codes();
    static CanonicalCode const& fixed_distance_codes();

    static ErrorOr<CanonicalCode> from_bytes(ReadonlyBytes);

private:
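    // Codes of at most this many bits are decoded with a single prefix-table lookup.
    // Longer codes (Deflate allows more than 8 bits) fall back to a binary search.
    static constexpr size_t max_allowed_prefixed_code_length = 8;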

    struct PrefixTableEntry {
        u16 symbol_value { 0 };
        u16 code_length { 0 };
    };

    // Decompression - indexed by code
    Vector<u16, 286> m_symbol_codes;
    Vector<u16, 286> m_symbol_values;

    Array<PrefixTableEntry, 1 << max_allowed_prefixed_code_length> m_prefix_table {};
    size_t m_max_prefixed_code_length { 0 };

    // Compression - indexed by symbol
    // Deflate uses a maximum of 288 symbols (maximum of 32 for distances),
    // but this is also used by webp, which can use up to 256 + 24 + (1 << 11) == 2328 symbols.
    // 288 is therefore only the inline capacity; larger alphabets make the vectors grow on the heap.
    Vector<u16, 288> m_bit_codes {};
    Vector<u16, 288> m_bit_code_lengths {};
};

ALWAYS_INLINE ErrorOr<void> CanonicalCode::write_symbol(LittleEndianOutputBitStream& stream, u32 symbol) const
{
    // A symbol without an assigned code falls back to code 0 with length 0, i.e. a zero-length write.
    auto code = symbol < m_bit_codes.size() ? m_bit_codes[symbol] : 0u;
    auto length = symbol < m_bit_code_lengths.size() ? m_bit_code_lengths[symbol] : 0u;
    TRY(stream.write_bits(code, length));
    return {};
}

class DeflateDecompressor final : public GenericZlibDecompressor {
public:
    static ErrorOr<NonnullOwnPtr<DeflateDecompressor>> create(MaybeOwned<Stream>);
    static ErrorOr<ByteBuffer> decompress_all(ReadonlyBytes);

private:
    DeflateDecompressor(AK::FixedArray<u8> buffer, MaybeOwned<Stream> stream, z_stream* zstream)
        : GenericZlibDecompressor(move(buffer), move(stream), zstream)
    {
    }
};

class DeflateCompressor final : public GenericZlibCompressor {
public:
    static ErrorOr<NonnullOwnPtr<DeflateCompressor>> create(MaybeOwned<Stream>, GenericZlibCompressionLevel = GenericZlibCompressionLevel::Default);
    static ErrorOr<ByteBuffer> compress_all(ReadonlyBytes, GenericZlibCompressionLevel = GenericZlibCompressionLevel::Default);

private:
    DeflateCompressor(AK::FixedArray<u8> buffer, MaybeOwned<Stream> stream, z_stream* zstream)
        : GenericZlibCompressor(move(buffer), move(stream), zstream)
    {
    }
};

}
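
The `CanonicalCode` members above (a 256-entry prefix table for codes of up to 8 bits, plus per-symbol code and length tables) can be illustrated with a small standalone sketch. It uses only the C++ standard library and is not the LibCompress implementation: every name in it (`Code`, `PrefixEntry`, `assign_canonical_codes`, `reverse_bits`, `build_prefix_table`, `decode_symbol`) is hypothetical. It assumes code lengths of at most 15 bits, a valid prefix code, and a bit buffer that holds the next stream bits starting at its least significant bit. Codes longer than 8 bits are simply reported as "not in the table" instead of being decoded by a fallback search.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper types; not part of LibCompress.
struct Code {
    uint16_t bits;  // canonical code, most significant bit first
    uint8_t length; // 0 means the symbol is unused
};

struct PrefixEntry {
    uint16_t symbol;
    uint8_t length; // 0 means "no code of at most prefix_bits bits maps here"
};

constexpr std::size_t prefix_bits = 8;
constexpr std::size_t prefix_table_size = std::size_t(1) << prefix_bits;

// Assign canonical Huffman codes from per-symbol code lengths,
// following the procedure in RFC 1951, section 3.2.2.
// Assumption: every length is <= 15.
std::vector<Code> assign_canonical_codes(std::vector<uint8_t> const& lengths)
{
    constexpr int max_bits = 15;
    std::array<uint16_t, max_bits + 1> count {};
    for (auto length : lengths)
        count[length]++;
    count[0] = 0;

    // Smallest code value for each code length.
    std::array<uint16_t, max_bits + 1> next_code {};
    uint16_t code = 0;
    for (int bits = 1; bits <= max_bits; ++bits) {
        code = static_cast<uint16_t>((code + count[bits - 1]) << 1);
        next_code[bits] = code;
    }

    std::vector<Code> codes(lengths.size()); // value-initialized to zero
    for (std::size_t symbol = 0; symbol < lengths.size(); ++symbol) {
        uint8_t length = lengths[symbol];
        if (length != 0) {
            codes[symbol].bits = next_code[length]++;
            codes[symbol].length = length;
        }
    }
    return codes;
}

// Reverse the low `length` bits of `value`. Deflate packs Huffman codes
// starting with their most significant bit into an LSB-first bit stream,
// so a table indexed by raw stream bits needs the reversed code.
uint16_t reverse_bits(uint16_t value, uint8_t length)
{
    uint16_t result = 0;
    for (uint8_t i = 0; i < length; ++i)
        result = static_cast<uint16_t>((result << 1) | ((value >> i) & 1));
    return result;
}

// Fill every table index whose low `length` bits equal the reversed code.
std::array<PrefixEntry, prefix_table_size> build_prefix_table(std::vector<Code> const& codes)
{
    std::array<PrefixEntry, prefix_table_size> table {};
    for (std::size_t symbol = 0; symbol < codes.size(); ++symbol) {
        uint8_t length = codes[symbol].length;
        if (length == 0 || length > prefix_bits)
            continue;
        std::size_t step = std::size_t(1) << length;
        for (std::size_t index = reverse_bits(codes[symbol].bits, length); index < table.size(); index += step) {
            table[index].symbol = static_cast<uint16_t>(symbol);
            table[index].length = length;
        }
    }
    return table;
}

// Decode one symbol from the low bits of `bit_buffer` and consume its bits.
// Returns false when no code of at most prefix_bits bits matches.
bool decode_symbol(std::array<PrefixEntry, prefix_table_size> const& table, uint32_t& bit_buffer, uint16_t& symbol)
{
    PrefixEntry const& entry = table[bit_buffer & (prefix_table_size - 1)];
    if (entry.length == 0)
        return false;
    symbol = entry.symbol;
    bit_buffer >>= entry.length;
    return true;
}
```

With the code lengths `{ 3, 3, 3, 3, 3, 2, 4, 4 }` from the RFC 1951 example, `assign_canonical_codes` produces the codes listed there (for instance `010` for the first symbol and `00` for the sixth). The resulting table then resolves any of these symbols with one array lookup per symbol, because no code is a prefix of another and so each index belongs to at most one symbol.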