Commit graph

5 commits

Author SHA1 Message Date
benoit74
1218df0560
Adapt to zimscraperlib 5.0.0 - including all rewriting logic moved there - and upgrade other dependencies 2025-01-07 15:53:33 +00:00
benoit74
f8fe4bc6ff
Add option to specify how many first bytes to consider when searching for content charsets in header 2024-06-18 12:01:01 +00:00
benoit74
ab46842e5b
Add options to disable automatic charset retrieval 2024-06-17 11:37:57 +00:00
benoit74
b1c8a35212
Decode content bytes only with supplied charset or static list of charsets to try 2024-06-17 07:25:11 +00:00
benoit74
be3c0e3919
Use same automatic encoding detection for all contents
Only HTML content was benefiting from automatic encoding detection.
CSS used custom tinycss detection method
Now, all content type (HTML, JS, CSS and JSON) use the same automatic
encoding detection.
2024-06-10 13:28:46 +00:00