Commit graph

653 commits

Author SHA1 Message Date
benoit74
270f5dbaae
Merge pull request #446 from elfkuzco/encoding-aliases
provide default encoding aliases
2025-03-20 16:28:32 +01:00
Uchechukwu Orji
9bf93258e8
make encoding aliases case-insensitive 2025-03-20 13:28:47 +01:00
Uchechukwu Orji
67e74d1a23
provide default encoding aliases 2025-03-17 14:39:53 +01:00
benoit74
b66d6a2692
Prepare for 2.2.3 2025-02-17 09:48:52 +00:00
benoit74
5796721b8d
Release 2.2.2 2025-02-17 09:45:04 +00:00
benoit74
a3e4f384ed
Merge pull request #439 from openzim/upgrade_deps
Upgrade dependencies especially zimscraperlib 5.1.1
2025-02-17 10:04:24 +01:00
benoit74
a90e1f40e6
Upgrade dependencies especially zimscraperlib 5.1.1 2025-02-17 08:49:55 +00:00
benoit74
62d3fe556c
Prepare for 2.2.2 2025-02-07 08:35:54 +00:00
benoit74
693a30c423
Update pypa pypi publish action 2025-02-07 08:32:21 +00:00
benoit74
254ac56ef7
Release 2.2.1 2025-02-07 08:29:42 +00:00
benoit74
c0644278f6
Merge pull request #428 from openzim/fork_cdxj_indexer
Fork cdxj indexer
2025-02-06 14:49:28 +01:00
benoit74
7b53d8a463
Add back software architecture which was removed by mistake + update 2025-02-06 13:42:32 +00:00
benoit74
c590b28ef5
Remove unused code/tests + 'modernize' 2025-02-06 13:42:32 +00:00
benoit74
77992b4fc5
Fork cdxj_indexer files as-of upstream commit 9ad2b9e 2025-02-06 13:41:17 +00:00
benoit74
acc8b06388
Merge pull request #434 from openzim/upgrade_deps
Upgrade dependencies - Python 3.13
2025-02-03 15:53:15 +01:00
benoit74
cd3251b978
Fix linter / type checker issues 2025-02-03 14:50:58 +00:00
benoit74
4c584cab75
Upgrade dependencies - Python 3.13 2025-02-03 14:41:48 +00:00
benoit74
eeeb554346
Prepare for 2.2.1 2025-01-10 10:02:46 +00:00
benoit74
6d7bf10c5b
Release 2.2.0 2025-01-10 09:51:39 +00:00
benoit74
dcdd34be9c
Merge pull request #430 from openzim/scraperlib_rc3
Use scraperlib 5.0.0rc3
2025-01-07 17:20:32 +01:00
benoit74
606c4e5cbb
Use scraperlib 5.0.0rc3 2025-01-07 16:16:45 +00:00
benoit74
1ba52851ac
Merge pull request #418 from openzim/scraperlib_4_1
Use HTML/JS/CSS functions extracted in zimscraperlib and adapt to scraperlib 5
2025-01-07 17:00:16 +01:00
benoit74
1218df0560
Adapt to zimscraperlib 5.0.0 - including all rewriting logic moved there - and upgrade other dependencies 2025-01-07 15:53:33 +00:00
benoit74
5040eeeffb
Merge pull request #425 from openzim/double_slash_main
Stop checking main entry processability when it is already found
2024-11-27 11:23:32 +01:00
benoit74
1e6367c712
Add test website cases for double slash 2024-11-26 14:16:18 +00:00
benoit74
8fb6478744
Stop checking main entry processability when it is already found 2024-11-25 21:39:53 +00:00
benoit74
8733200fac
Merge pull request #334 from openzim/iframe_rewrite
Document wombat settings and change wombat mode to fix major issue of MDN ZIM
2024-11-15 16:40:06 +01:00
benoit74
1eb41ff08e
Upgrade to wombat 3.8.6 2024-11-15 10:01:51 +00:00
benoit74
161b32313b
Update CHANGELOG 2024-11-14 14:18:14 +00:00
benoit74
9c94b0cd09
Upgrade to wombat 3.8.4 2024-11-14 13:52:12 +00:00
benoit74
db3037ed6a
Set isSW to false and add documentation on wbInfo
See
https://github.com/webrecorder/wombat/issues/155#issuecomment-2183191941
for details about why we need to set isSW to false
2024-11-14 13:52:12 +00:00
benoit74
20bf43f23e
Prepare for 2.1.4 2024-11-01 13:20:54 +00:00
benoit74
a8b934fdde
Release 2.1.3 2024-11-01 13:16:42 +00:00
benoit74
7286db3e7f
Merge pull request #414 from openzim/update_deps
Upgrade to wombat 3.8.3
2024-11-01 09:40:51 +01:00
benoit74
a7a74819d5
Upgrade to wombat 3.8.3 2024-11-01 08:34:45 +00:00
benoit74
457c991be4
Enhance test website with a <form> test case 2024-10-11 19:13:23 +00:00
benoit74
c56d4ce88b
Prepare for 2.1.3 2024-10-08 12:30:48 +00:00
benoit74
dc36cd80e7
Release 2.1.2 2024-10-08 12:06:39 +00:00
benoit74
f56bdea7dc
Merge pull request #407 from openzim/upgrade_deps
Upgrade dependencies, including wombat 3.8.2
2024-10-08 13:48:41 +02:00
benoit74
29307e6b69
Upgrade dependencies, including wombat 3.8.2 2024-10-08 11:27:22 +00:00
benoit74
38e590232d
Merge pull request #406 from openzim/html_as_fetch
HTML document can be retrieved as `fetch`
2024-10-08 13:16:18 +02:00
benoit74
3c7363f050
Fix type hints and add CHANGELOG
Note: the two type hints on content and mimetype are just linked to
https://github.com/openzim/python-scraperlib/issues/196
and will have to be reverted once this issue is fixed
2024-10-08 10:02:43 +00:00
benoit74
5d452e17c4
HTML documents can be retrieved as 'fetch' as well (fix #405) 2024-10-08 09:52:38 +00:00
benoit74
6534213a6c
Add test case on test website around pictures srcsets 2024-10-08 09:51:45 +00:00
benoit74
c8e1e96447
Delete .github/dependabot.yml
Remove dependabot; it creates too much noise and risk for the scraper's stability
2024-09-12 21:19:21 +02:00
benoit74
43d72209e7
Prepare for 2.1.2 2024-09-05 07:15:44 +00:00
benoit74
1737ab3327
Release 2.1.1 2024-09-05 07:13:44 +00:00
benoit74
9de0d96b63
Merge pull request #388 from openzim/dependabot/pip/production-dependencies-e14423a93f
Bump pyright from 1.1.378 to 1.1.379 in the production-dependencies group
2024-09-05 09:06:21 +02:00
dependabot[bot]
72bb166491
Bump pyright in the production-dependencies group
Bumps the production-dependencies group with 1 update: [pyright](https://github.com/RobertCraigie/pyright-python).


Updates `pyright` from 1.1.378 to 1.1.379
- [Release notes](https://github.com/RobertCraigie/pyright-python/releases)
- [Commits](https://github.com/RobertCraigie/pyright-python/compare/v1.1.378...v1.1.379)

---
updated-dependencies:
- dependency-name: pyright
  dependency-type: direct:production
  update-type: version-update:semver-patch
  dependency-group: production-dependencies
...

Signed-off-by: dependabot[bot] <support@github.com>
2024-09-04 17:33:34 +00:00
benoit74
5592f8cb6e
Merge pull request #386 from openzim/upgrade_deps
Upgrade deps, especially wombat 3.8.0
2024-09-03 16:12:59 +02:00