Use preferred Browsertrix Crawler arguments and fix multiple/file seeds support

This commit is contained in:
benoit74 2025-02-13 15:30:30 +00:00
parent dc6b5aafb7
commit ed1a8a0aa9
No known key found for this signature in database
GPG key ID: B89606434FC7B530
6 changed files with 75 additions and 46 deletions

View file

@ -13,6 +13,14 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
- Keep temporary folder when crawler or warc2zim fails, even if not asked for (#468)
- Add many missing Browsertrix Crawler arguments ; drop default overrides by zimit ; drop `--noMobileDevice` setting (not needed anymore) (#433)
- Document all Browsertrix Crawler default arguments values (#416)
- Use preferred Browsertrix Crawler arguments names:
- `--seeds` instead of `--url`
- `--seedFile` instead of `--urlFile`
- `--pageLimit` instead of `--limit`
- `--pageLoadTimeout` instead of `--timeout`
- `--scopeIncludeRx` instead of `--include`
- `--scopeExcludeRx` instead of `--exclude`
- `--pageExtraDelay` instead of `--delay`
### Fixed