mirror of
https://github.com/webrecorder/browsertrix-crawler.git
synced 2025-12-08 06:09:48 +00:00
* ad blocking via request interception, extending block rules system, adding new AdBlockRules * Load list of hosts to block from https://raw.githubusercontent.com/StevenBlack/hosts/master/hosts added as json on image build * Enabled via --blockAds and setting a custom message via --adBlockMessage * new test to check for ad blocking * Add test-crawls dir to .gitignore and .dockerignore |
||
|---|---|---|
| .. | ||
| fixtures | ||
| .DS_Store | ||
| adblockrules.test.js | ||
| basic_crawl.test.js | ||
| blockrules.test.js | ||
| collection_name.test.js | ||
| config_file.test.js | ||
| config_stdin.test.js | ||
| custom_driver.test.js | ||
| extra_hops_depth.test.js | ||
| scopes.test.js | ||
| url_file_list.test.js | ||
| warcinfo.test.js | ||