browsertrix-crawler/tests
Ilya Kreymer bb9c82493b
QA Crawl Support (Beta) (#469)
Initial (beta) support for QA/replay crawling!
- Supports running a crawl over a given WACZ / list of WACZ (multi WACZ) input, hosted in ReplayWeb.page
- Runs local http server with full-page, ui-less ReplayWeb.page embed
- ReplayWeb.page release version configured in the Dockerfile, pinned ui.js and sw.js fetched directly from cdnjs

Can be deployed with `webrecorder/browsertrix-crawler qa` entrypoint.
- Requires `--qaSource`, pointing to WACZ or multi-WACZ json that will be replay/QAd
- Also supports `--qaRedisKey` where QA comparison data will be pushed, if specified.
- Supports `--qaDebugImageDiff` for outputting crawl / replay/ diff
images.
- If using --writePagesToRedis, a `comparison` key is added to existing page data where:
```
  comparison: {
    screenshotMatch?: number;
    textMatch?: number;
    resourceCounts: {
      crawlGood?: number;
      crawlBad?: number;
      replayGood?: number;
      replayBad?: number;
    };
  };
  ```
- bump version to 1.1.0-beta.2
2024-03-22 17:32:42 -07:00
..
custom-behaviors Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
fixtures Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
invalid-behaviors detect invalid custom behaviors on load: (#450) 2023-12-13 15:14:53 -05:00
.DS_Store tests text extraction (#30) 2021-03-01 16:00:23 -08:00
adblockrules.test.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
add-exclusion.test.js QA Crawl Support (Beta) (#469) 2024-03-22 17:32:42 -07:00
basic_crawl.test.js WARC filename prefix + rollover size + improved 'livestream' / truncated response support. (#440) 2023-12-07 23:02:55 -08:00
blockrules.test.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
collection_name.test.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
config_file.test.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
config_stdin.test.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
crawl_overwrite.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
custom-behavior.test.js detect invalid custom behaviors on load: (#450) 2023-12-13 15:14:53 -05:00
custom_driver.test.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
extra_hops_depth.test.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
file_stats.test.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
limit_reached.test.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
log_filtering.test.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
mult_url_crawl_with_favicon.test.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
pageinfo-records.test.js Better tracking of failed requests + logging context exclude (#485) 2024-03-07 11:35:53 -05:00
qa_compare.test.js QA Crawl Support (Beta) (#469) 2024-03-22 17:32:42 -07:00
redis_crawl_state.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
saved-state.test.js QA Crawl Support (Beta) (#469) 2024-03-22 17:32:42 -07:00
scopes.test.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
screenshot.test.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
seeds.test.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
sitemap-parse.test.js QA Crawl Support (Beta) (#469) 2024-03-22 17:32:42 -07:00
storage.test.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
text-extract.test.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
url_file_list.test.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00
warcinfo.test.js Add Prettier to the repo, and format all the files! (#428) 2023-11-09 16:11:11 -08:00