browsertrix-crawler/util
Ilya Kreymer 1a1b9b4bff misc fixes:
- allow specifying custom redis start args via REDIS_ARGS env var, parse with splitArgsQuoteSafe()
- unify checking crawl should be stopped, also check when trying to get new page
- if getting new page failed, just return, avoid null dereference
- support adding offset to '-X' ordinal at the end via CRAWL_INDEX_OFFSET env var
2023-08-15 18:41:28 -07:00
..
argParser.js misc fixes: 2023-08-15 18:41:28 -07:00
blockrules.js Switch back to Puppeteer from Playwright (#301) 2023-04-26 15:41:35 -07:00
browser.js profiles: use newly provided puppeteer page.setBypassServiceWorker() (#340) 2023-07-06 10:09:32 -04:00
constants.js Switch back to Puppeteer from Playwright (#301) 2023-04-26 15:41:35 -07:00
file_reader.js feat: Add custom behavior injection (#285) 2023-07-06 13:09:48 -07:00
healthcheck.js Logger cleanup (#254) 2023-03-17 14:24:44 -07:00
logger.js Log fatal messages to redis errors (#305) 2023-05-07 00:43:19 -07:00
originoverride.js Origin Overrides: Ensure Host header also set (#326) 2023-05-30 19:25:37 -07:00
redis.js optimize shutdown: if after interrupt signal was received, redis connection is gone, assume crawler is being terminated and exit quickly, (#292) 2023-04-24 09:50:49 -07:00
screencaster.js Catch loading issues (#255) 2023-03-20 18:31:37 -07:00
screenshots.js Improve thumbnails with sharp (#304) 2023-05-19 11:30:24 -07:00
seeds.js args parsing: fix parseRx() for inclusions/exclusions to deal with non-string types (fixes #352) (#353) 2023-08-13 15:08:36 -07:00
state.js state: adjust redis keys to be more consistent (#309) 2023-05-07 13:01:24 -07:00
storage.js Fix disk utilization computation errors (#338) 2023-07-05 21:58:28 -07:00
textextract.js Convert to ESM (#179) 2022-11-15 18:30:27 -08:00
timing.js Add option for sleep interval after behaviors run + timing cleanup (#257) 2023-03-22 11:50:18 -07:00
worker.js misc fixes: 2023-08-15 18:41:28 -07:00