N0taN3rd/Squidwarc
Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
A dockerized, queued high fidelity web archiver based on Squidwarc
Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
Parse And Create Web ARChive (WARC) files with node.js
A search interface and wayback machine for the UKWA Solr based warc-indexer framework.
Tool and library for handling Web ARChive (WARC) files.
Streaming WARC/ARC library for fast web archive IO
WarcDB: Web crawl data as SQLite databases.
2 captures since 2026-05-23