N0taN3rd/Squidwarc
Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
Wget-compatible web downloader and crawler.
The unix-way web crawler
A search interface and wayback machine for the UKWA Solr based warc-indexer framework.
🍨 High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.
The unified web layer for AI agents. Search (8 engines), stealth browse, auth, and act on 24 platforms. One npm install, self-hosted.
2 captures since 2026-05-23