internetarchive/warcprox
WARC writing MITM HTTP/S proxy
brozzler - distributed browser-based web crawler
The unix-way web crawler
Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!
Run a high-fidelity browser-based web archiving crawler in a single Docker container
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Make a ZIM file from any Web site and surf offline!