webrecorder/browsertrix
Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!
Run a high-fidelity browser-based web archiving crawler in a single Docker container
Make a ZIM file from any Web site and surf offline!
brozzler - distributed browser-based web crawler
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
Squidwarc is a high-fidelity, user-scriptable archival crawler that uses Chrome or Chromium in headed or headless mode
The unix-way web crawler