N0taN3rd/Squidwarc

Squidwarc is a high-fidelity, user-scriptable archival crawler that uses Chrome or Chromium, with or without a head

Stars: 174
Forks: 25
Commits: 119
Language: JavaScript
Awesome lists: 1

Similar repositories

peterk/warcworker

A dockerized, queued high fidelity web archiver based on Squidwarc

62 stars · Python · 1 awesome list

N0taN3rd/node-warc

Parse And Create Web ARChive (WARC) files with node.js

104 stars · JavaScript · 1 awesome list

ArchiveTeam/grab-site

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

1572 stars · Python · 1 awesome list

helgeho/Web2Warc

An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)

26 stars · Scala · 1 awesome list

s0rg/crawley

The unix-way web crawler

340 stars · Go · 2 awesome lists

netarchivesuite/solrwayback

A search interface and wayback machine for the UKWA Solr based warc-indexer framework.

144 stars · Java · 1 awesome list

Tracked growth

2 captures since 2026-05-23

Latest capture 2026-05-31 03:02

[Chart: Stars history (total stars)]

[Chart: Commits history (default branch commits)]

Metadata

AI development signals

No AI development config files detected.