Sign in
← Back to search

peterk/warcworker

A dockerized, queued high fidelity web archiver based on Squidwarc

Stars
62
Forks
9
Commits
34
Language
Python
Awesome lists
1

Similar repositories

N0taN3rd/Squidwarc

Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head

174 stars
JavaScript 1 awesome list

N0taN3rd/node-warc

Parse And Create Web ARChive (WARC) files with node.js

104 stars
JavaScript 1 awesome list

netarchivesuite/solrwayback

A search interface and wayback machine for the UKWA Solr based warc-indexer framework.

144 stars
Java 1 awesome list

chfoo/warcat

Tool and library for handling Web ARChive (WARC) files.

165 stars
Python 1 awesome list

webrecorder/warcio

Streaming WARC/ARC library for fast web archive IO

458 stars
Python 1 awesome list

Tracked growth

2 captures since 2026-05-23

Latest capture 2026-05-31 03:02

Stars history

Total stars

Commits history

Default branch commits

Metadata

  • Created: 2018-07-21
  • First commit: 2018-07-21
  • Last pushed: 2024-07-09
  • Archived: no
  • Stack detected: —
  • License: GPL-3.0

AI development signals

No AI development config files detected.