netarchivesuite/solrwayback

A search interface and wayback machine for the UKWA Solr based warc-indexer framework.

  • Stars: 144
  • Forks: 28
  • Commits: 3107
  • Language: Java
  • Awesome lists: 1

Similar repositories

ukwa/webarchive-discovery

Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in this repo is now only for reference. For support and issues of 'warc-indexer', please communicate with NetArchiveSuite.

132 stars
Java 1 awesome list

N0taN3rd/Squidwarc

Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head

174 stars
JavaScript 1 awesome list

ArchiveTeam/grab-site

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

1572 stars
Python 1 awesome list

helgeho/Web2Warc

An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)

26 stars
Scala 1 awesome list

internetarchive/warctools

Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)

174 stars
Python 1 awesome list

Tracked growth

2 captures since 2026-05-23

Latest capture 2026-05-31 03:02

[Chart: Stars history (total stars)]

[Chart: Commits history (default branch commits)]

Metadata

  • Created: 2017-02-08
  • First commit: 2017-02-08
  • Last pushed: 2026-05-07
  • Archived: no
  • Stack detected: —
  • License: Apache-2.0

AI development signals

No AI development config files detected.