ikreymer/webarchive-indexing

Tools for bulk indexing of WARC/ARC files on Hadoop, EMR, or a local file system.

  • Stars: 47
  • Forks: 12
  • Commits: 47
  • Language: Python
  • Awesome lists: 1

Similar repositories

netarchivesuite/solrwayback

A search interface and wayback machine for the UKWA Solr based warc-indexer framework.

144 stars, Java, 1 awesome list

internetarchive/arch

Web application for distributed compute analysis of Archive-It web archive collections.

20 stars, Scala, 1 awesome list

internetarchive/warctools

Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)

174 stars, Python, 1 awesome list

Tracked growth

2 captures since 2026-05-23

Latest capture 2026-05-31 03:01

[Charts: stars history (total stars); commits history (default branch commits)]

Metadata

  • Created: 2015-03-09
  • First commit: 2015-02-26
  • Last pushed: 2017-12-04
  • Archived: no
  • Stack detected: none
  • License: MIT

AI development signals

No AI development config files detected.