Stars: 50
Forks: 9
Commits: 301
Language: Ruby
Awesome lists: 1

Similar repositories

archivesunleashed/aut

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

157 stars
Scala 1 awesome list

internetarchive/arch

Web application for distributed compute analysis of Archive-It web archive collections.

20 stars
Scala 1 awesome list

ukwa/webarchive-discovery

Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in this repo is now only for reference. For support and issues of 'warc-indexer', please communicate with NetArchiveSuite.

132 stars
Java 1 awesome list

0xMassi/webclaw

Fast, local-first web content extraction for LLMs. Scrape, crawl, extract structured data — all from Rust. CLI, REST API, and MCP server.

1261 stars
Rust 2 awesome lists

helgeho/Web2Warc

An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)

26 stars
Scala 1 awesome list

Tracked growth

2 captures since 2026-05-23

Latest capture 2026-05-31 03:01

[Chart: Stars history (total stars)]

[Chart: Commits history (default branch commits)]

Metadata

AI development signals

No AI development config files detected.