internetarchive/warctools
Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)
Web archive deduplication tools
Tool and library for handling Web ARChive (WARC) files.
Java library for reading and writing WARC files with a typed API
Streaming WARC/ARC library for fast web archive IO
Please note that the warc-indexer tool and code are now maintained by NetArchiveSuite. The 'warc-indexer' directory and code in this repo are kept for reference only. For support and issues with 'warc-indexer', please contact NetArchiveSuite.
Converts WARC files to static HTML