netarchivesuite/jwat
Java Web Archive Toolkit
JWAT Tools
Java Web Archive Toolkit
Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)
Java library for reading and writing WARC files with a typed API
Tool and library for handling Web ARChive (WARC) files.
Web archive deduplication tools
Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in this repo is now only for reference. For support and issues of 'warc-indexer', please communicate with NetArchiveSuite.
2 captures since 2026-05-23