natliblux/warc-safe

A tool for detecting viruses and NSFW material in WARC files

  • Stars: 18
  • Forks: 1
  • Commits: 30
  • Language: Python
  • Awesome lists: 1

Similar repositories

chfoo/warcat

Tool and library for handling Web ARChive (WARC) files.

165 stars
Python 1 awesome list

recrm/ArchiveTools

A collection of tools for archiving and analysing the internet.

78 stars
Python 1 awesome list

internetarchive/warctools

Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)

174 stars
Python 1 awesome list

webrecorder/warcio

Streaming WARC/ARC library for fast web archive IO

458 stars
Python 1 awesome list

ukwa/webarchive-discovery

Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in this repo is now only for reference. For support and issues of 'warc-indexer', please communicate with NetArchiveSuite.

132 stars
Java 1 awesome list

Tracked growth

2 captures since 2026-05-23

Latest capture 2026-05-31 03:02

Stars history (chart: total stars)

Commits history (chart: default branch commits)

Metadata

  • Created: 2024-05-03
  • First commit: 2024-05-03
  • Last pushed: 2026-05-20
  • Archived: no
  • Stack detected: —
  • License: GPL-3.0

AI development signals

No AI development config files detected.