Sign in
← Back to search
Stars
140
Forks
18
Commits
1445
Language
Rust
Awesome lists
1

Similar repositories

harvard-lil/warcbench

A tool for exploring, analyzing, transforming, recombining, and extracting data from WARC (Web ARChive) files.

14 stars
Python 1 awesome list

chfoo/warcat

Tool and library for handling Web ARChive (WARC) files.

165 stars
Python 1 awesome list

archivesunleashed/aut

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

157 stars
Scala 1 awesome list

astral-sh/ruff

An extremely fast Python linter and code formatter, written in Rust.

47781 stars
Rust 3 awesome lists

kreuzberg-dev/kreuzberg

A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, TypeScript (Node/Bun/Wasm/Deno)- or use via CLI, REST API, or MCP server.

8433 stars
Rust 2 awesome lists

iipc/jwarc

Java library for reading and writing WARC files with a typed API

59 stars
Java 1 awesome list

Tracked growth

2 captures since 2026-05-23

Latest capture 2026-05-31 03:01

Stars history

Total stars

Commits history

Default branch commits

Metadata

  • Created: 2021-06-22
  • First commit: 2021-06-04
  • Last pushed: 2026-05-29
  • Website: https://resiliparse.chatnoir.eu
  • Archived: no
  • Stack detected: —
  • License: Apache-2.0

AI development signals

No AI development config files detected.