← Back to search

github Active

Repository profile

turicas/crau

Easy-to-use Web archiver

Python LGPL-3.0 develop Stack scanned README.md

Open GitHub

Stars: 64
Forks: 10
Watchers: 2
Issues: 11
Commits: 76
Awesome lists: 1

Repository updates

Get generated turicas/crau development summaries by email, or follow the weekly and monthly RSS feeds.

Weekly RSS Monthly RSS

Activity and growth

Tracked growth, recent movement, and commit velocity from stored repository snapshots.

Latest capture 2026-07-13 03:02

Star growth, last 7 days: 0 0.0%
Commit velocity, last 7 days: 0 0.0%
Stars since baseline: 0
Snapshot coverage: 5

Tracked growth

5 captures since 2026-05-23

Stars from baseline 0

Time horizon

All tracked data

Custom start Custom end

Stars history

Total stars

Commits history

Default branch commits

Detected stack

Frameworks, package managers, ecosystems, and dependency manifests found during catalog scans.

Scanned 2026-07-13 03:02

Stack signals: 2
Package managers: 1
Manifest files: 3
Dependencies: 17

Frameworks and tools

pytest test framework · high confidence
Scrapy crawler framework · high confidence

pip python

Dependency files

3 manifests

requirements-development.txt python ecosystem, 7 dependencies
requirements.txt python ecosystem, 5 dependencies
setup.py python ecosystem, 5 dependencies

Classification

Searchable topics, generated tags, and stack labels that explain where this repository fits.

Topics: 0
Tags: 0
Stacks: 2

Topics

No topics indexed.

Generated tags

No generated tags yet.

Stack labels

pytest Scrapy

AI development signals

Agent instructions and tool configuration paths found in the repository tree.

0 paths

No AI development config files detected.

Similar repositories

Nearest indexed repositories by embedding similarity.

karust/gogetcrawl

Extract web archive data using Wayback Machine and Common Crawl

183 stars

Go 1 awesome list

chfoo/warcat

Tool and library for handling Web ARChive (WARC) files.

165 stars

Python 1 awesome list

oduwsdl/archivenow

A Tool To Push Web Resources Into Web Archives

434 stars

Python 1 awesome list

ArchiveTeam/grab-site

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

1,598 stars

Python 1 awesome list

webrecorder/warcio

Streaming WARC/ARC library for fast web archive IO

459 stars

Python 1 awesome list

Florents-Tselai/WarcDB

WarcDB: Web crawl data as SQLite databases.

406 stars

Python 1 awesome list

Metadata

Language: Python
License: LGPL-3.0
Default branch: develop
Created: 2019-10-26
First commit: 2019-10-26
Last pushed: 2026-04-13
GitHub updated: 2025-12-05
Last synced: 2026-07-13 03:02
Stack detected: 2026-07-13 03:02
Archived: no

Links and files

GitHub README

Appears in

Awesome Web Archiving

turicas/crau

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

karust/gogetcrawl

chfoo/warcat

oduwsdl/archivenow

ArchiveTeam/grab-site

webrecorder/warcio

Florents-Tselai/WarcDB

Metadata

Links and files

Appears in

How it works

Pricing

Follow repository updates

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

karust/gogetcrawl

chfoo/warcat

oduwsdl/archivenow

ArchiveTeam/grab-site

webrecorder/warcio

Florents-Tselai/WarcDB

Metadata

Links and files

Appears in