Sign in

Awesome List

Awesome Web Archiving

An Awesome List for getting started with web archiving

iipc/awesome-web-archiving #awesome#awesome-list#webarchiving
List stars
2,562
README repos
105
Indexed repos
103
List commits
162
Forks
193
Open issues
9

Tracked list growth

GitHub stars and default-branch commits for iipc/awesome-web-archiving.

Latest scan 2026-06-03 10:49

Likes history

GitHub stars

Commits history

Default branch commits

Indexed repositories

103 repos currently saved from this list.

No filters applied
Latest repo push 2026-05-30

Filter this list

Search within Awesome Web Archiving or narrow by ecosystem and project health.

Search mode
Tune results
More filters Topics, generated tags, stack, age, archive status, and growth.
Ecosystem
Health

Uses known first-commit dates.

Momentum
Reset filters
Highlighted

Open highlighted repo slot

Put your repository first

Promote a GitHub repo at the top of Awesome repository list views for 7 days.

wabarc/wayback

An archiving tool with an IM-style interface that prioritizes privacy and accessibility, integrated with various archival services including Internet Archive, archive.today, Ghostarchive, IPFS, Telegraph, and file systems.

Go #archive#har#heroku#internet-archive pushed 2026-05-28 523 commits first commit 2020-06-13 3 list mentions
WikiTeam/wikiteam

Tools for downloading and preserving wikis. We archive wikis, from Wikipedia to tiniest wikis. As of 2026, WikiTeam has preserved more than 600,000 wikis.

Python #archive-wikis#backup#digital-preservation#dump pushed 2026-01-10 1,143 commits first commit 2011-04-05 1 list mention
helgeho/ArchiveSpark

An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.

Scala #archivespark#internet-archive#spark#spark-framework pushed 2025-10-08 154 commits first commit 2015-08-06 1 list mention