Awesome Web Archiving

An Awesome List for getting started with web archiving

iipc/awesome-web-archiving #awesome #awesome-list #webarchiving
List stars: 2,561
README repos: 105
Indexed repos: 103
List commits: 162
Forks: 193
Open issues: 9

Tracked list growth

GitHub stars and default-branch commits for iipc/awesome-web-archiving.

Latest scan 2026-06-02 10:49

[Charts: likes history (GitHub stars) and commits history (default-branch commits)]

Indexed repositories

103 repos currently saved from this list.

Latest repo push 2026-05-30

wabarc/wayback

An archiving tool with an IM-style interface that prioritizes privacy and accessibility, integrated with various archival services including Internet Archive, archive.today, Ghostarchive, IPFS, Telegraph, and file systems.

Go | #archive #har #heroku #internet-archive | pushed 2026-05-28 | 523 commits | first commit 2020-06-13 | 3 list mentions

WikiTeam/wikiteam

Tools for downloading and preserving wikis. We archive wikis, from Wikipedia to the tiniest wikis. As of 2026, WikiTeam has preserved more than 600,000 wikis.

Python | #archive-wikis #backup #digital-preservation #dump | pushed 2026-01-10 | 1,143 commits | first commit 2011-04-05 | 1 list mention

helgeho/ArchiveSpark

An Apache Spark framework for easy data processing, extraction, and derivation for web archives and archival collections, developed at the Internet Archive.

Scala | #archivespark #internet-archive #spark #spark-framework | pushed 2025-10-08 | 154 commits | first commit 2015-08-06 | 1 list mention