← Back to search

github Active

Repository profile

N0taN3rd/Squidwarc

Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head

JavaScript Apache-2.0 master Stack scanned README.md

Open website Open GitHub

Stars: 178
Forks: 25
Watchers: 9
Issues: 11
Commits: 119
Awesome lists: 1

Repository updates

Get generated N0taN3rd/Squidwarc development summaries by email, or follow the weekly and monthly RSS feeds.

Weekly RSS Monthly RSS

Activity and growth

Tracked growth, recent movement, and commit velocity from stored repository snapshots.

Latest capture 2026-07-13 03:02

Star growth, last 7 days: 0 0.0%
Commit velocity, last 7 days: 0 0.0%
Stars since baseline: +4
Snapshot coverage: 5

Tracked growth

5 captures since 2026-05-23

Stars from baseline +4

Time horizon

All tracked data

Custom start Custom end

Stars history

Total stars

Commits history

Default branch commits

Detected stack

Frameworks, package managers, ecosystems, and dependency manifests found during catalog scans.

Scanned 2026-07-13 03:02

Stack signals: 0
Package managers: 2
Manifest files: 2
Dependencies: 479

Frameworks and tools

No framework dependencies detected.

npm Yarn javascript

Dependency files

2 manifests

package.json javascript ecosystem, 38 dependencies
yarn.lock javascript ecosystem, 441 dependencies

Classification

Searchable topics, generated tags, and stack labels that explain where this repository fits.

Topics: 10
Tags: 0
Stacks: 0

Topics

#browser-automation #chrome #chrome-headless #crawler #crawling #headless-chrome #high-fidelity-preservation #puppeteer #webarchives #webarchiving

Generated tags

No generated tags yet.

Stack labels

No stack labels yet.

AI development signals

Agent instructions and tool configuration paths found in the repository tree.

0 paths

No AI development config files detected.

Similar repositories

Nearest indexed repositories by embedding similarity.

peterk/warcworker

A dockerized, queued high fidelity web archiver based on Squidwarc

62 stars

Python 1 awesome list

N0taN3rd/node-warc

Parse And Create Web ARChive (WARC) files with node.js

104 stars

JavaScript 1 awesome list

ArchiveTeam/grab-site

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

1,598 stars

Python 1 awesome list

helgeho/Web2Warc

An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)

26 stars

Scala 1 awesome list

s0rg/crawley

The unix-way web crawler

339 stars

Go 2 awesome lists

netarchivesuite/solrwayback

A search interface and wayback machine for the UKWA Solr based warc-indexer framework.

145 stars

Java 1 awesome list

Metadata

Language: JavaScript
License: Apache-2.0
Default branch: master
Created: 2017-07-20
First commit: 2017-07-20
Last pushed: 2020-05-19
GitHub updated: 2026-07-08
Last synced: 2026-07-13 03:02
Stack detected: 2026-07-13 03:02
Archived: no

Links and files

GitHub Website

https://n0tan3rd.github.io/Squidwarc/

README

Appears in

Awesome Web Archiving

N0taN3rd/Squidwarc

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

peterk/warcworker

N0taN3rd/node-warc

ArchiveTeam/grab-site

helgeho/Web2Warc

s0rg/crawley

netarchivesuite/solrwayback

Metadata

Links and files

Appears in

How it works

Pricing

Follow repository updates

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

peterk/warcworker

N0taN3rd/node-warc

ArchiveTeam/grab-site

helgeho/Web2Warc

s0rg/crawley

netarchivesuite/solrwayback

Metadata

Links and files

Appears in