github Active

Repository profile

webrecorder/browsertrix-crawler

Run a high-fidelity browser-based web archiving crawler in a single Docker container

TypeScript AGPL-3.0 main Stack scanned README.md

Open website Open GitHub

Stars: 1,079
Forks: 147
Watchers: 22
Issues: 137
Commits: 696
Awesome lists: 1

Repository updates

Get generated webrecorder/browsertrix-crawler development summaries by email, or follow the weekly and monthly RSS feeds.

Weekly RSS Monthly RSS

Activity and growth

Tracked growth, recent movement, and commit velocity from stored repository snapshots.

Latest capture 2026-07-13 03:03

Star growth, last 7 days: 0 0.0%
Commit velocity, last 7 days: 0 0.0%
Stars since baseline: +37
Snapshot coverage: 5

Tracked growth

5 captures since 2026-05-23

Stars from baseline +37

Time horizon

All tracked data

Custom start Custom end

Stars history

Total stars

Commits history

Default branch commits

Detected stack

Frameworks, package managers, ecosystems, and dependency manifests found during catalog scans.

Scanned 2026-07-13 03:03

Stack signals: 0
Package managers: 3
Manifest files: 3
Dependencies: 715

Frameworks and tools

No framework dependencies detected.

npm pip Yarn javascript python

Dependency files

3 manifests

package.json javascript ecosystem, 51 dependencies
requirements.txt python ecosystem, 1 dependency
yarn.lock javascript ecosystem, 663 dependencies

Classification

Searchable topics, generated tags, and stack labels that explain where this repository fits.

Topics: 7
Tags: 0
Stacks: 0

Topics

#crawler #crawling #wacz #warc #web-archiving #web-crawler #webrecorder

Generated tags

No generated tags yet.

Stack labels

No stack labels yet.

AI development signals

Agent instructions and tool configuration paths found in the repository tree.

0 paths

No AI development config files detected.

Similar repositories

Nearest indexed repositories by embedding similarity.

webrecorder/browsertrix

Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!

437 stars

TypeScript 1 awesome list

openzim/zimit

Make a ZIM file from any Web site and surf offline!

813 stars

Python 1 awesome list

internetarchive/brozzler

brozzler - distributed browser-based web crawler

807 stars

Python 1 awesome list

internetarchive/heritrix3

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

3,271 stars

Java 1 awesome list

trickstercache/trickster

Open Source HTTP Reverse Proxy Cache and Time Series Dashboard Accelerator

2,083 stars

Go 1 awesome list

retracedhq/retraced

🔥 A fully open source audit logs service and embeddable UI easily deployed to your own Kubernetes cluster. Brought to you by replicated.com and boxyhq.com 🚀

446 stars

TypeScript 1 awesome list

Metadata

Language: TypeScript
License: AGPL-3.0
Default branch: main
Created: 2020-11-02
First commit: 2020-10-31
Last pushed: 2026-07-11
GitHub updated: 2026-07-12
Last synced: 2026-07-13 03:03
Stack detected: 2026-07-13 03:03
Archived: no

Links and files

GitHub Website

https://crawler.docs.browsertrix.com

README

Appears in

Awesome Web Archiving

webrecorder/browsertrix-crawler

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

webrecorder/browsertrix

openzim/zimit

internetarchive/brozzler

internetarchive/heritrix3

trickstercache/trickster

retracedhq/retraced

Metadata

Links and files

Appears in

How it works

Pricing

Follow repository updates

Activity and growth

Tracked growth

Time horizon

Stars history

Commits history

Detected stack

Frameworks and tools

Dependency files

Classification

Topics

Generated tags

Stack labels

AI development signals

Similar repositories

webrecorder/browsertrix

openzim/zimit

internetarchive/brozzler

internetarchive/heritrix3

trickstercache/trickster

retracedhq/retraced

Metadata

Links and files

Appears in