github Active

Repository profile

harvard-lil/scoop

🍨 High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.

JavaScript MIT main Stack scanned README.md

Stars: 204
Forks: 12
Watchers: 7
Issues: 15
Commits: 1,255
Awesome lists: 1

Activity and growth

Tracked growth, recent movement, and commit velocity from stored repository snapshots.

Latest capture 2026-07-12 03:12

Star growth, last 7 days: 0 (0.0%)
Commit velocity, last 7 days: 0 (0.0%)
Stars since baseline: +5
Snapshot coverage: 5

Tracked growth

5 captures since 2026-05-23

Charts (all tracked data): stars history (total stars) and commits history (default branch commits).

Detected stack

Frameworks, package managers, ecosystems, and dependency manifests found during catalog scans.

Scanned 2026-07-12 03:12

Stack signals: 2
Package managers: 3
Manifest files: 5
Dependencies: 558

Frameworks and tools

Express web framework · high confidence
Flask web framework · high confidence

Package managers: npm, pip, Poetry
Ecosystems: JavaScript, Python

Dependency files

5 manifests

package.json javascript ecosystem, 24 dependencies
package-lock.json javascript ecosystem, 482 dependencies
.services/signer/pyproject.toml python ecosystem, 4 dependencies
.services/signer/requirements.txt python ecosystem, 24 dependencies
.services/signer/poetry.lock python ecosystem, 24 dependencies

Classification

Searchable topics, generated tags, and stack labels that explain where this repository fits.

Topics: 0
Tags: 0
Stacks: 2

Topics

No topics indexed.

Generated tags

No generated tags yet.

Stack labels

Express, Flask

AI development signals

Agent instructions and tool configuration paths found in the repository tree.

0 paths

No AI development config files detected.

Similar repositories

Nearest indexed repositories by embedding similarity.

Pyx-Corp/spectrawl

The unified web layer for AI agents. Search (8 engines), stealth browse, auth, and act on 24 platforms. One npm install, self-hosted.

26 stars

JavaScript 1 awesome list

AIMLPM/markcrawl

Fast Python web crawler for RAG and AI ingestion. Extracts clean Markdown from any site for LLMs and vector stores.

2 stars

Python 1 awesome list

N0taN3rd/Squidwarc

Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head

178 stars

JavaScript 1 awesome list

ArchiveTeam/grab-site

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

1,598 stars

Python 1 awesome list

harvard-lil/warcbench

A tool for exploring, analyzing, transforming, recombining, and extracting data from WARC (Web ARChive) files.

22 stars

Python 1 awesome list

jina-ai/reader

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/

11,541 stars

TypeScript 1 awesome list

Metadata

Language: JavaScript
License: MIT
Default branch: main
Created: 2022-09-20
First commit: 2022-09-20
Last pushed: 2025-09-03
GitHub updated: 2026-07-04
Last synced: 2026-07-12 03:12
Stack detected: 2026-07-12 03:12
Archived: no

Links and files

GitHub README

Appears in

Awesome Web Archiving
