midwork-finds-jobs/duckdb_warc
DuckDB extension for parsing WARC files
DuckDB extension to fetch pages from Wayback Machine & Common Crawl
DuckDB extension for parsing WARC files
Extract web archive data using Wayback Machine and Common Crawl
DuckDB is an analytical in-process SQL database management system
Wayback Machine API interface & a command-line tool
DuckDB-powered Postgres for high performance apps & analytics.
WarcDB: Web crawl data as SQLite databases.
2 captures since 2026-05-23
AI agent config detected
Key config paths
CLAUDE.md