Pyx-Corp/spectrawl
The unified web layer for AI agents. Search (8 engines), stealth browse, auth, and act on 24 platforms. One npm install, self-hosted.
🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Fast Python web crawler for RAG and AI ingestion. Extracts clean Markdown from any site for LLMs and vector stores.
Crawlee: a web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Supports both headful and headless modes, with proxy rotation.
A self-hosted API that takes a URL and returns a file with browser screenshots.
The API to search, scrape, and interact with the web at scale. 🔥
pyproject.toml · python · 21 dependencies
setup.cfg · python · 0 dependencies
docs/requirements.txt · python · 8 dependencies
tests/requirements.txt · python · 8 dependencies