s0rg/crawley
The unix-way web crawler
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
The unix-way web crawler
The API to search, scrape, and interact with the web at scale. 🔥
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Fast Python web crawler for RAG and AI ingestion. Extracts clean Markdown from any site for LLMs and vector stores.
Transform developer documentation to clean Markdown
Crawleo MCP Server - Real-Time Web Knowledge for AI Crawleo's Model Context Protocol (MCP) server enables AI assistants like Claude to access real-time web data directly through native tool integration.
2 captures since 2026-05-27
pyproject.toml
· python · 68 dependencies
uv.lock
· python · 0 dependencies
docs/pyproject.toml
· python · 0 dependencies
website/package.json
· javascript · 44 dependencies
website/pnpm-lock.yaml
· javascript · 0 dependencies
website/roa-loader/package.json
· javascript · 1 dependencies
src/crawlee/project_template/{{cookiecutter.project_name}}/pyproject.toml
· python · 4 dependencies
src/crawlee/project_template/{{cookiecutter.project_name}}/requirements.txt
· python · 3 dependencies
AI agent config detected
Key config paths
AGENTS.md
CLAUDE.md
GEMINI.md