crawl4ai

This skill should be used when users need to scrape websites, extract structured data, handle JavaScript-heavy pages, crawl multiple URLs, or build automated web data pipelines. Includes optimized extraction patterns with schema generation for efficient, LLM-free extraction.
aiskillstore/marketplace · ★ 329 · AI & Automation · score 79
Install: claude install-skill aiskillstore/marketplace
# Crawl4AI

## Overview

Crawl4AI provides comprehensive web crawling and data extraction capabilities. This skill supports both **CLI** (recommended for quick tasks) and **Python SDK** (for programmatic control).

**Choose your interface:**

- **CLI** (`crwl`) - Quick, scriptable commands: [CLI Guide](references/cli-guide.md)
- **Python SDK** - Full programmatic control: [SDK Guide](references/sdk-guide.md)

---

## Quick Start

### Installation

```bash
pip install crawl4ai
crawl4ai-setup

# Verify installation
crawl4ai-doctor
```

### CLI (Recommended)

```bash
# Basic crawling - returns markdown
crwl https://example.com

# Get markdown output
crwl https://example.com -o markdown

# JSON output with cache bypass
crwl https://example.com -o json -v --bypass-cache

# See more examples
crwl --example
```

### Python SDK

```python
import asyncio
from crawl4ai import AsyncWebCrawler

async def main():
    async with AsyncWebCrawler() as crawler:
        result = await crawler.arun("https://example.com")
        print(result.markdown[:500])

asyncio.run(main())
```

For SDK configuration details: [SDK Guide - Configuration](references/sdk-guide.md#configuration) (lines 61-150)

---

## Core Concepts

### Configuration Layers

Both CLI and SDK use the same underlying configuration:

| Concept | CLI | SDK |
|---------|-----|-----|
| Browser settings | `-B browser.yml` or `-b "param=value"` | `BrowserConfig(...)` |
| Crawl settings | `-C crawler.yml` or `-c "param=value"` | `Crawle