cf-crawl

Solid

Crawl entire websites using Cloudflare Browser Rendering /crawl API. Initiates async crawl jobs, polls for completion, and saves results as markdown files. Useful for ingesting documentation sites, knowledge bases, or any web content into your project context. Requires CLOUDFLARE_ACCOUNT_ID and CLOUDFLARE_API_TOKEN environment variables.

Web & Frontend 27,984 stars 2901 forks Updated today MIT

Install

View on GitHub

Quality Score: 93/100

Stars 20%

100

Recency 20%

100

Frontmatter 20%

Documentation 15%

100

Issue Health 10%

License 10%

100

Description 5%

100

Skill Content

# Cloudflare Website Crawler You are a web crawling assistant that uses Cloudflare's Browser Rendering /crawl REST API to crawl websites and save their content as markdown files for local use. ## Prerequisites The user must have: 1. A Cloudflare account with Browser Rendering enabled 2. `CLOUDFLARE_ACCOUNT_ID` and `CLOUDFLARE_API_TOKEN` available (see below) ## Workflow When the user asks to crawl a website, follow this exact workflow: ### Step 1: Load Credentials Look for `CLOUDFLARE_ACCOUNT_ID` and `CLOUDFLARE_API_TOKEN` in this order: 1. **Current environment variables** - Check if already exported in the shell 2. **Project `.env` file** - Read `.env` in the current working directory and extract the values 3. **Project `.env.local` file** - Read `.env.local` in the current working directory 4. **Home directory `.env`** - Read `~/.env` as a last resort To load from a `.env` file, parse it line by line looking for `CLOUDFLARE_ACCOUNT_ID=` and `CLOUDFLARE_API_TOKEN=` entries. Use this bash approach: ```bash # Load from .env if vars are not already set if [ -z "$CLOUDFLARE_ACCOUNT_ID" ] || [ -z "$CLOUDFLARE_API_TOKEN" ]; then for envfile in .env .env.local "$HOME/.env"; do if [ -f "$envfile" ]; then eval "$(grep -E '^CLOUDFLARE_(ACCOUNT_ID|API_TOKEN)=' "$envfile" | sed 's/^/export /')" fi done fi ``` If credentials are still missing after checking all sources, tell the user to add them to their project `.env` file: ``` CLOUDFLARE_ACCOUNT_ID=your-ac...

Details

Author: davila7
Repository: davila7/claude-code-templates
Created: 11 months ago
Last Updated: today
Language: Python
License: MIT

Integrates with

Anthropic · AI Cloudflare · Cloud REST API · API

Similar Skills

Semantically similar based on skill content — not just same category

Web & Frontend Listed

cf-crawl

45 Updated today

YuDefine

AI & Automation Listed

crawl

Fetch web pages that may be JS-rendered or bot-protected, returning clean markdown or HTML. Works zero-setup with a direct HTTP fetch; if you supply your own Cloudflare Browser Rendering credentials it uses a managed headless browser that renders JS and bypasses most WAFs. Use when a plain fetch returns 403, when a page is a JS-rendered SPA, or when you need reliable markdown extraction from a URL. Triggers on 'crawl this page', 'fetch this URL', 'scrape this site', 'get the content from this page', 'this page is blocked', or when a normal fetch fails on a URL.

0 Updated 6 days ago

eprouveze

Data & Documents Listed

firecrawl

Search, scrape, and interact with the web via the Firecrawl CLI. Use this skill whenever the user wants to search the web, find articles, research a topic, look something up online, scrape a webpage, grab content from a URL, get data from a website, crawl documentation, download a site, or interact with pages that need clicks or logins. Also use when they say "fetch this page", "pull the content from", "get the page at https://", or reference external websites. This provides real-time web search with full page content and interact capabilities — beyond what Claude can do natively with built-in tools. Do NOT trigger for local file operations, git commands, deployments, or code editing tasks.

25 Updated 2 days ago

georgekhananaev