apify-core-workflow-a

Featured

Build a complete web scraping Actor with Crawlee and deploy to Apify. Use when implementing end-to-end scraping: input schema, crawler, data extraction, dataset output, and platform deployment. Trigger: "apify scrape website", "build apify actor", "crawlee scraper", "apify main workflow".

AI & Automation 2,266 stars 315 forks Updated today MIT

Quality Score: 99/100

Stars (20%): 100
Recency (20%): 100
Frontmatter (20%): 70
Documentation (15%): 100
Issue Health (10%): 50
License (10%): 100
Description (5%): 100

Skill Content

# Apify Core Workflow A: Build & Deploy a Scraper

## Overview

End-to-end workflow: define an input schema, build a Crawlee-based Actor, extract structured data, store results in datasets, test locally, and deploy to the Apify platform. This is the primary money-path workflow for Apify.

## Prerequisites

- `npm install apify crawlee` in your project
- `npm install -g apify-cli` and `apify login` completed
- Familiarity with `apify-sdk-patterns`

## Instructions

### Step 1: Define Input Schema

Create `.actor/INPUT_SCHEMA.json`:

```json
{
  "title": "E-Commerce Scraper",
  "type": "object",
  "schemaVersion": 1,
  "properties": {
    "startUrls": {
      "title": "Start URLs",
      "type": "array",
      "description": "Product listing page URLs to scrape",
      "editor": "requestListSources",
      "prefill": [{ "url": "https://example-store.com/products" }]
    },
    "maxItems": {
      "title": "Max items",
      "type": "integer",
      "description": "Maximum number of products to scrape",
      "default": 100,
      "minimum": 1,
      "maximum": 10000
    },
    "proxyConfig": {
      "title": "Proxy configuration",
      "type": "object",
      "description": "Select proxy to use",
      "editor": "proxy",
      "default": { "useApifyProxy": true }
    }
  },
  "required": ["startUrls"]
}
```

### Step 2: Build the Actor with Router Pattern

```typescript
// src/main.ts
import { Actor } from 'apify';
import { CheerioCrawler, createCheerioRouter, Dataset, log } from 'c...
```
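
The input schema in Step 1 declares one required field (`startUrls`), a default and bounds for `maxItems`, and a default for `proxyConfig`. As a minimal sketch of how Actor code might apply those same rules defensively before crawling, the helper below normalizes a raw input object. The names `normalizeInput`, `ScraperInput`, `StartUrl`, and `ProxyConfig` are hypothetical and not part of the Apify SDK or Crawlee; the sketch assumes the raw input has already been read elsewhere and only mirrors the values written in the schema above.

```typescript
// Hypothetical types mirroring the properties in .actor/INPUT_SCHEMA.json.
interface StartUrl {
  url: string;
}

interface ProxyConfig {
  useApifyProxy: boolean;
}

interface ScraperInput {
  startUrls: StartUrl[];
  maxItems: number;
  proxyConfig: ProxyConfig;
}

// Hypothetical helper (not an Apify SDK function): applies the schema's
// defaults and bounds to an untyped input object.
function normalizeInput(raw: unknown): ScraperInput {
  if (typeof raw !== 'object' || raw === null) {
    throw new Error('Input must be an object');
  }
  const input = raw as Record<string, unknown>;

  // "required": ["startUrls"] with "type": "array"
  const rawUrls = input.startUrls;
  if (!Array.isArray(rawUrls)) {
    throw new Error('startUrls is required and must be an array');
  }
  const startUrls: StartUrl[] = rawUrls.map((entry: unknown) => {
    if (
      typeof entry !== 'object' ||
      entry === null ||
      typeof (entry as { url?: unknown }).url !== 'string'
    ) {
      throw new Error('Each startUrls entry must look like { "url": "..." }');
    }
    return { url: (entry as { url: string }).url };
  });

  // "default": 100, "minimum": 1, "maximum": 10000
  const maxItems: unknown = input.maxItems ?? 100;
  if (
    typeof maxItems !== 'number' ||
    !Number.isInteger(maxItems) ||
    maxItems < 1 ||
    maxItems > 10000
  ) {
    throw new Error('maxItems must be an integer between 1 and 10000');
  }

  // "default": { "useApifyProxy": true }
  const rawProxy = input.proxyConfig;
  const proxyConfig: ProxyConfig =
    typeof rawProxy === 'object' && rawProxy !== null
      ? { useApifyProxy: (rawProxy as { useApifyProxy?: unknown }).useApifyProxy === true }
      : { useApifyProxy: true };

  return { startUrls, maxItems, proxyConfig };
}

// Usage: only the required field is supplied, so the schema defaults apply.
const normalized = normalizeInput({
  startUrls: [{ url: 'https://example-store.com/products' }],
});
console.log(normalized.maxItems, normalized.proxyConfig.useApifyProxy);
```

Keeping the checks in one function means the crawler code can rely on a fully typed `ScraperInput` rather than re-checking optional fields at each use.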

Details

Author
jeremylongshore
Repository
jeremylongshore/claude-code-plugins-plus-skills
Created
7 months ago
Last Updated
today
Language
Python
License
MIT

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Featured

apify-deploy-integration

Deploy Apify Actors and integrate scraping into external applications. Use when deploying Actors to the platform, integrating Actor results into web apps, or connecting Apify with external services. Trigger: "deploy apify actor", "apify Vercel integration", "apify production deploy", "integrate apify results", "apify API endpoint".

2,266 Updated today
jeremylongshore
AI & Automation Listed

apify-automation

Automate web scraping and data extraction with Apify -- run Actors, manage datasets, create reusable tasks, and retrieve crawl results through the Composio Apify integration.

62,564 Updated 1 week ago
ComposioHQ
AI & Automation Featured

apify-reference-architecture

Production-grade architecture patterns for Apify-powered applications. Use when designing scraping infrastructure, building multi-Actor pipelines, or integrating Apify into a larger system architecture. Trigger: "apify architecture", "apify best practices", "apify project structure", "scraping architecture", "apify system design".

2,266 Updated today
jeremylongshore
AI & Automation Featured

apify-hello-world

Run your first Apify Actor and retrieve results via apify-client. Use when starting a new Apify integration, testing connectivity, or learning the Actor call/dataset retrieval pattern. Trigger: "apify hello world", "apify example", "run an apify actor", "apify quick start", "first apify scrape".

2,266 Updated today
jeremylongshore
AI & Automation Featured

apify-local-dev-loop

Set up local Apify Actor development with Apify CLI and Crawlee. Use when creating Actors locally, testing with apify run, or establishing a fast develop-test-deploy cycle. Trigger: "apify dev setup", "apify local development", "develop actor locally", "apify run local".

2,266 Updated today
jeremylongshore