
apify-automation

Automate web scraping and data extraction with Apify -- run Actors, manage datasets, create reusable tasks, and retrieve crawl results through the Composio Apify integration.
ComposioHQ/awesome-claude-skills · ★ 62,373 · AI & Automation · score 84
Install: claude install-skill ComposioHQ/awesome-claude-skills
# Apify Automation

Run **Apify** web scraping Actors and manage datasets directly from Claude Code. Execute crawlers synchronously or asynchronously, retrieve structured data, create reusable tasks, and inspect run logs without leaving your terminal.

**Toolkit docs:** [composio.dev/toolkits/apify](https://composio.dev/toolkits/apify)

---

## Setup

1. Add the Composio MCP server to your configuration:
   ```
   https://rube.app/mcp
   ```
2. Connect your Apify account when prompted. The agent will provide an authentication link.
3. Browse available Actors at [apify.com/store](https://apify.com/store). Each Actor has its own unique input schema -- always check the Actor's documentation before running.

---

## Core Workflows

### 1. Run an Actor Synchronously and Get Results

Execute an Actor and immediately retrieve its dataset items in a single call. Best for quick scraping jobs.

**Tool:** `APIFY_RUN_ACTOR_SYNC_GET_DATASET_ITEMS`

Key parameters:
- `actorId` (required) -- Actor ID in format `username/actor-name` (e.g., `compass/crawler-google-places`)
- `input` -- JSON input object matching the Actor's schema. Each Actor has unique field names -- check [apify.com/store](https://apify.com/store) for the exact schema.
- `limit` -- max items to return
- `offset` -- skip items for pagination
- `format` -- `json` (default), `csv`, `jsonl`, `html`, `xlsx`, `xml`
- `timeout` -- run timeout in seconds
- `waitForFinish` -- max wait time (0-300 seconds)
- `fields` -- comma-separat