clade-rate-limits

Featured

Handle Anthropic rate limits — understand tiers, implement backoff, Use when working with rate-limits patterns. optimize throughput, and monitor usage. Trigger with "anthropic rate limit", "claude 429", "anthropic throttling", "anthropic usage limits", "claude tokens per minute".

AI & Automation 2,266 stars 315 forks Updated today MIT

Install

View on GitHub

Quality Score: 99/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Anthropic Rate Limits ## Overview Anthropic enforces three types of limits: requests per minute (RPM), input tokens per minute (TPM), and output tokens per minute. Limits depend on your spend tier. ## Rate Limit Tiers | Tier | Qualification | RPM | Input TPM | Output TPM | |------|--------------|-----|-----------|------------| | Tier 1 | Free | 50 | 40,000 | 8,000 | | Tier 2 | $40+ spend | 1,000 | 80,000 | 16,000 | | Tier 3 | $200+ spend | 2,000 | 160,000 | 32,000 | | Tier 4 | $400+ spend | 4,000 | 400,000 | 80,000 | | Scale | Custom | Custom | Custom | Custom | > **Check your tier:** console.anthropic.com → Settings → Limits ## Response Headers Every API response includes rate limit headers: ``` claude-ratelimit-requests-limit: 1000 claude-ratelimit-requests-remaining: 998 claude-ratelimit-requests-reset: 2025-01-01T00:01:00Z claude-ratelimit-tokens-limit: 80000 claude-ratelimit-tokens-remaining: 79500 claude-ratelimit-tokens-reset: 2025-01-01T00:01:00Z retry-after: 5 ``` ## Built-In SDK Retries The SDK automatically retries 429 and 529 errors with exponential backoff: ```typescript import Anthropic from '@claude-ai/sdk'; const client = new Anthropic({ maxRetries: 3, // default: 2. Set to 0 to disable. }); ``` ## Custom Backoff ```typescript async function callWithBackoff(params: Anthropic.MessageCreateParams, maxRetries = 5) { for (let attempt = 0; attempt < maxRetries; attempt++) { try { return await client.messages.create(params); } catch (err)...

Details

Author
jeremylongshore
Repository
jeremylongshore/claude-code-plugins-plus-skills
Created
7 months ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Featured

anth-rate-limits

Implement Anthropic Claude API rate limiting, backoff, and quota management. Use when handling 429 errors, optimizing request throughput, or managing RPM/TPM limits across usage tiers. Trigger with phrases like "anthropic rate limit", "claude 429", "anthropic throttling", "claude retry", "anthropic backoff".

2,266 Updated today
jeremylongshore
AI & Automation Featured

clade-common-errors

Diagnose and fix Anthropic API errors — authentication, rate limits, Use when working with common-errors patterns. overloaded, context length, and content policy issues. Trigger with "anthropic error", "claude 429", "claude overloaded", "anthropic not working", "debug claude api".

2,266 Updated today
jeremylongshore
AI & Automation Featured

apollo-rate-limits

Implement Apollo.io rate limiting and backoff. Use when handling rate limits, implementing retry logic, or optimizing API request throughput. Trigger with phrases like "apollo rate limit", "apollo 429", "apollo throttling", "apollo backoff", "apollo request limits".

2,266 Updated today
jeremylongshore
AI & Automation Featured

abridge-rate-limits

Implement Abridge rate limiting, backoff, and session throttling patterns. Use when handling 429 errors, managing concurrent encounter sessions, or optimizing API throughput for high-volume clinical deployments. Trigger: "abridge rate limit", "abridge 429", "abridge throttling", "abridge concurrent sessions".

2,266 Updated today
jeremylongshore
AI & Automation Featured

clade-load-scale

Scale Claude usage for high-throughput applications — batches, queues, Use when working with load-scale patterns. concurrency control, and tier upgrades. Trigger with "anthropic scale", "claude high volume", "anthropic throughput", "scale claude api", "anthropic concurrent requests".

2,266 Updated today
jeremylongshore