anth-rate-limits

Featured

Implement Anthropic Claude API rate limiting, backoff, and quota management. Use when handling 429 errors, optimizing request throughput, or managing RPM/TPM limits across usage tiers. Trigger with phrases like "anthropic rate limit", "claude 429", "anthropic throttling", "claude retry", "anthropic backoff".

AI & Automation 2,266 stars 315 forks Updated today MIT

Quality Score: 99/100

Stars (20%): 100
Recency (20%): 100
Frontmatter (20%): 70
Documentation (15%): 100
Issue Health (10%): 50
License (10%): 100
Description (5%): 100

Skill Content

# Anthropic Rate Limits

## Overview

The Claude API uses token-bucket rate limiting measured in three dimensions: requests per minute (RPM), input tokens per minute (ITPM), and output tokens per minute (OTPM). Limits increase automatically as you move through usage tiers.

## Rate Limit Dimensions

| Dimension | Header | Description |
|-----------|--------|-------------|
| RPM | `anthropic-ratelimit-requests-limit` | Requests per minute |
| ITPM | `anthropic-ratelimit-input-tokens-limit` | Input tokens per minute |
| OTPM | `anthropic-ratelimit-output-tokens-limit` | Output tokens per minute |

Limits are per-organization and per-model-class. Cached input tokens do NOT count toward ITPM limits.

## Usage Tiers (Auto-Upgrade)

| Tier | Monthly Spend | Key Benefit |
|------|---------------|-------------|
| Tier 1 (Free) | $0 | Evaluation access |
| Tier 2 | $40+ | Higher RPM |
| Tier 3 | $200+ | Production-grade limits |
| Tier 4 | $2,000+ | High-throughput access |
| Scale | Custom | Custom limits via sales |

Check your current tier and limits at [console.anthropic.com](https://console.anthropic.com/settings/limits).

## SDK Built-In Retry

```python
import anthropic

# The SDK retries 429 and 5xx errors automatically (2 retries by default)
client = anthropic.Anthropic(max_retries=5)  # Increase for high-traffic apps

# Disable auto-retry for manual control
client = anthropic.Anthropic(max_retries=0)
```

```typescript
import Anthropic from "@anthropic-ai/sdk";

const client = new Anthropic({ maxRetries: 5 });
```

## Custom Rate ...
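As a rough illustration of the three rate-limit dimensions and of manual retry handling (for example with `max_retries=0`), the sketch below parses limit headers from a response and computes a wait time before the next attempt. It is not taken from the skill itself: `RateLimitSnapshot`, `parse_rate_limit_headers`, and `backoff_delay` are hypothetical names, the header names follow the `anthropic-ratelimit-*` convention, and the base delay and cap are assumed values you would tune for your own tier.

```python
import random
from dataclasses import dataclass
from typing import Callable, Mapping, Optional


@dataclass
class RateLimitSnapshot:
    """Limits read from one response's headers (None when a header is absent)."""
    requests_limit: Optional[int]
    input_tokens_limit: Optional[int]
    output_tokens_limit: Optional[int]
    retry_after_s: Optional[float]


def _to_int(value: Optional[str]) -> Optional[int]:
    return int(value) if value is not None else None


def parse_rate_limit_headers(headers: Mapping[str, str]) -> RateLimitSnapshot:
    # HTTP header names are case-insensitive, so normalise before lookup.
    h = {k.lower(): v for k, v in headers.items()}
    retry_after = h.get("retry-after")
    return RateLimitSnapshot(
        requests_limit=_to_int(h.get("anthropic-ratelimit-requests-limit")),
        input_tokens_limit=_to_int(h.get("anthropic-ratelimit-input-tokens-limit")),
        output_tokens_limit=_to_int(h.get("anthropic-ratelimit-output-tokens-limit")),
        retry_after_s=float(retry_after) if retry_after is not None else None,
    )


def backoff_delay(
    attempt: int,
    retry_after_s: Optional[float] = None,
    base_s: float = 1.0,   # assumed starting delay
    cap_s: float = 60.0,   # assumed upper bound on any single wait
    rng: Callable[[], float] = random.random,
) -> float:
    """Seconds to wait before retry number `attempt` (0-based)."""
    # Prefer the server's own hint when a 429 response carries retry-after.
    if retry_after_s is not None:
        return retry_after_s
    # Otherwise: exponential backoff with full jitter, capped at cap_s.
    ceiling = min(cap_s, base_s * (2 ** attempt))
    return ceiling * rng()
```

In a manual retry loop, one plausible use is to call `parse_rate_limit_headers` on the headers of a 429 response, pass its `retry_after_s` to `backoff_delay`, and sleep for the returned number of seconds before retrying. The jitter spreads retries from concurrent workers so they do not all hit the API at the same instant.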

Details

Author
jeremylongshore
Repository
jeremylongshore/claude-code-plugins-plus-skills
Created
7 months ago
Last Updated
today
Language
Python
License
MIT

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Featured

clade-rate-limits

Handle Anthropic rate limits: understand tiers, implement backoff, optimize throughput, and monitor usage. Use when working with rate-limits patterns. Trigger with "anthropic rate limit", "claude 429", "anthropic throttling", "anthropic usage limits", "claude tokens per minute".

2,266 Updated today
jeremylongshore
AI & Automation Featured

clade-common-errors

Diagnose and fix Anthropic API errors: authentication, rate limits, overloaded, context length, and content policy issues. Use when working with common-errors patterns. Trigger with "anthropic error", "claude 429", "claude overloaded", "anthropic not working", "debug claude api".

2,266 Updated today
jeremylongshore
AI & Automation Featured

apollo-rate-limits

Implement Apollo.io rate limiting and backoff. Use when handling rate limits, implementing retry logic, or optimizing API request throughput. Trigger with phrases like "apollo rate limit", "apollo 429", "apollo throttling", "apollo backoff", "apollo request limits".

2,266 Updated today
jeremylongshore
AI & Automation Featured

anth-incident-runbook

Execute incident response procedures for Claude API outages and degradation. Use when Claude API is returning errors, experiencing high latency, or showing degraded performance in production. Trigger with phrases like "anthropic incident", "claude api down", "anthropic outage", "claude degraded", "anthropic runbook".

2,266 Updated today
jeremylongshore
AI & Automation Featured

anth-load-scale

Implement load testing, auto-scaling, and capacity planning for Claude API. Use when running performance benchmarks, planning for traffic spikes, or configuring horizontal scaling for Claude-powered services. Trigger with phrases like "anthropic load test", "claude scaling", "anthropic capacity planning", "scale claude api".

2,266 Updated today
jeremylongshore