exa-rate-limits

Featured

Implement Exa rate limiting, exponential backoff, and request queuing. Use when handling 429 errors, implementing retry logic, or optimizing API request throughput for Exa. Trigger with phrases like "exa rate limit", "exa throttling", "exa 429", "exa retry", "exa backoff", "exa QPS".

AI & Automation 2,266 stars 315 forks Updated today MIT

Quality Score: 99/100

Stars (20%): 100
Recency (20%): 100
Frontmatter (20%): 70
Documentation (15%): 100
Issue Health (10%): 50
License (10%): 100
Description (5%): 100

Skill Content

# Exa Rate Limits

## Overview

Handle Exa API rate limits gracefully. Default limit is 10 QPS (queries per second) across all endpoints. Rate limit errors return HTTP 429 with a simple `{ "error": "rate limit exceeded" }` response. For higher limits, contact hello@exa.ai for Enterprise plans.

## Rate Limit Structure

| Endpoint | Default QPS | Notes |
|----------|-------------|-------|
| `/search` | 10 | Most endpoints share this limit |
| `/find-similar` | 10 | Same pool as search |
| `/contents` | 10 | Same pool |
| `/answer` | 10 | Same pool |
| Research API | Concurrent task limit | Long-running operations |

## Prerequisites

- `exa-js` SDK installed
- Understanding of async/await patterns

## Instructions

### Step 1: Exponential Backoff with Jitter

```typescript
import Exa from "exa-js";

const exa = new Exa(process.env.EXA_API_KEY);

async function withBackoff<T>(
  operation: () => Promise<T>,
  config = { maxRetries: 5, baseDelayMs: 1000, maxDelayMs: 32000 }
): Promise<T> {
  for (let attempt = 0; attempt <= config.maxRetries; attempt++) {
    try {
      return await operation();
    } catch (err: any) {
      const status = err.status || err.response?.status;
      // Only retry on 429 (rate limit) and 5xx (server errors)
      if (status !== 429 && (status < 500 || status >= 600)) throw err;
      if (attempt === config.maxRetries) throw err;
      // Exponential delay with random jitter to prevent thundering herd
      const exponentialDelay = config.baseDelayM...
```
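
The Step 1 listing is cut off before the delay is computed, so the following is an independent, minimal sketch of exponential backoff with jitter, not a reconstruction of the skill's remaining code. The helper names (`isRetryable`, `backoffDelayMs`, `retryWithBackoff`) are hypothetical and not part of `exa-js`. The "full jitter" variant (a uniform delay between 0 and the capped exponential value) and the choice not to retry errors without an HTTP status are assumptions; the config defaults are taken from the listing.

```typescript
// Hypothetical helpers for illustration; none of these names come from exa-js.
interface BackoffConfig {
  maxRetries: number;
  baseDelayMs: number;
  maxDelayMs: number;
}

// Defaults mirror the values shown in the Step 1 listing.
const defaultConfig: BackoffConfig = {
  maxRetries: 5,
  baseDelayMs: 1000,
  maxDelayMs: 32000,
};

// Retry only on 429 (rate limit) and 5xx (server errors).
// Assumption: errors that carry no HTTP status are treated as non-retryable.
function isRetryable(status: number | undefined): boolean {
  if (status === undefined) return false;
  return status === 429 || (status >= 500 && status < 600);
}

// Delay before the retry that follows a failed attempt (0-indexed).
// Assumed variant: "full jitter", a uniform value in [0, cappedExponential).
// `random` is injectable so the function can be tested deterministically.
function backoffDelayMs(
  attempt: number,
  config: BackoffConfig = defaultConfig,
  random: () => number = Math.random
): number {
  const exponential = config.baseDelayMs * 2 ** attempt;
  const capped = Math.min(exponential, config.maxDelayMs);
  return Math.floor(random() * capped);
}

// Run `operation`, retrying retryable failures up to config.maxRetries times.
async function retryWithBackoff<T>(
  operation: () => Promise<T>,
  config: BackoffConfig = defaultConfig,
  sleep: (ms: number) => Promise<void> = (ms) =>
    new Promise<void>((resolve) => setTimeout(resolve, ms))
): Promise<T> {
  for (let attempt = 0; ; attempt++) {
    try {
      return await operation();
    } catch (err: unknown) {
      const e = err as
        | { status?: number; response?: { status?: number } }
        | null
        | undefined;
      const status = e?.status ?? e?.response?.status;
      // Give up on non-retryable errors or once the retry budget is spent.
      if (!isRetryable(status) || attempt >= config.maxRetries) throw err;
      await sleep(backoffDelayMs(attempt, config));
    }
  }
}
```

With the `exa` client from the listing, a call would be wrapped as `retryWithBackoff(() => exa.search("query"))`. Spreading the delay over the whole interval keeps clients that hit a 429 at the same moment from retrying in lockstep, at the cost of occasionally retrying almost immediately.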

Details

Author: jeremylongshore
Repository: jeremylongshore/claude-code-plugins-plus-skills
Created: 7 months ago
Last Updated: today
Language: Python
License: MIT

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Featured

perplexity-rate-limits

Implement Perplexity rate limiting, backoff, and request queuing. Use when handling 429 errors, implementing retry logic, or optimizing API request throughput for Perplexity Sonar. Trigger with phrases like "perplexity rate limit", "perplexity throttling", "perplexity 429", "perplexity retry", "perplexity backoff".

2,266 Updated today
jeremylongshore
AI & Automation Featured

instantly-rate-limits

Implement Instantly.ai rate limiting, backoff, and request throttling patterns. Use when handling 429 errors, implementing retry logic, or building high-throughput Instantly integrations. Trigger with phrases like "instantly rate limit", "instantly 429", "instantly throttle", "instantly backoff", "instantly retry".

2,266 Updated today
jeremylongshore
AI & Automation Featured

exa-load-scale

Implement Exa load testing, capacity planning, and scaling strategies. Use when running performance tests, planning capacity for Exa integrations, or designing high-throughput search architectures. Trigger with phrases like "exa load test", "exa scale", "exa capacity", "exa k6", "exa benchmark", "exa throughput".

2,266 Updated today
jeremylongshore
AI & Automation Featured

elevenlabs-rate-limits

Implement ElevenLabs rate limiting, concurrency queuing, and backoff patterns. Use when handling 429 errors, implementing retry logic, or managing concurrent TTS request throughput. Trigger: "elevenlabs rate limit", "elevenlabs throttling", "elevenlabs 429", "elevenlabs retry", "elevenlabs backoff", "elevenlabs concurrent requests".

2,266 Updated today
jeremylongshore
AI & Automation Featured

cohere-rate-limits

Implement Cohere rate limiting, backoff, and request queuing patterns. Use when handling 429 errors, implementing retry logic, or optimizing API request throughput for Cohere. Trigger with phrases like "cohere rate limit", "cohere throttling", "cohere 429", "cohere retry", "cohere backoff".

2,266 Updated today
jeremylongshore