mistral-rate-limits

Featured

Implement Mistral AI rate limiting, backoff, and request management. Use when handling rate limit errors, implementing retry logic, or optimizing API request throughput for Mistral AI. Trigger with phrases like "mistral rate limit", "mistral throttling", "mistral 429", "mistral retry", "mistral backoff".

AI & Automation 2,266 stars 315 forks Updated today MIT

Install

View on GitHub

Quality Score: 99/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Mistral Rate Limits ## Overview Rate limit management for Mistral AI API. Mistral enforces per-workspace RPM (requests/minute) and TPM (tokens/minute) limits that vary by usage tier (Experiment free tier vs Scale pay-as-you-go). View your workspace limits at [admin.mistral.ai/plateforme/limits](https://admin.mistral.ai/plateforme/limits). ## Prerequisites - Mistral API key configured - Understanding of workspace tier (Experiment vs Scale) - Application with retry infrastructure ## Mistral Rate Limit Architecture Limits are set at the **workspace** level, not per key. All API keys in a workspace share the same RPM/TPM budget. | Endpoint | What's limited | |----------|---------------| | `/v1/chat/completions` | RPM + TPM (input + output) | | `/v1/embeddings` | RPM + TPM (input only) | | `/v1/fim/completions` | RPM + TPM | | `/v1/moderations` | RPM | **Headers returned on every response:** - `x-ratelimit-limit-requests` — your RPM cap - `x-ratelimit-remaining-requests` — remaining RPM - `x-ratelimit-limit-tokens` — your TPM cap - `x-ratelimit-remaining-tokens` — remaining TPM - `Retry-After` — seconds to wait (on 429 only) ## Instructions ### Step 1: Token-Aware Rate Limiter ```typescript class MistralRateLimiter { private requestTimes: number[] = []; private tokenBuckets: Array<{ time: number; tokens: number }> = []; private readonly rpm: number; private readonly tpm: number; constructor(rpm: number, tpm: number) { this.rpm = rpm; this.tpm = tpm; ...

Details

Author
jeremylongshore
Repository
jeremylongshore/claude-code-plugins-plus-skills
Created
7 months ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Featured

mistral-common-errors

Diagnose and fix Mistral AI common errors and exceptions. Use when encountering Mistral errors, debugging failed requests, or troubleshooting integration issues. Trigger with phrases like "mistral error", "fix mistral", "mistral not working", "debug mistral".

2,266 Updated today
jeremylongshore
AI & Automation Featured

mistral-enterprise-rbac

Configure Mistral AI enterprise access control and workspace management. Use when implementing role-based API key scoping, managing team access, or setting up organization-level controls for Mistral AI. Trigger with phrases like "mistral access control", "mistral RBAC", "mistral enterprise", "mistral roles", "mistral team".

2,266 Updated today
jeremylongshore
AI & Automation Featured

mistral-performance-tuning

Optimize Mistral AI performance with caching, batching, and latency reduction. Use when experiencing slow API responses, implementing caching strategies, or optimizing request throughput for Mistral AI integrations. Trigger with phrases like "mistral performance", "optimize mistral", "mistral latency", "mistral caching", "mistral slow".

2,266 Updated today
jeremylongshore
AI & Automation Featured

mistral-cost-tuning

Optimize Mistral AI costs through model selection, token management, and usage monitoring. Use when analyzing Mistral billing, reducing API costs, or implementing usage monitoring and budget alerts. Trigger with phrases like "mistral cost", "mistral billing", "reduce mistral costs", "mistral pricing", "mistral budget".

2,266 Updated today
jeremylongshore
AI & Automation Featured

clade-rate-limits

Handle Anthropic rate limits — understand tiers, implement backoff, Use when working with rate-limits patterns. optimize throughput, and monitor usage. Trigger with "anthropic rate limit", "claude 429", "anthropic throttling", "anthropic usage limits", "claude tokens per minute".

2,266 Updated today
jeremylongshore