exa-performance-tuning

Featured

Optimize Exa API performance with search type selection, caching, and parallelization. Use when experiencing slow responses, implementing caching strategies, or optimizing request throughput for Exa integrations. Trigger with phrases like "exa performance", "optimize exa", "exa latency", "exa caching", "exa slow", "exa fast".

AI & Automation 2,266 stars 315 forks Updated today MIT

Install

View on GitHub

Quality Score: 99/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Exa Performance Tuning ## Overview Optimize Exa search API response times for production workloads. Key levers: search type selection (instant < fast < auto < neural < deep), result count reduction, content scope control, result caching, and parallel query execution. ## Latency by Search Type | Type | Typical Latency | Use Case | |------|----------------|----------| | `instant` | < 150ms | Real-time autocomplete, typeahead | | `fast` | p50 < 425ms | Speed-critical user-facing search | | `auto` | 300-1500ms | General purpose (default) | | `neural` | 500-2000ms | Best semantic quality | | `deep` | 2-5s | Maximum coverage, light deep search | | `deep-reasoning` | 5-15s | Complex research questions | ## Instructions ### Step 1: Match Search Type to Latency Budget ```typescript import Exa from "exa-js"; const exa = new Exa(process.env.EXA_API_KEY); function selectSearchType(latencyBudgetMs: number) { if (latencyBudgetMs < 200) return "instant"; if (latencyBudgetMs < 500) return "fast"; if (latencyBudgetMs < 1500) return "auto"; if (latencyBudgetMs < 3000) return "neural"; return "deep"; } async function optimizedSearch(query: string, latencyBudgetMs: number) { const type = selectSearchType(latencyBudgetMs); const numResults = latencyBudgetMs < 500 ? 3 : latencyBudgetMs < 2000 ? 5 : 10; return exa.search(query, { type, numResults }); } ``` ### Step 2: Minimize Content Retrieval ```typescript // Each content option adds latency. Only request what you need...

Details

Author
jeremylongshore
Repository
jeremylongshore/claude-code-plugins-plus-skills
Created
7 months ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Featured

exa-cost-tuning

Optimize Exa costs through search type selection, caching, and usage monitoring. Use when analyzing Exa billing, reducing API costs, or implementing budget controls and usage alerts. Trigger with phrases like "exa cost", "exa billing", "reduce exa costs", "exa pricing", "exa expensive", "exa budget".

2,266 Updated today
jeremylongshore
AI & Automation Featured

exa-data-handling

Implement Exa search result processing, content extraction, caching, and RAG context management. Use when handling search results, implementing caching, building citation pipelines, or managing content payloads for LLM context windows. Trigger with phrases like "exa data", "exa results processing", "exa cache", "exa RAG context", "exa content extraction".

2,266 Updated today
jeremylongshore
AI & Automation Featured

exa-observability

Set up monitoring, metrics, and alerting for Exa search integrations. Use when implementing monitoring for Exa operations, building dashboards, or configuring alerting for search quality and latency. Trigger with phrases like "exa monitoring", "exa metrics", "exa observability", "monitor exa", "exa alerts", "exa dashboard".

2,266 Updated today
jeremylongshore
AI & Automation Featured

exa-architecture-variants

Choose and implement Exa architecture patterns at different scales: direct search, cached search, and RAG pipeline. Use when designing Exa integrations, choosing between simple search and full RAG, or planning architecture for different traffic volumes. Trigger with phrases like "exa architecture", "exa blueprint", "how to structure exa", "exa RAG design", "exa at scale".

2,266 Updated today
jeremylongshore
AI & Automation Featured

exa-reliability-patterns

Implement Exa reliability patterns: query fallback chains, circuit breakers, and graceful degradation. Use when building fault-tolerant Exa integrations, implementing fallback strategies, or adding resilience to production search services. Trigger with phrases like "exa reliability", "exa circuit breaker", "exa fallback", "exa resilience", "exa graceful degradation".

2,266 Updated today
jeremylongshore