groq-reference-architecture

Implement Groq reference architecture with model routing, streaming pipelines, and fallbacks. Use when designing new Groq integrations, reviewing project structure, or establishing architecture standards for Groq applications. Trigger with phrases like "groq architecture", "groq best practices", "groq project structure", "how to organize groq", "groq design".

Skill Content

# Groq Reference Architecture

## Overview

Production architecture for applications built on Groq's LPU inference API. Covers model routing by latency requirements, streaming pipelines, multi-provider fallback, and the middleware layer that ties it together.

## Architecture Diagram

```
┌──────────────────────────────────────────────────────────────┐
│                      Application Layer                       │
│  Chat UI  │  API Backend  │  Batch Processor  │    Agent     │
└─────┬─────┴──────┬────────┴────────┬──────────┴──────┬───────┘
      │            │                 │                 │
      ▼            ▼                 ▼                 ▼
┌──────────────────────────────────────────────────────────────┐
│                      Groq Service Layer                      │
│  ┌─────────────┐  ┌────────────┐  ┌───────────────────────┐  │
│  │ Model Router│  │ Middleware │  │ Fallback Chain        │  │
│  │             │  │            │  │                       │  │
│  │ speed →     │  │ Cache      │  │ Groq (primary)        │  │
│  │   8b-instant│  │ Rate Guard │  │   ↓ 429/5xx           │  │
│  │ quality →   │  │ Metrics    │  │ Groq (fallback model) │  │
│  │   70b-versa.│  │ Logging    │  │   ↓ still failing     │  │
│  │ vision →    │  │ Retry      │  │ OpenAI (backup)       │  │
│  │   llama-4   │  │            │  │   ↓ also failing      │  │
│  │ audio →     │  │            │  │ Graceful degrade      │  │
│  │   whisper   │  │            │  │ ...
```
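The Model Router and Fallback Chain boxes in the diagram can be sketched roughly as follows. This is a minimal, provider-agnostic illustration, not the skill's actual implementation. `ProviderError`, `route_model`, and `run_fallback_chain` are hypothetical names, the route values are the abbreviated labels from the diagram rather than full model IDs, and each step is a plain callable standing in for a real SDK call.

```python
from typing import Callable, Dict, Sequence, Tuple

# Abbreviated labels copied from the diagram (assumption: these are NOT
# full model IDs; replace them with the IDs available to your account).
ROUTES: Dict[str, str] = {
    "speed": "8b-instant",
    "quality": "70b-versa.",
    "vision": "llama-4",
    "audio": "whisper",
}


class ProviderError(Exception):
    """Hypothetical error type carrying an HTTP-like status code."""

    def __init__(self, status: int, message: str = "") -> None:
        super().__init__(message or f"provider returned {status}")
        self.status = status


def route_model(need: str) -> str:
    """Map a requirement ("speed", "quality", ...) to a model label."""
    try:
        return ROUTES[need]
    except KeyError:
        raise ValueError(f"unknown routing need: {need!r}") from None


def is_retryable(err: ProviderError) -> bool:
    """The diagram falls through on 429 and 5xx responses only."""
    return err.status == 429 or 500 <= err.status < 600


# A step is (name, callable); the callable takes a prompt and returns text.
Step = Tuple[str, Callable[[str], str]]


def run_fallback_chain(
    prompt: str,
    steps: Sequence[Step],
    degraded_reply: str = "Service is busy, please try again shortly.",
) -> Tuple[str, str]:
    """Try each step in order; return (step_name, reply).

    Retryable failures move on to the next step. Any other failure is
    re-raised, since switching providers would not fix it. If every step
    fails with a retryable error, return a canned degraded reply.
    """
    for name, call in steps:
        try:
            return name, call(prompt)
        except ProviderError as err:
            if not is_retryable(err):
                raise
    return "degraded", degraded_reply
```

In a real integration each step would wrap a client call (primary Groq model, fallback Groq model, backup provider) and translate that client's own exceptions into the status check used here. Keeping the chain as an ordered list of callables means the order can be changed or a step dropped without touching the retry logic.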

Details

Author: jeremylongshore
Repository: jeremylongshore/claude-code-plugins-plus-skills
Created: 8 months ago
Last Updated: today
Language: Python
License: MIT