coreweave-reference-architecture

Featured

Reference architecture for CoreWeave GPU cloud deployments. Use when designing ML infrastructure, planning multi-model serving, or establishing CoreWeave deployment standards. Trigger with phrases like "coreweave architecture", "coreweave design", "coreweave infrastructure", "coreweave best practices".

AI & Automation 2,266 stars 315 forks Updated today MIT

Install

View on GitHub

Quality Score: 99/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# CoreWeave Reference Architecture ## Architecture Diagram ``` ┌─────────────────────┐ │ Load Balancer │ │ (Ingress/LB) │ └──────────┬──────────┘ │ ┌────────────────┼────────────────┐ │ │ │ ┌────────▼──────┐ ┌──────▼────────┐ ┌─────▼───────┐ │ Model A │ │ Model B │ │ Model C │ │ (vLLM, A100) │ │ (TGI, H100) │ │ (SD, L40) │ │ 2 replicas │ │ 1 replica │ │ 3 replicas │ └───────────────┘ └───────────────┘ └─────────────┘ │ │ │ ┌────────▼────────────────▼────────────────▼───────┐ │ Shared Storage (PVC) │ │ Models / Checkpoints / Data │ └──────────────────────────────────────────────────┘ ``` ## Project Structure ``` ml-platform/ ├── k8s/ │ ├── base/ # Shared templates │ ├── models/ │ │ ├── llama-8b/ # Per-model manifests │ │ ├── llama-70b/ │ │ └── stable-diffusion/ │ └── infra/ │ ├── storage.yaml # PVCs │ ├── secrets.yaml # Model tokens │ └── monitoring.yaml # Prometheus rules ├── containers/ │ ├── vllm/Dockerfile │ └── custom-server/Dockerfile ├── scripts/ │ ├── deploy.sh │ └── benchmark.sh └── monitoring/ ├── grafana-dashboards/ └── ...

Details

Author
jeremylongshore
Repository
jeremylongshore/claude-code-plugins-plus-skills
Created
7 months ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Featured

coreweave-deploy-integration

Deploy inference services on CoreWeave with Helm charts and Kustomize. Use when deploying multi-model inference, managing GPU deployments at scale, or templating CoreWeave manifests. Trigger with phrases like "deploy coreweave", "coreweave helm", "coreweave kustomize", "coreweave deployment patterns".

2,266 Updated today
jeremylongshore
AI & Automation Featured

coreweave-local-dev-loop

Set up local development workflow for CoreWeave GPU deployments. Use when building containers locally, testing YAML manifests, or iterating on model serving configurations before deploying. Trigger with phrases like "coreweave dev setup", "coreweave local testing", "develop for coreweave", "coreweave container build".

2,266 Updated today
jeremylongshore
AI & Automation Featured

coreweave-security-basics

Secure CoreWeave deployments with RBAC, network policies, and secrets management. Use when hardening GPU workloads, managing model access, or configuring namespace isolation. Trigger with phrases like "coreweave security", "coreweave rbac", "secure coreweave", "coreweave secrets".

2,266 Updated today
jeremylongshore
AI & Automation Solid

coreweave-performance-tuning

Optimize CoreWeave GPU inference latency and throughput. Use when reducing inference latency, maximizing GPU utilization, or tuning batch sizes and concurrency. Trigger with phrases like "coreweave performance", "coreweave latency", "coreweave throughput", "optimize coreweave inference".

2,266 Updated today
jeremylongshore
AI & Automation Solid

coreweave-migration-deep-dive

Migrate ML workloads from AWS/GCP/Azure to CoreWeave GPU cloud. Use when moving inference services from hyperscaler GPU instances, migrating training pipelines, or evaluating CoreWeave vs cloud GPU costs. Trigger with phrases like "migrate to coreweave", "coreweave migration", "move from aws to coreweave", "coreweave vs aws gpu".

2,266 Updated today
jeremylongshore