castai-prod-checklist

Featured

Production readiness checklist for CAST AI cluster onboarding. Use when going live with CAST AI autoscaling, validating Phase 2 setup, or preparing for production cost optimization. Trigger with phrases like "cast ai production", "cast ai go-live", "cast ai checklist", "cast ai launch".

AI & Automation 2,266 stars 315 forks Updated today MIT

Install

View on GitHub

Quality Score: 99/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# CAST AI Production Checklist ## Overview Complete checklist for enabling CAST AI cost optimization on a production Kubernetes cluster. Covers Phase 1 (monitoring) through Phase 2 (full automation) with validation steps at each stage. ## Prerequisites - CAST AI tested on a staging cluster first - Production API key (Full Access) - Change management approval for node lifecycle changes ## Phase 1: Monitoring Only - [ ] Agent installed with read-only key - [ ] Agent pod healthy: `kubectl get pods -n castai-agent` - [ ] Console shows cluster as "Connected" - [ ] Savings report generating (wait 24h for full data) - [ ] Review savings estimate before enabling automation ## Phase 2: Autoscaling Enabled - [ ] Full Access API key provisioned and stored in secrets manager - [ ] Cluster controller installed - [ ] Evictor installed with conservative settings (non-aggressive) - [ ] Spot handler installed for graceful interruption handling - [ ] Autoscaler policies configured with appropriate limits: - [ ] `clusterLimits.cpu.maxCores` set to safe ceiling - [ ] `unschedulablePods.headroom` configured (10-15%) - [ ] `nodeDownscaler.emptyNodes.delaySeconds` >= 300 for production - [ ] `spotInstances.spotDiversityEnabled` = true - [ ] Node templates created for workload-specific needs (GPU, high-memory) - [ ] PodDisruptionBudgets set on all critical workloads ## Workload Autoscaler - [ ] Workload autoscaler installed - [ ] Critical deployments annotated with min/max resource...

Details

Author
jeremylongshore
Repository
jeremylongshore/claude-code-plugins-plus-skills
Created
7 months ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Featured

castai-core-workflow-a

Configure CAST AI autoscaler policies and node templates for cost optimization. Use when enabling Phase 2 automation, setting spot instance policies, or configuring node downscaler and evictor settings. Trigger with phrases like "cast ai autoscaler", "cast ai policies", "cast ai spot instances", "cast ai node optimization".

2,266 Updated today
jeremylongshore
AI & Automation Featured

castai-performance-tuning

Optimize CAST AI autoscaler performance, node provisioning speed, and API efficiency. Use when nodes take too long to provision, autoscaler is not reacting fast enough, or optimizing API call patterns for multi-cluster dashboards. Trigger with phrases like "cast ai performance", "cast ai slow", "cast ai node provisioning", "cast ai autoscaler speed".

2,266 Updated today
jeremylongshore
AI & Automation Featured

castai-install-auth

Install and configure CAST AI agent on a Kubernetes cluster with API key authentication. Use when onboarding a cluster to CAST AI, setting up Helm charts, or configuring Terraform provider authentication. Trigger with phrases like "install cast ai", "connect cluster to cast ai", "cast ai setup", "cast ai api key", "cast ai helm install".

2,266 Updated today
jeremylongshore
AI & Automation Featured

castai-core-workflow-b

Configure CAST AI Workload Autoscaler for pod-level right-sizing and VPA. Use when enabling workload autoscaling, configuring resource recommendations, or tuning pod CPU and memory requests with CAST AI. Trigger with phrases like "cast ai workload autoscaler", "cast ai pod sizing", "cast ai resource recommendations", "cast ai VPA".

2,266 Updated today
jeremylongshore
AI & Automation Featured

castai-cost-tuning

Maximize Kubernetes cost savings with CAST AI spot strategies and right-sizing. Use when analyzing cloud spend, optimizing spot-to-on-demand ratios, or configuring CAST AI for maximum savings. Trigger with phrases like "cast ai cost", "cast ai savings", "cast ai spot strategy", "reduce kubernetes cost", "cast ai budget".

2,266 Updated today
jeremylongshore