vastai-cost-tuning

Solid

Optimize Vast.ai GPU cloud costs through smart instance selection and lifecycle management. Use when analyzing GPU spending, reducing training costs, or implementing budget controls for Vast.ai workloads. Trigger with phrases like "vastai cost", "vastai billing", "reduce vastai costs", "vastai pricing", "vastai budget".

AI & Automation 2,266 stars 315 forks Updated today MIT

Install

View on GitHub

Quality Score: 99/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Vast.ai Cost Tuning ## Overview Minimize Vast.ai GPU cloud costs by choosing the right GPU for your workload, leveraging interruptible (spot) instances, eliminating idle compute, and implementing auto-destroy safeguards. Vast.ai pricing is dynamic and varies significantly: RTX 4090 ($0.15-0.30/hr), A100 80GB ($1.00-2.00/hr), H100 SXM ($2.50-4.00/hr). ## Prerequisites - Vast.ai account with billing history - Understanding of your workload's GPU requirements - `vastai` CLI installed ## Instructions ### Step 1: GPU Selection by Cost-Efficiency ```python # Compare cost-per-TFLOP across GPU types GPU_SPECS = { "RTX_4090": {"fp16_tflops": 82.6, "vram": 24}, "A100": {"fp16_tflops": 77.97, "vram": 80}, "H100_SXM": {"fp16_tflops": 267, "vram": 80}, "RTX_3090": {"fp16_tflops": 35.6, "vram": 24}, "A6000": {"fp16_tflops": 38.7, "vram": 48}, } def cost_per_tflop(gpu_name, dph): specs = GPU_SPECS.get(gpu_name, {"fp16_tflops": 1}) return dph / specs["fp16_tflops"] # Often RTX 4090 is the best value for inference # A100 is best for training large models needing >24GB VRAM # H100 is best only when wall-clock time justifies 10x price premium ``` ### Step 2: Spot vs On-Demand Analysis ```bash # Interruptible (spot) instances are 30-60% cheaper vastai search offers 'num_gpus=1 gpu_name=RTX_4090 rentable=true' \ --order dph_total --limit 5 # Compare interruptible vs on-demand pricing # Use interruptible for: batch inference, checkpointed t...

Details

Author
jeremylongshore
Repository
jeremylongshore/claude-code-plugins-plus-skills
Created
7 months ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Solid

vastai-performance-tuning

Optimize Vast.ai GPU instance selection, startup time, and training throughput. Use when optimizing instance selection, reducing startup latency, or maximizing GPU utilization on rented hardware. Trigger with phrases like "vastai performance", "optimize vastai", "vastai slow", "vastai gpu utilization", "vastai throughput".

2,266 Updated today
jeremylongshore
AI & Automation Solid

vastai-migration-deep-dive

Migrate GPU workloads to or from Vast.ai, or between GPU providers. Use when switching from AWS/GCP/Azure GPU instances to Vast.ai, migrating between GPU types, or re-platforming ML infrastructure. Trigger with phrases like "migrate to vastai", "vastai migration", "switch to vastai", "vastai from aws", "vastai from lambda".

2,266 Updated today
jeremylongshore
AI & Automation Solid

vast-gpu

Rent, manage, and destroy GPU instances on vast.ai. Use when user says "rent gpu", "vast.ai", "rent a server", "cloud gpu", or needs on-demand GPU without owning hardware.

11,051 Updated today
wanshuiyin
AI & Automation Solid

vastai-hello-world

Rent your first GPU instance on Vast.ai and run a workload. Use when starting a new Vast.ai integration, testing your setup, or learning basic Vast.ai GPU rental patterns. Trigger with phrases like "vastai hello world", "vastai example", "vastai quick start", "rent first gpu", "vastai first instance".

2,266 Updated today
jeremylongshore
AI & Automation Solid

vastai-observability

Monitor Vast.ai GPU instance health, utilization, and costs. Use when setting up monitoring dashboards, configuring alerts, or tracking GPU utilization and spending. Trigger with phrases like "vastai monitoring", "vastai metrics", "vastai observability", "monitor vastai", "vastai alerts".

2,266 Updated today
jeremylongshore