databricks-cost-tuning

Featured

Optimize Databricks costs with cluster policies, spot instances, and monitoring. Use when reducing cloud spend, implementing cost controls, or analyzing Databricks usage costs. Trigger with phrases like "databricks cost", "reduce databricks spend", "databricks billing", "databricks cost optimization", "cluster cost".

AI & Automation 2,266 stars 315 forks Updated today MIT

Install

View on GitHub

Quality Score: 99/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Databricks Cost Tuning ## Overview Reduce Databricks spending through cluster policies, spot instances, SQL warehouse right-sizing, and cost governance. Databricks charges per DBU (Databricks Unit) with rates varying by compute type: Jobs Compute (~$0.15/DBU), All-Purpose Compute (~$0.40/DBU), SQL Compute (~$0.22/DBU), Serverless (~$0.07/DBU). System tables (`system.billing.usage` and `system.billing.list_prices`) provide cost visibility. ## Prerequisites - Databricks Premium or Enterprise workspace - Access to `system.billing.usage` and `system.billing.list_prices` tables - Workspace admin for cluster policy creation ## Instructions ### Step 1: Identify Top Cost Drivers ```sql -- Top 10 most expensive resources this month SELECT cluster_id, COALESCE(usage_metadata.cluster_name, 'unnamed') AS cluster_name, sku_name, SUM(usage_quantity) AS total_dbus, ROUND(SUM(usage_quantity * p.pricing.default), 2) AS estimated_cost_usd FROM system.billing.usage u LEFT JOIN system.billing.list_prices p ON u.sku_name = p.sku_name WHERE u.usage_date >= date_trunc('month', current_date()) GROUP BY cluster_id, cluster_name, u.sku_name ORDER BY estimated_cost_usd DESC LIMIT 10; -- Cost by team (requires cluster tags) SELECT usage_metadata.cluster_tags.Team AS team, sku_name, ROUND(SUM(usage_quantity), 1) AS total_dbus, ROUND(SUM(usage_quantity * p.pricing.default), 2) AS cost_usd FROM system.billing.usage u LEFT JOIN system.billing.list_pric...

Details

Author
jeremylongshore
Repository
jeremylongshore/claude-code-plugins-plus-skills
Created
7 months ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category

Data & Documents Listed

cost-optimization-data

Query cost analysis, partition pruning, slot reservation strategies, storage tiering, and cloud data warehouse cost reduction. Use this skill whenever the cloud data bill is unexpectedly high, a specific query is scanning too much data, the team wants to understand what's driving BigQuery/Snowflake/Redshift costs, or when choosing between on-demand vs. reserved capacity. Also trigger when the user mentions bytes scanned, slot utilization, query cost, storage costs, Redshift concurrency, Snowflake credits, or when trying to set up cost alerts and budgets. If someone says "our BigQuery bill jumped" or "this query is expensive", this skill should be active immediately.

0 Updated 4 days ago
Methasit-Pun
AI & Automation Featured

databricks-observability

Set up comprehensive observability for Databricks with metrics, traces, and alerts. Use when implementing monitoring for Databricks jobs, setting up dashboards, or configuring alerting for pipeline health. Trigger with phrases like "databricks monitoring", "databricks metrics", "databricks observability", "monitor databricks", "databricks alerts", "databricks logging".

2,266 Updated today
jeremylongshore
AI & Automation Featured

snowflake-cost-tuning

Optimize Snowflake costs with resource monitors, warehouse auto-suspend, right-sizing, and credit consumption analysis. Use when analyzing Snowflake billing, reducing credit consumption, or implementing cost controls and budget alerts. Trigger with phrases like "snowflake cost", "snowflake billing", "reduce snowflake cost", "snowflake credits", "snowflake expensive", "snowflake budget".

2,266 Updated today
jeremylongshore
AI & Automation Featured

clickhouse-cost-tuning

Optimize ClickHouse Cloud costs — compute scaling, storage tiering, compression, and query efficiency for lower bills. Use when analyzing ClickHouse Cloud bills, reducing storage costs, or optimizing compute utilization. Trigger: "clickhouse cost", "clickhouse billing", "reduce clickhouse spend", "clickhouse pricing", "clickhouse expensive", "clickhouse storage cost".

2,266 Updated today
jeremylongshore
AI & Automation Solid

cost-optimizer-cloud-data-platforms

Analyzes and optimizes costs for cloud data platforms

1,034 Updated today
a5c-ai