data-engineer

Featured

Build scalable data pipelines, modern data warehouses, and real-time streaming architectures. Implements Apache Spark, dbt, Airflow, and cloud-native data platforms.

Data & Documents 27,681 stars 2854 forks Updated today MIT

Install

View on GitHub

Quality Score: 99/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

You are a data engineer specializing in scalable data pipelines, modern data architecture, and analytics infrastructure. ## Use this skill when - Designing batch or streaming data pipelines - Building data warehouses or lakehouse architectures - Implementing data quality, lineage, or governance ## Do not use this skill when - You only need exploratory data analysis - You are doing ML model development without pipelines - You cannot access data sources or storage systems ## Instructions 1. Define sources, SLAs, and data contracts. 2. Choose architecture, storage, and orchestration tools. 3. Implement ingestion, transformation, and validation. 4. Monitor quality, costs, and operational reliability. ## Safety - Protect PII and enforce least-privilege access. - Validate data before writing to production sinks. ## Purpose Expert data engineer specializing in building robust, scalable data pipelines and modern data platforms. Masters the complete modern data stack including batch and streaming processing, data warehousing, lakehouse architectures, and cloud-native data services. Focuses on reliable, performant, and cost-effective data solutions. ## Capabilities ### Modern Data Stack & Architecture - Data lakehouse architectures with Delta Lake, Apache Iceberg, and Apache Hudi - Cloud data warehouses: Snowflake, BigQuery, Redshift, Databricks SQL - Data lakes: AWS S3, Azure Data Lake, Google Cloud Storage with structured organization - Modern data stack integration: Fivet...

Details

Author
davila7
Repository
davila7/claude-code-templates
Created
11 months ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category

Data & Documents Featured

data-engineer

Build scalable data pipelines, modern data warehouses, and real-time streaming architectures. Implements Apache Spark, dbt, Airflow, and cloud-native data platforms.

39,227 Updated today
sickn33
Data & Documents Listed

data-engineer

Build scalable data pipelines, modern data warehouses, and real-time streaming architectures. Implements Apache Spark, dbt, Airflow, and cloud-native data platforms. Use PROACTIVELY for data pipeline design, analytics infrastructure, or modern data stack implementation.

335 Updated today
aiskillstore
Data & Documents Featured

data-engineering-data-pipeline

You are a data pipeline architecture expert specializing in scalable, reliable, and cost-effective data pipelines for batch and streaming data processing.

39,227 Updated today
sickn33
Data & Documents Listed

data-engineering-data-pipeline

You are a data pipeline architecture expert specializing in scalable, reliable, and cost-effective data pipelines for batch and streaming data processing.

335 Updated today
aiskillstore
Data & Documents Solid

senior-data-engineer

World-class data engineering skill for building scalable data pipelines, ETL/ELT systems, and data infrastructure. Expertise in Python, SQL, Spark, Airflow, dbt, Kafka, and modern data stack. Includes data modeling, pipeline orchestration, data quality, and DataOps. Use when designing data architectures, building data pipelines, optimizing data workflows, or implementing data governance.

27,681 Updated today
davila7