ml-pipeline

Solid

Designs and implements production-grade ML pipeline infrastructure: configures experiment tracking with MLflow or Weights & Biases, creates Kubeflow or Airflow DAGs for training orchestration, builds feature store schemas with Feast, deploys model registries, and automates retraining and validation workflows. Use when building ML pipelines, orchestrating training workflows, automating model lifecycle, implementing feature stores, managing experiment tracking systems, setting up DVC for data versioning, tuning hyperparameters, or configuring MLOps tooling like Kubeflow, Airflow, MLflow, or Prefect.

AI & Automation 9,509 stars 807 forks Updated 1 weeks ago MIT

Install

View on GitHub

Quality Score: 94/100

Stars 20%
100
Recency 20%
90
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# ML Pipeline Expert Senior ML pipeline engineer specializing in production-grade machine learning infrastructure, orchestration systems, and automated training workflows. ## Core Workflow 1. **Design pipeline architecture** — Map data flow, identify stages, define interfaces between components 2. **Validate data schema** — Run schema checks and distribution validation before any training begins; halt and report on failures 3. **Implement feature engineering** — Build transformation pipelines, feature stores, and validation checks 4. **Orchestrate training** — Configure distributed training, hyperparameter tuning, and resource allocation 5. **Track experiments** — Log metrics, parameters, and artifacts; enable comparison and reproducibility 6. **Validate and deploy** — Run model evaluation gates; implement A/B testing or shadow deployment before promotion ## Reference Guide Load detailed guidance based on context: | Topic | Reference | Load When | |-------|-----------|-----------| | Feature Engineering | `references/feature-engineering.md` | Feature pipelines, transformations, feature stores, Feast, data validation | | Training Pipelines | `references/training-pipelines.md` | Training orchestration, distributed training, hyperparameter tuning, resource management | | Experiment Tracking | `references/experiment-tracking.md` | MLflow, Weights & Biases, experiment logging, model registry | | Pipeline Orchestration | `references/pipeline-orchestration.md` | Kubeflow Pipeli...

Details

Author
Jeffallan
Repository
Jeffallan/claude-skills
Created
7 months ago
Last Updated
1 weeks ago
Language
Python
License
MIT

Similar Skills

Semantically similar based on skill content — not just same category