data-quality

Solid

Use this skill when implementing data validation, data quality monitoring, data lineage tracking, data contracts, or Great Expectations test suites. Triggers on schema validation, data profiling, freshness checks, row-count anomalies, column drift, expectation suites, contract testing between producers and consumers, lineage graphs, data observability, and any task requiring data integrity enforcement across pipelines.

Data & Documents 164 stars 28 forks Updated yesterday MIT

Install

View on GitHub

Quality Score: 92/100

Stars 20%
74
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

When this skill is activated, always start your first response with the ๐Ÿงข emoji. # Data Quality Data quality is the practice of ensuring that data is accurate, complete, consistent, timely, and trustworthy as it flows through pipelines and systems. Without explicit quality gates, bad data propagates silently - corrupting dashboards, training flawed models, and breaking downstream consumers. This skill covers the five pillars: schema validation at ingress, expectation-based testing with Great Expectations, data contracts between producers and consumers, lineage tracking for impact analysis, and continuous monitoring for anomaly detection. --- ## When to use this skill Trigger this skill when the user: - Adds data validation or schema enforcement to a pipeline (ingestion, transformation, or serving) - Writes Great Expectations expectation suites or checkpoints - Defines data contracts between a producer team and consumer teams - Implements data lineage tracking or impact analysis - Sets up data quality monitoring dashboards or freshness/volume alerts - Investigates data quality incidents (missing columns, null spikes, schema drift) - Profiles a new dataset to understand distributions and anomalies - Builds row-count, freshness, or distribution-based quality checks Do NOT trigger this skill for: - General ETL/ELT pipeline orchestration (use an Airflow/dbt skill instead) - Data modeling or warehouse design decisions without a quality focus --- ## Key principles 1. **Val...

Details

Author
AbsolutelySkilled
Repository
AbsolutelySkilled/AbsolutelySkilled
Created
2 months ago
Last Updated
yesterday
Language
MDX
License
MIT

Related Skills