great-expectations-validator

Solid

Data quality validation skill using Great Expectations for schema validation, expectation suites, data documentation, and automated data quality checks in ML pipelines.

AI & Automation 814 stars 53 forks Updated today MIT

Install

View on GitHub

Quality Score: 95/100

Stars 20%
97
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Great Expectations Validator Validate data quality using Great Expectations for comprehensive data testing, documentation, and quality monitoring. ## Overview This skill provides capabilities for data quality validation using Great Expectations (GX), the leading open-source library for data quality. It enables creation and execution of expectation suites, data documentation generation, and integration with ML pipelines. ## Capabilities ### Expectation Suite Management - Create and configure expectation suites - Define expectations for columns and tables - Validate data against expectations - Store and version expectation suites ### Data Validation - Schema validation (column presence, types) - Statistical validation (distributions, ranges) - Referential integrity checks - Custom SQL-based expectations - Regex pattern matching ### Data Documentation - Generate data documentation (Data Docs) - Create profiling reports - Document validation results - Build data dictionaries ### Pipeline Integration - Checkpoint configuration and execution - Batch request management - Action-based workflows (notifications, storage) - Integration with Airflow, Prefect, Dagster ### Custom Expectations - Define domain-specific expectations - Parameterized expectations - Multi-column expectations - Row-condition based expectations ## Prerequisites ### Installation ```bash pip install great_expectations>=0.18.0 ``` ### Optional Connectors ```bash # Database connectors pip install great_e...

Details

Author
a5c-ai
Repository
a5c-ai/babysitter
Created
4 months ago
Last Updated
today
Language
JavaScript
License
MIT

Related Skills