pandas-dataframe-analyzer

Solid

Automated DataFrame analysis skill for statistical summaries, missing value detection, data type inference, and memory optimization recommendations.

AI & Automation 814 stars 53 forks Updated today MIT

Install

View on GitHub

Quality Score: 95/100

Stars 20%
97
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# pandas-dataframe-analyzer ## Overview Automated DataFrame analysis skill for statistical summaries, missing value detection, data type inference, and memory optimization recommendations using pandas and profiling libraries. ## Capabilities - Statistical profiling of DataFrames - Missing value pattern detection - Data type optimization suggestions - Memory footprint analysis - Duplicate detection and handling - Distribution analysis and visualization - Correlation matrix computation - Cardinality analysis for categorical features ## Target Processes - Exploratory Data Analysis (EDA) Pipeline - Data Collection and Validation Pipeline - Feature Engineering Design and Implementation ## Tools and Libraries - pandas - pandas-profiling / ydata-profiling - numpy - scipy (for statistical tests) ## Input Schema ```json { "type": "object", "required": ["dataPath"], "properties": { "dataPath": { "type": "string", "description": "Path to the data file (CSV, Parquet, JSON)" }, "sampleSize": { "type": "integer", "description": "Number of rows to sample for analysis", "default": 10000 }, "profileType": { "type": "string", "enum": ["minimal", "standard", "full"], "default": "standard" }, "outputFormat": { "type": "string", "enum": ["json", "html", "markdown"], "default": "json" } } } ``` ## Output Schema ```json { "type": "object", "required": ["summary", "columns", "rec...

Details

Author
a5c-ai
Repository
a5c-ai/babysitter
Created
4 months ago
Last Updated
today
Language
JavaScript
License
MIT

Related Skills