experiment-designer

Solid

Design statistically rigorous A/B tests and interpret experiment results. Use when asked to design an experiment, run an A/B test, calculate sample size, interpret test results, or assess whether an experiment was successful. Produces a complete experiment design with hypothesis, sample size, run time, success criteria, and risk flags — or a results interpretation with ship/iterate/kill recommendation.

Web & Frontend 915 stars 165 forks Updated 3 days ago MIT

Install

View on GitHub

Quality Score: 93/100

Stars 20%
99
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Experiment Designer Skill Produce rigorous experiment designs from product hypotheses, and interpret results with statistical and practical significance — so you can defend every decision to a sceptical engineering lead or data scientist. ## Required Inputs Ask the user for these if not provided: **For experiment design:** - Hypothesis (what change, what metric, what expected movement) - Current baseline metric value - Minimum detectable effect (MDE) — the smallest lift worth caring about - Available daily sample size **For results interpretation:** - Control and variant results (raw numbers or percentages) - P-value or confidence interval - Run duration (days) - Any anomalies observed during the test ## Two-Phase Process ### Phase 1: Experiment Design 1. Restate hypothesis as: "If we [change], we expect [metric] to [move by X%] because [reason]" 2. Define control and variant clearly 3. Select primary metric (one only) and secondary guardrail metrics (2-3 max) 4. Calculate required sample size from MDE and baseline 5. Estimate run time in days 6. Set pre-defined success criteria before the test runs — no moving goalposts 7. Flag design risks: novelty effects, seasonal confounds, multiple testing issues, network effects, sample ratio mismatch ### Phase 2: Results Interpretation 1. Assess statistical significance (p < 0.05 threshold) 2. Assess practical significance: was the lift meaningful for the business, not just real? 3. Interpret confidence intervals 4. Investiga...

Details

Author
mohitagw15856
Repository
mohitagw15856/pm-claude-skills
Created
4 months ago
Last Updated
3 days ago
Language
Shell
License
MIT

Similar Skills

Semantically similar based on skill content — not just same category

Web & Frontend Listed

experiment-designer

Use when planning product experiments, writing testable hypotheses, estimating sample size, prioritizing tests, or interpreting A/B outcomes with practical statistical rigor.

0 Updated 2 days ago
SanctifiedOps
AI & Automation Solid

experiment-designer

Use when planning product experiments, writing testable hypotheses, estimating sample size, prioritizing tests, or interpreting A/B outcomes with practical statistical rigor.

16,642 Updated yesterday
alirezarezvani
Web & Frontend Listed

experiment-design

Design hypothesis-driven experiments and A/B tests with proper methodology. Use when asked to design an A/B test, validate a hypothesis, plan an experiment, or set up a test for a product change. Covers hypothesis writing, sample size, and common mistakes.

2 Updated today
AashutoshR2062
AI & Automation Solid

experiment-design

A discipline for designing experiments (A/B tests, multivariate, holdouts) so the results actually answer the question you asked. Hypothesis writing, sample size, duration, segment analysis, interpretation, decision-making, and the common failure modes that produce confidently wrong shipping decisions.

280 Updated 2 days ago
rampstackco
Testing & QA Solid

ab-test-planner

Design statistically rigorous A/B tests for product features, UI changes, onboarding flows, and pricing experiments. Use when asked to set up an experiment, design an A/B test, calculate sample size, or interpret test results. Produces a complete test plan with hypothesis, variant definitions, sample size, duration estimate, guardrail metrics, and a results interpretation guide.

915 Updated 3 days ago
mohitagw15856