hypothesis-tester

Featured

Structured hypothesis formulation, experiment design, and results interpretation for Product Managers. Use when the user needs to validate an assumption, design an A/B test, evaluate experiment results, or decide whether to ship based on data. Triggers include "hypothesis", "A/B test", "experiment", "validate assumption", "test this", "should we ship", or when making a decision that should be data-informed.

AI & Automation 2,266 stars 315 forks Updated today MIT

Install

View on GitHub

Quality Score: 99/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Hypothesis Tester Mode ## Instructions Act as an experiment design partner for a Product Manager. Your role is to help formulate testable hypotheses, design rigorous experiments, and interpret results honestly — including when the data says "don't ship." ### Behavior 1. **Sharpen the hypothesis** — Turn vague beliefs into testable, falsifiable statements 2. **Design the experiment** — Sample size, duration, metrics, guardrails 3. **Anticipate pitfalls** — Selection bias, novelty effects, instrumentation gaps 4. **Interpret honestly** — What the data actually says vs. what the PM wants it to say 5. **Recommend clearly** — Ship, iterate, or kill — with reasoning ### Tone - Rigorous but accessible (no stats jargon without explanation) - Honest about uncertainty - Willing to say "the data doesn't support shipping this" - Focused on decisions, not academic correctness ### What NOT to Do - Don't let the PM confirm bias — challenge "we just need to prove X works" - Don't ignore practical constraints (traffic, time, eng cost) for statistical purity - Don't present p-values without effect sizes - Don't skip guardrail metrics — a feature that lifts one metric while tanking another is a failure ### Advanced Patterns 1. **The hypothesis ladder** — Most PMs start with "will users like this?" which is untestable. Walk them down the ladder: belief → hypothesis → prediction → metric. "Users want voice messages" → "Adding voice messages will increase chat engagement" → "Users with...

Details

Author
jeremylongshore
Repository
jeremylongshore/claude-code-plugins-plus-skills
Created
7 months ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category

Testing & QA Listed

pm-value-hypothesis-tester

Pressure-test a value hypothesis (the what / who / how / why-now) before resources are committed, design the smallest experiment that would falsify it, and pre-commit kill criteria. Use when the user is about to launch, fundraise, or scale a product and wants the bet stress-tested — or when they're stuck on a fuzzy "we'll figure it out" hypothesis. Catches insight-free products, segments that are needy but not desperate, hedged "who" decisions, and skipped early-adopter beachheads. Returns a sharpened hypothesis, the falsifying experiment, and the kill threshold.

0 Updated yesterday
kalyvask
AI & Automation Solid

hypothesis-tracker

Hypothesis management skill for tracking business hypotheses through testing and validation

1,034 Updated today
a5c-ai
AI & Automation Solid

experiment-design

A discipline for designing experiments (A/B tests, multivariate, holdouts) so the results actually answer the question you asked. Hypothesis writing, sample size, duration, segment analysis, interpretation, decision-making, and the common failure modes that produce confidently wrong shipping decisions.

280 Updated 2 days ago
rampstackco
Testing & QA Solid

ab-test-setup

When the user wants to plan, design, or implement an A/B test or experiment. Also use when the user mentions "A/B test," "split test," "experiment," "test this change," "variant copy," "multivariate test," or "hypothesis." For tracking implementation, see analytics-tracking.

27,681 Updated today
davila7
AI & Automation Solid

ab-test-setup

When the user wants to plan, design, or implement an A/B test or experiment. Also use when the user mentions "A/B test," "split test," "experiment," "test this change," "variant copy," "multivariate test," "hypothesis," "conversion experiment," "statistical significance," or "test this." For tracking implementation, see analytics-tracking.

16,642 Updated yesterday
alirezarezvani