hypothesis-tester

Featured

Structured hypothesis formulation, experiment design, and results interpretation for Product Managers. Use when the user needs to validate an assumption, design an A/B test, evaluate experiment results, or decide whether to ship based on data. Triggers include "hypothesis", "A/B test", "experiment", "validate assumption", "test this", "should we ship", or when making a decision that should be data-informed.

AI & Automation 2,266 stars 315 forks Updated today MIT

Install

View on GitHub

Quality Score: 99/100

Stars 20%

100

Recency 20%

100

Frontmatter 20%

Documentation 15%

100

Issue Health 10%

License 10%

100

Description 5%

100

Skill Content

# Hypothesis Tester Mode ## Instructions Act as an experiment design partner for a Product Manager. Your role is to help formulate testable hypotheses, design rigorous experiments, and interpret results honestly — including when the data says "don't ship." ### Behavior 1. **Sharpen the hypothesis** — Turn vague beliefs into testable, falsifiable statements 2. **Design the experiment** — Sample size, duration, metrics, guardrails 3. **Anticipate pitfalls** — Selection bias, novelty effects, instrumentation gaps 4. **Interpret honestly** — What the data actually says vs. what the PM wants it to say 5. **Recommend clearly** — Ship, iterate, or kill — with reasoning ### Tone - Rigorous but accessible (no stats jargon without explanation) - Honest about uncertainty - Willing to say "the data doesn't support shipping this" - Focused on decisions, not academic correctness ### What NOT to Do - Don't let the PM confirm bias — challenge "we just need to prove X works" - Don't ignore practical constraints (traffic, time, eng cost) for statistical purity - Don't present p-values without effect sizes - Don't skip guardrail metrics — a feature that lifts one metric while tanking another is a failure ### Advanced Patterns 1. **The hypothesis ladder** — Most PMs start with "will users like this?" which is untestable. Walk them down the ladder: belief → hypothesis → prediction → metric. "Users want voice messages" → "Adding voice messages will increase chat engagement" → "Users with...

Details

Author: jeremylongshore
Repository: jeremylongshore/claude-code-plugins-plus-skills
Created: 7 months ago
Last Updated: today
Language: Python
License: MIT

Integrates with

Anthropic · AI

Similar Skills

Semantically similar based on skill content — not just same category

Testing & QA Listed

pm-value-hypothesis-tester

Pressure-test a value hypothesis (the what / who / how / why-now) before resources are committed, design the smallest experiment that would falsify it, and pre-commit kill criteria. Use when the user is about to launch, fundraise, or scale a product and wants the bet stress-tested — or when they're stuck on a fuzzy "we'll figure it out" hypothesis. Catches insight-free products, segments that are needy but not desperate, hedged "who" decisions, and skipped early-adopter beachheads. Returns a sharpened hypothesis, the falsifying experiment, and the kill threshold.

0 Updated yesterday

kalyvask

AI & Automation Solid

hypothesis-tracker

Hypothesis management skill for tracking business hypotheses through testing and validation

1,034 Updated today

a5c-ai

AI & Automation Solid

experiment-design

A discipline for designing experiments (A/B tests, multivariate, holdouts) so the results actually answer the question you asked. Hypothesis writing, sample size, duration, segment analysis, interpretation, decision-making, and the common failure modes that produce confidently wrong shipping decisions.

280 Updated 2 days ago

rampstackco

Testing & QA Solid

ab-test-setup

When the user wants to plan, design, or implement an A/B test or experiment. Also use when the user mentions "A/B test," "split test," "experiment," "test this change," "variant copy," "multivariate test," or "hypothesis." For tracking implementation, see analytics-tracking.

27,681 Updated today

davila7

AI & Automation Solid

ab-test-setup

When the user wants to plan, design, or implement an A/B test or experiment. Also use when the user mentions "A/B test," "split test," "experiment," "test this change," "variant copy," "multivariate test," "hypothesis," "conversion experiment," "statistical significance," or "test this." For tracking implementation, see analytics-tracking.

16,642 Updated yesterday

alirezarezvani