A/B Price Test Sample Size Calculator

Calculate the minimum sample size needed for a statistically significant A/B price test. Set confidence level, power, and minimum detectable effect.

About the A/B Price Test Sample Size Calculator

Running a price test without enough traffic is like flipping a coin and calling it science. Our A/B Price Test Sample Size Calculator tells you exactly how many visitors or transactions each variant needs before you can trust the results. Enter your baseline conversion rate, the minimum change you want to detect, and your desired confidence and power levels — the tool returns a clear sample size per group along with estimated test duration.

Price experiments carry more risk than typical UX tests because every transaction affects real revenue. Under-powered tests lead to false positives that can lock in a worse price, while over-powered tests waste time that could be spent on other optimizations. This calculator uses the standard two-proportion z-test formula so you can plan experiments that are both efficient and reliable.

Whether you are testing a small SaaS pricing change or a major e-commerce markdown strategy, knowing the required sample size upfront prevents premature conclusions and protects your bottom line.

Why Use This A/B Price Test Sample Size Calculator?

Guessing when a price test has “enough data” is the most common mistake in pricing optimization. Ending an experiment too early often means acting on statistical noise, while running too long wastes traffic that could power the next test. This calculator removes the guesswork by applying established statistical formulas, helping you allocate resources efficiently and reach trustworthy conclusions every time.

How to Use This Calculator

  1. Enter your current (control) conversion rate as a percentage.
  2. Specify the minimum detectable effect — the smallest improvement worth detecting.
  3. Choose a significance level (commonly 5% for 95% confidence).
  4. Choose statistical power (commonly 80%).
  5. Optionally enter your daily traffic to estimate test duration.
  6. Read the required sample size per variant and total.
  7. Use the scenario table to compare different MDE and confidence combinations.

Formula

n = (Z_{α/2} + Z_β)² × [ p₁(1−p₁) + p₂(1−p₂) ] / (p₂ − p₁)²

Where:
  • n = required sample size per group
  • Z_{α/2} = z-score for the desired significance level (e.g. 1.96 for 95% confidence)
  • Z_β = z-score for the desired power (e.g. 0.842 for 80%)
  • p₁ = baseline conversion rate
  • p₂ = baseline + minimum detectable effect
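The formula translates directly into code. A minimal sketch in Python, using the standard library's `statistics.NormalDist` for the z-scores (the function name is illustrative, not part of the calculator):

```python
import math
from statistics import NormalDist

def sample_size_per_variant(p1: float, mde: float,
                            alpha: float = 0.05, power: float = 0.80) -> int:
    """Per-group sample size for a two-proportion z-test (fixed-horizon)."""
    p2 = p1 + mde
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # 1.96 for alpha = 0.05
    z_beta = NormalDist().inv_cdf(power)           # 0.842 for 80% power
    variance = p1 * (1 - p1) + p2 * (1 - p2)
    return math.ceil((z_alpha + z_beta) ** 2 * variance / mde ** 2)
```

Rounding up with `math.ceil` keeps the planned test at least as powerful as requested.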

Example Calculation

Result: ~19,740 per variant (≈39,480 total — approximately 40 days)

With a 3% baseline conversion rate and a desire to detect a 0.5 percentage-point lift to 3.5%, at 95% confidence and 80% power, each variant needs roughly 19,740 visitors. At 1,000 visitors per day split evenly between two variants (500 each), the test would take about 40 days.

Tips & Best Practices

Why Sample Size Matters for Price Tests

Pricing is one of the highest-leverage decisions a business can make. A well-run price test can reveal whether a higher price boosts revenue without losing conversions, or whether a lower price drives enough volume to compensate for thinner margins. But these insights are only useful if you can trust the data. Under-sized experiments produce noisy results that look convincing but don't replicate, leading to pricing mistakes that can persist for months or years.

Fixed-Horizon vs. Sequential Testing

This calculator uses the fixed-horizon approach: compute a sample size upfront, run the test until you hit it, then analyze. The main alternative is sequential testing (e.g., SPRT), which allows early stopping when results are conclusive. Sequential methods are powerful but require more statistical sophistication and infrastructure. For most teams, a fixed-horizon test with a pre-computed sample size is the simplest reliable approach.

Practical Tips for Pricing Experiments

Always randomize at the user level, not the session level, so returning visitors see a consistent price. Run the test for complete weeks to neutralize day-of-week effects. Log revenue and conversion data separately so you can analyze both. And document your hypothesis, MDE, and sample size plan before launching — post-hoc rationalizations are the enemy of rigorous experimentation.
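User-level randomization is typically implemented by hashing a stable user identifier, so assignment survives across sessions without storing any state. A minimal sketch (the experiment name and helper are illustrative, not a specific library's API):

```python
import hashlib

def assign_price_variant(user_id: str, experiment: str = "price-test") -> str:
    """Deterministic user-level bucketing: the same user always sees the same price."""
    digest = hashlib.sha256(f"{experiment}:{user_id}".encode()).hexdigest()
    return "control" if int(digest, 16) % 2 == 0 else "variant"
```

Because the assignment is a pure function of the user ID, a returning visitor gets a consistent price with no lookup table, and changing the experiment name reshuffles everyone for the next test.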

Frequently Asked Questions

What is minimum detectable effect (MDE)?

MDE is the smallest difference between control and variant conversion rates that your experiment can reliably pick up. A smaller MDE requires more traffic. Choose an MDE that represents a meaningful business impact — a 0.1 pp lift on a 3% baseline may not justify the engineering cost of the change.
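The traffic cost of a small MDE grows quadratically: halving the MDE roughly quadruples the required sample. A quick sketch using the standard two-proportion formula (helper name is illustrative):

```python
import math
from statistics import NormalDist

def n_per_variant(p1, mde, alpha=0.05, power=0.80):
    z = NormalDist().inv_cdf(1 - alpha / 2) + NormalDist().inv_cdf(power)
    p2 = p1 + mde
    return math.ceil(z ** 2 * (p1 * (1 - p1) + p2 * (1 - p2)) / mde ** 2)

for mde in (0.01, 0.005, 0.0025):  # 1.0 pp, 0.5 pp, 0.25 pp on a 3% baseline
    print(f"MDE {mde:.2%}: {n_per_variant(0.03, mde):,} per variant")
```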

What happens if I end the test early?

Stopping an experiment before reaching the required sample size inflates your false positive rate. You might conclude a price change works when it actually doesn't, or miss a real improvement. The calculated sample size assumes a fixed-horizon test — peeking at results and stopping early violates that assumption.
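The inflation from peeking is easy to see in simulation: run an A/A test (no real difference between arms), check for significance after every batch of traffic, and count how often any peek crosses the 95% threshold. A rough sketch under simplified assumptions (fixed batch sizes, pooled-variance z-test):

```python
import random
from statistics import NormalDist

Z_CRIT = NormalDist().inv_cdf(0.975)  # 1.96

def peeking_trial(n_per_peek=300, peeks=10, p=0.03):
    """A/A test: both arms convert at rate p; True if any peek looks 'significant'."""
    conv_a = conv_b = n_a = n_b = 0
    for _ in range(peeks):
        conv_a += sum(random.random() < p for _ in range(n_per_peek))
        conv_b += sum(random.random() < p for _ in range(n_per_peek))
        n_a += n_per_peek
        n_b += n_per_peek
        pooled = (conv_a + conv_b) / (n_a + n_b)
        se = (pooled * (1 - pooled) * (1 / n_a + 1 / n_b)) ** 0.5
        if se > 0 and abs(conv_a / n_a - conv_b / n_b) / se > Z_CRIT:
            return True  # would have stopped early and declared a winner
    return False

random.seed(42)
trials = 1000
rate = sum(peeking_trial() for _ in range(trials)) / trials
print(f"False positive rate with 10 peeks: {rate:.1%} (nominal: 5.0%)")
```

Even though each individual peek uses a nominal 5% threshold, taking many looks multiplies the chances of at least one spurious "win".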

Should I use 80% or 90% power?

80% power is the standard in most industries and balances sample size with reliability. It means there is a 20% chance you fail to detect a real effect. For high-stakes pricing decisions where a miss is costly, 90% power provides extra protection at the cost of roughly a third more traffic.
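That traffic figure falls straight out of the formula: only the Z_β term changes with power, so the required sample scales with (Z_{α/2} + Z_β)². A quick check:

```python
from statistics import NormalDist

z_alpha = NormalDist().inv_cdf(0.975)  # 95% confidence, two-sided
scale_80 = (z_alpha + NormalDist().inv_cdf(0.80)) ** 2
scale_90 = (z_alpha + NormalDist().inv_cdf(0.90)) ** 2
extra_traffic = scale_90 / scale_80 - 1
print(f"Moving from 80% to 90% power needs {extra_traffic:.0%} more traffic")
```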

How do I handle a revenue-based metric instead of conversion rate?

Revenue metrics have higher variance than binary conversion rates, so they require larger samples. This calculator focuses on conversion rate (binary outcome). For revenue per visitor, you would use a t-test formula with the standard deviation of revenue, which typically requires 2–5× more traffic than a conversion test.
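For a continuous metric like revenue per visitor, the analogous formula replaces the binomial variance with the metric's own variance. A sketch using the large-sample (z) approximation, where `sd` is your estimated standard deviation of per-visitor revenue, an input you would measure from historical data:

```python
import math
from statistics import NormalDist

def n_per_group_revenue(mean_diff: float, sd: float,
                        alpha: float = 0.05, power: float = 0.80) -> int:
    """Per-group sample size to detect `mean_diff` in a continuous metric."""
    z = NormalDist().inv_cdf(1 - alpha / 2) + NormalDist().inv_cdf(power)
    return math.ceil(2 * (z * sd / mean_diff) ** 2)
```

Because revenue standard deviations are often several times the mean, this routinely lands well above the equivalent conversion-rate sample.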

Can I test more than two prices at once?

Yes, but each additional variant increases the total sample needed. A three-variant test requires pairwise comparisons and a correction like Bonferroni to control the overall false positive rate. As a rule of thumb, multiply the two-variant sample by 1.5 for three variants.
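With a Bonferroni correction, each pairwise comparison simply runs at α divided by the number of comparisons, which feeds a larger z-score into the same formula. A sketch (three variants imply three pairwise comparisons; helper name is illustrative):

```python
import math
from statistics import NormalDist

def n_per_variant(p1, p2, alpha=0.05, power=0.80):
    z = NormalDist().inv_cdf(1 - alpha / 2) + NormalDist().inv_cdf(power)
    return math.ceil(z ** 2 * (p1 * (1 - p1) + p2 * (1 - p2)) / (p2 - p1) ** 2)

uncorrected = n_per_variant(0.03, 0.035)                 # two-variant test
bonferroni = n_per_variant(0.03, 0.035, alpha=0.05 / 3)  # one of 3 comparisons
```

Under this correction the per-variant requirement grows by roughly a third, on top of the extra traffic consumed by the third arm itself.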

What significance level should I use?

The standard is 5% (95% confidence), meaning there's a 5% chance of a false positive. For pricing tests with large revenue impact, some teams use 1% (99% confidence) for extra safety. Lower significance requires larger samples, so there's always a trade-off.
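The same scaling logic applies to the significance level: tightening α from 5% to 1% raises Z_{α/2} from 1.96 to about 2.58, and the sample grows with the squared sum of z-scores. A quick comparison:

```python
from statistics import NormalDist

z_beta = NormalDist().inv_cdf(0.80)  # 80% power
scale_95 = (NormalDist().inv_cdf(0.975) + z_beta) ** 2
scale_99 = (NormalDist().inv_cdf(0.995) + z_beta) ** 2
extra = scale_99 / scale_95 - 1
print(f"99% confidence needs {extra:.0%} more traffic than 95%")
```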

Related Pages