r
Statistical Tools
Correlation Coefficient Calculator
Browse all pages
Power analysis before data collection

Sample Size Calculator for Correlation

Running a study with n=20 and hoping to detect r=0.3? You have less than a one-in-three chance of finding it. Know the right number before you start.

Quick answer

To detect a moderate correlation around r = 0.30 with 80% power at alpha = 0.05, plan for about 85 participants in a two-tailed study.

Tails
Required sample size
n = 85

To detect r = 0.300 with 80% power at alpha = 0.05 (two-tailed), you need at least 85 participants.

Expected r
0.300
Alpha
0.05
Tails
Two

At n = 85, the achieved power is 80.0%. Recruit more than this if you expect exclusions or missing data.

Power curve

The curve uses your expected r = 0.300.

Power at n=85: 80.0%
0.20.40.60.81.020100200300400500target 80%n=85PowerSample size (n)
Feasibility check

You should plan around n = 85

Add a recruitment buffer for missing data, exclusions, and dropouts. A practical target is about n = 98.

Lab study

Challenging but achievable. Most labs should plan multiple recruitment waves.

Survey study

Very feasible. A well-screened online survey can usually reach this range.

Clinical study

Plan carefully. Patient recruitment can take months for this sample size.

Parameter sensitivity

Small changes in expected r or power can change recruitment by hundreds of people. Use this table to test whether your design is realistic.

Expected ralpha = 0.05, power = 0.80alpha = 0.05, power = 0.90alpha = 0.01, power = 0.80
r = 0.10very weakn = 783n = 1,047n = 1,164
r = 0.20weakn = 194n = 259n = 288
r = 0.30moderaten = 85n = 113n = 125
r = 0.50strongn = 30n = 38n = 42
r = 0.70very strongn = 14n = 17n = 19

The four numbers that determine your study's fate

Correlation power analysis is not a black box. You are deciding four numbers: how large an effect you expect, how much false-positive risk you accept, how much power you want, and how many people you are able to recruit.

Expected correlation r

Effect size

This is the most important number. Base it on prior literature in your field, not on a guess that merely feels comfortable.

Cohen's old benchmarks: r = 0.10 small, 0.30 medium, 0.50 large.

Interpret r value
False positive rate

Alpha

Alpha is the chance you accept for a false alarm when there is really no relationship. Most studies use 0.05.

Use 0.01 when the decision is high-stakes or you have many comparisons.

See the p-value guide
True positive rate

Power

Power is the chance of detecting a real relationship if it actually exists. The usual planning target is 0.80.

If the study matters a lot, 0.90 is a safer target.

Why power matters
The number you solve for

Sample size

More participants do not just make results stronger. They make modest correlations detectable at all.

Small studies can still work, but only for stronger effects.

Understand Pearson r

How the calculation works

Correlation coefficients cannot be plugged directly into power formulas because their sampling distribution is skewed. Fisher's z-transformation makes the planning math approximately normal.

z=12ln(1+r1r)=arctanh(r)z = \frac{1}{2}\ln\left(\frac{1+r}{1-r}\right) = \operatorname{arctanh}(r)
n=(z1α/2+z1βzrzr0)2+3n = \left(\frac{z_{1-\alpha/2} + z_{1-\beta}}{z_r - z_{r_0}}\right)^2 + 3
Step 1

Choose the expected r

Use prior studies or a conservative literature-based estimate.

Step 2

Set alpha and power

Most planning uses alpha = 0.05 and power = 0.80.

Step 3

Transform with Fisher's z

The z scale behaves much better than raw correlations for planning.

Step 4

Round up

If the formula gives 84.93, recruit 85 and then add a buffer.

Example: with r = 0.30, alpha = 0.05, and 80% power, the calculation lands at about n = 85.

Quick reference: sample sizes for common scenarios

These are planning estimates for a two-tailed test with alpha = 0.05 and 80% power. Stronger expected correlations need fewer participants. Weaker effects need many more.

ScenarioExpected rApprox. nWhy it matters
Psychology scale validation0.3562Typical self-report and validation work
Biomedical biomarker association0.5030Stronger clinical relationships
Education research0.25124Common academic and classroom correlations
Economics variables0.15347Usually weak-to-small effects
Genetic association0.081,225Very small but sometimes real effects
Exploratory study0.3085A conservative planning value

Rule of thumb: never plan a correlation study on fewer than 30 people unless the expected effect is very large.

Five sample size mistakes that sink a study

Mistake

"Post-hoc power analysis"

Power analysis is a planning tool. If you already have the data, checking power after the fact does not rescue a weak result.

Mistake

"Blindly using Cohen's benchmark"

r = 0.30 is a placeholder, not a law. Real studies in your field may need a very different expected effect size.

Mistake

"Ignoring multiple comparisons"

If you run many correlations, Bonferroni or a similar correction lowers alpha and raises the sample size you need.

Mistake

"Confusing significance with usefulness"

With a large enough n, tiny correlations can become significant while still explaining almost no variance.

Mistake

"Forgetting attrition and missing data"

Add a buffer of about 10% to 20% so dropouts, exclusions, and bad rows do not undercut your design.

FAQ

What is the minimum sample size for a correlation study?

There is no universal minimum, but n = 30 is often treated as a rough floor. For r around 0.30 with 80% power at alpha = 0.05, you need about 85 participants.

Is r = 0.30 a small or medium effect?

By Cohen's conventions, r = 0.10 is small, r = 0.30 is medium, and r = 0.50 is large. Treat those as starting points, not final truth.

Can I use this calculator for Spearman correlation?

Yes, as a planning approximation. The same Fisher z framework is commonly used when planning rank-based correlation studies.

What if my expected correlation comes from a pilot study?

Pilot correlations are often inflated. A conservative shrinkage estimate is safer than planning the main study around the raw pilot result.

One-tailed or two-tailed?

Two-tailed is the default. Use one-tailed only when your direction was fixed in advance and reversing direction would not count as success.