r
Statistical Tools
Correlation Coefficient Calculator
Browse all pages
Explained variance

R-Squared (r²): The Coefficient of Determination Explained

You found r = 0.70 and called it a strong correlation. But r² = 0.49 means 51% of the variation is still unexplained. Here is what that means for your research.

Quick definition

R-squared measures the proportion of variance in one variable that is explained by another variable. It ranges from 0 to 1, or 0% to 100%.

r² calculator & interpreter

Convert r into explained variance

Large effect
Your variance pie
49%explained
51% unexplained
r
0.70
r squared
0.49
49%
49% explained51% unexplained

X explains 49% of the variance in Y. The remaining 51% is due to other factors, measurement noise, or variables not captured in this simple relationship. The original correlation is positive, but r squared reports magnitude rather than direction.

Effect size: Large (Cohen's benchmark: r² >= .25)

What is r-squared?

R-squared (r²), also called the coefficient of determination, measures the proportion of variance in one variable that is explained by another variable. It ranges from 0 to 1, or from 0% to 100%.

r² valueMeaning
r squared = 0X explains none of the variation in Y. The model is no better than the mean.
0 < r squared < 1X explains part of the variation in Y. Higher values mean more explained variance.
r squared = 1X perfectly explains all variation in Y. This is rare outside controlled systems.
Variance pie

Think of variance as a pie

Even a strong correlation can leave a large part of the outcome unexplained. That is why research often needs multiple variables, not one single predictor.

Weak correlation
9%explained
91% unexplained
r = 0.30
r² = 0.09
Your current r
49%explained
51% unexplained
r = 0.70
r² = 0.49
Very strong correlation
81%explained
19% unexplained
r = 0.90
r² = 0.81
Explained Unexplained

Why r² is more honest than r

r is not a percentage. r² is. When you say r = 0.70, you are not saying 70% explained; you are saying 49% explained. The jump from r to r² is where most people's intuition breaks down.

r valueCommon intuitionr² valueActual explanatory power
0.30Weak correlation0.09Only 9% of variance explained
0.50Moderate correlation0.25Only 25% of variance explained
0.70Strong correlation0.49Only 49% of variance explained
0.90Very strong correlation0.8181% of variance explained
0.95Nearly perfect0.9090% of variance explained

What is a good r²?

It depends on your field. The current value is closest to the benchmark band for Education research, but field norms matter more than universal cutoffs.

Current value
r² = 0.49

X explains a substantial share of Y, but there is still meaningful residual variation.

Research fieldTypical r² rangeWhy
Physics and engineering0.90 to 0.99Systems are controlled and measurement noise is often low.
Medicine and physiological measures0.50 to 0.80Biological systems are patterned but still contain person-level variation.
Psychology and behavior0.10 to 0.40Human behavior usually has many interacting causes.
Social science and economics0.05 to 0.30Important causes are numerous and hard to control.
Financial markets0.01 to 0.15Markets are noisy, adaptive, and heavily affected by shocks.
Education research0.15 to 0.50Outcomes are moderately predictable but still shaped by many factors.
r² valueCohen effect sizeCorresponding r
0.01 or higherSmall0.10 or higher
0.09 or higherMedium0.30 or higher
0.25 or higherLarge0.50 or higher
APA templates

How to report r²

Use r² for a simple correlation or simple regression. In APA style, report decimals without a leading zero and usually round to two decimal places.

Current APA-style value: r² = .49

Simple linear regression

A simple linear regression was performed to examine the relationship between [X] and [Y]. The results indicated that [X] significantly predicted [Y], β = [β value], t([df]) = [t value], p = [p value]. The model explained a significant proportion of variance in [Y], r² = [value], F([df1], [df2]) = [F value], p = [p value].

Correlation report

There was a [positive/negative] correlation between [X] and [Y], r([df]) = [r value], p = [p value], r² = [value], indicating that [X] accounted for [r² × 100]% of the variance in [Y].

How to calculate r²

There are two common paths. If you already have a correlation coefficient, square it. If you are working from a regression model, compare the unexplained residual variation with the total variation in Y.

Path 1: from r

r2=r×rr^2 = r \times r

Example: r = 0.70 gives r² = 0.70 × 0.70 = 0.49.

Path 2: from regression residuals

R2=1SSresSStot=1(yiy^i)2(yiyˉ)2R^2 = 1 - \frac{SS_{res}}{SS_{tot}} = 1 - \frac{\sum (y_i - \hat{y}_i)^2}{\sum (y_i - \bar{y})^2}

SSresSS_{res} is residual variation the model missed. SStotSS_{tot} is the total variation in Y.

For simple linear regression with one predictor, both formulas give the same result. Formula 2 generalizes to multiple regression, where squaring a single correlation no longer describes the full model.

Three things r² does not tell you

Misread

High r² does not automatically mean a good model

A biased model can have high r². If the regression line systematically over-predicts at low values and under-predicts at high values, the model can still be wrong. Always check residual plots.

Misread

Low r² does not automatically mean useless

An r² of .06 in human behavior can be meaningful when the effect is reliable and scalable. Statistical significance and practical importance are different judgments.

Misread

r² is not causation

r² = .81 between two variables does not prove one causes the other. It measures explanatory power inside your model, not a causal mechanism in the world.

When to use adjusted R² instead

Every variable you add to a regression model increases regular R² or leaves it the same, even if the variable is noise. Adjusted R² penalizes unhelpful complexity.

Rˉ2=1(1R2)n1nk1\bar{R}^2 = 1 - (1 - R^2)\frac{n - 1}{n - k - 1}

Here kk is the number of predictors and nn is the sample size.

SituationUse which value?
Simple linear regression with one predictorr² and adjusted R² are usually close; use r² for plain explanation
Multiple regression with two or more predictorsUse adjusted R² so extra variables are penalized
Comparing models with different predictor countsUse adjusted R² because regular R² always rises or stays flat

FAQ

What does r-squared mean in simple terms?

R-squared tells you what percentage of the variation in Y can be explained by X. For example, r² = 0.64 means 64% of the differences in Y are accounted for by X, while 36% remains unexplained.

Is r-squared the same as correlation?

No. r is the correlation coefficient and ranges from -1 to +1. r² is its square and ranges from 0 to 1. r tells you direction and strength; r² tells you the proportion of variance explained.

What is a good r-squared value?

It depends on the field. In physics, r² below 0.90 may be weak. In psychology, r² around 0.15 can be meaningful because behavior is influenced by many variables.

Can r-squared be negative?

When calculated as r squared from a correlation coefficient, no. In regression, R² = 1 - SSres/SStot can be negative if a model fits worse than a mean-only model.

What is the difference between r² and adjusted R²?

Regular r² always increases or stays the same when you add predictors. Adjusted R² penalizes model complexity, so it only increases when a new predictor improves the model enough.