Loading…

Coefficient of Determination

Statistics • Simple Linear Regression

View all topics

Coefficient of Determination (r²)

Ready
Accepted separators: comma, tab, semicolon, or spaces. Empty lines are ignored.
The file should contain two columns: x and y (a header row is ok).
Use “Auto-detect” unless your file is unusual.
Vertical segments visualize \(y_i - \bar{y}\) (total variation, SST).
Vertical segments visualize \(e_i = y_i - \hat{y}_i\) (unexplained variation, SSE).
What does r² mean?
The coefficient of determination \(r^2\) is the proportion of the total variation in \(y\) explained by the regression on \(x\): \[ r^2 = \frac{\text{SSR}}{\text{SST}} = 1 - \frac{\text{SSE}}{\text{SST}}, \quad 0 \le r^2 \le 1. \]
Total variation (SST): mean line \(\bar{y}\)
Click a point to inspect its contributions.
Unexplained variation (SSE): regression line \(\hat{y}=a+bx\)
Decomposition: \(\text{SST}=\text{SSR}+\text{SSE}\)
r²: —

Calculation steps

Rate this calculator

0.0 /5 (0 ratings)
Be the first to rate.
Your rating
You can update your rating any time.

Frequently Asked Questions

What is the coefficient of determination R^2?

R^2 is the proportion of the total variation in the response variable y that is explained by the regression model. In simple linear regression it summarizes how well the line fits the data.

How is R^2 computed from SST and SSE?

A common formula is R^2 = 1 - SSE/SST, where SSE is the sum of squared residuals and SST is the total sum of squares about the mean of y. The calculator computes these from your data and the fitted line.

Does a high R^2 mean the regression model is correct?

A high R^2 indicates the model explains a large fraction of the variability in y, but it does not guarantee causation or that the linear form is appropriate. Always check the scatter plot and residual behavior.

Can R^2 be low even if the slope is statistically significant?

Yes, a slope can be statistically different from 0 while R^2 remains small, especially with large sample sizes or when the relationship is weak but consistent. R^2 measures explained variation, not statistical significance.

When is R^2 not meaningful?

R^2 can be misleading when the model form is wrong, when outliers dominate the fit, or when the data are non-linear. It is also not comparable across different response variables or different study designs without context.