Linear Correlation

Statistics • Simple Linear Regression

Written by STEM Calculators Team Published December 27, 2025 Updated February 24, 2026

Linear Correlation (Pearson r)

Compute the sample linear correlation coefficient, visualize the relationship, and (optionally) test whether the population correlation is zero using the t distribution.

Paste two numeric columns (x, y) — one pair per line — separated by comma, tab, semicolon, or spaces. Example: 10, 22. You can also upload a CSV file.

Inputs

Significance level α

Alternative hypothesis

Delimiter

Upload CSV (optional)

Data (x, y)

Tip: the correlation coefficient always lies between −1 and 1. Values near 1 indicate strong positive linear association, near −1 strong negative linear association, and near 0 little to no linear association.

What different values of r look like

The panels below are illustrative patterns (perfect/none, and strong/weak). Your own data plot appears after you calculate.

Correlation patterns gallery Ready

Note: correlation describes strength of a linear association. A large |r| does not, by itself, prove causation.

Rate this calculator

0.0 /5 (0 ratings)

Be the first to rate.

Your rating

Name (optional) Review (optional)

You can update your rating any time.

Linear Correlation

This calculator computes the Pearson linear correlation coefficient for paired data \((x,y)\). It measures the strength and direction of a linear relationship between two variables.

What the calculator reports

Correlation coefficient \(r\), which always satisfies \(-1 \le r \le 1\). Values near \(1\) indicate strong positive linear association; values near \(-1\) indicate strong negative linear association; values near \(0\) indicate little linear association.
Coefficient of determination \(r^2\), which is the proportion of variation in \(y\) explained by a straight-line relationship with \(x\).
(Optional) A hypothesis test about the population correlation \(\rho\) using a \(t\)-statistic and a p-value.

Formulas used (summary)

The calculator uses the standard sample correlation formula:

\[ r = \frac{SS_{xy}}{\sqrt{SS_{xx}\,SS_{yy}}} \]

where

\[ SS_{xx}=\sum (x_i-\bar{x})^2,\quad SS_{yy}=\sum (y_i-\bar{y})^2,\quad SS_{xy}=\sum (x_i-\bar{x})(y_i-\bar{y}) \]

For testing \(H_0:\rho=0\) (with \(n\) pairs), the test statistic is:

\[ t = r\sqrt{\frac{n-2}{1-r^2}},\quad \text{df}=n-2 \]

How to use the calculator

Enter your paired data as two columns (x, y) in the textarea. You can separate values using commas, tabs, semicolons, or spaces.
Or upload a CSV file (the calculator will parse it and load the data). A header row like x,y is allowed.
(Optional) Choose \(\alpha\) and the alternative hypothesis if you want a correlation significance test.
Click Calculate to see the graphs first, then the computed results and step-by-step work.
Use Copy clean CSV (and other copy/download buttons) to export your cleaned data and results.

Input example

Paste data like this (one pair per line):

x,y
10,22
15,25
20,29
25,35

Notes and tips

Correlation is about linear patterns. A curved (nonlinear) relationship can produce \(r\) near \(0\) even if the variables are related.
Correlation does not imply causation. A strong \(r\) alone does not prove that changes in \(x\) cause changes in \(y\).
If all \(x\) values are identical (or all \(y\) values are identical), then \(SS_{xx}=0\) (or \(SS_{yy}=0\)) and \(r\) is undefined. The calculator will report an error in that case.

Frequently Asked Questions

What does the Pearson correlation coefficient r measure?

Pearson r measures the strength and direction of a linear relationship between two variables using paired (x, y) data. It ranges from -1 to 1, where values near 1 indicate strong positive linear association and values near -1 indicate strong negative linear association.

How is r calculated from x and y data?

The calculator uses r = SSxy / sqrt(SSxx x SSyy), where SSxx = sum((xi - xbar)^2), SSyy = sum((yi - ybar)^2), and SSxy = sum((xi - xbar)(yi - ybar)). These are corrected sums based on the sample means.

How do you test whether the population correlation rho is zero?

For n paired observations, the test uses t = r x sqrt((n - 2) / (1 - r^2)) with degrees of freedom df = n - 2. The p-value depends on whether the alternative is two-sided, right-tailed, or left-tailed.

What does r^2 mean in a correlation report?

r^2 is the coefficient of determination, interpreted as the proportion of variation in y that is explained by a straight-line relationship with x. It summarizes explained variability for a linear model but does not prove causation.

Why can r be undefined for some datasets?

If all x values are identical then SSxx = 0, and if all y values are identical then SSyy = 0, so the denominator in the r formula becomes zero. In those cases the correlation coefficient cannot be computed.

Linear Correlation

Linear Correlation (Pearson r)

Inputs

What different values of r look like

Results

Computation table

Key sums

Calculation steps

Rate this calculator

Frequently Asked Questions

Inputs

What different values of r look like

Results

Computation table

Key sums

Calculation steps

Rate this calculator

Frequently Asked Questions

Related calculators

Questions related to this topic