Runs Test for Randomness

Statistics • Nonparametric Methods

Written by STEM Calculators Team • Published December 27, 2025 • Updated February 24, 2026

The runs test is a nonparametric procedure used to check whether the order of observations in a sequence is “random-like.” It does not test normality or independence directly; instead, it evaluates whether the sequence shows too much clustering (too few runs) or too much alternation (too many runs) compared with what we would expect under randomness.

What is a run?

A run is a maximal block of identical symbols occurring consecutively. For example, the sequence \[ +\;+\;-\;-\;-\;+\;-\;-\;+\;+\;+ \] has five runs, \((++)\), \((---)\), \((+)\), \((--)\), \((+++)\), so the number of runs is \(R=5\).
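As a small illustration of the definition, the sketch below splits a sequence into its runs. The function name `split_runs` is hypothetical, and the example assumes the symbols are typed as the ASCII characters `+` and `-`.

```python
from itertools import groupby


def split_runs(sequence):
    """Return the maximal blocks of identical consecutive symbols."""
    # groupby starts a new group every time the symbol changes,
    # which is exactly the definition of a run.
    return ["".join(block) for _, block in groupby(sequence)]


runs = split_runs("++---+--+++")
print(runs)       # ['++', '---', '+', '--', '+++']
print(len(runs))  # 5
```

The length of the returned list is the run count \(R\) used throughout the rest of the test.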

Inputs and sequence construction

The test requires a binary sequence of “+” and “−”. This calculator supports two common ways to build it:

Binary mode: You enter the symbols directly (e.g., H/T, 0/1, +/−, or custom tokens). The calculator maps your chosen symbols into “+” and “−”.
Numeric mode: You enter a numeric series \(x_1,x_2,\dots,x_n\) and convert it into a binary sequence using a cutoff \(c\):
- Median rule: \(c = \mathrm{median}(x)\)
- Threshold rule: \(c\) is a user-chosen value
Then define \[ \begin{aligned} S_i &= \begin{cases} +, & x_i > c, \\ -, & x_i < c. \end{cases} \end{aligned} \] If \(x_i=c\), you may drop that observation (recommended) or force it into “+” or “−”.

Notation

After conversion, let:

\[ \begin{aligned} n_+ &= \#\{i : S_i=+\}, \\ n_- &= \#\{i : S_i=-\}, \\ N &= n_+ + n_-. \end{aligned} \]

The test requires both categories to appear (\(n_+>0\) and \(n_->0\)).

Hypotheses

The null hypothesis is that the sequence is random-like (with fixed counts \(n_+\) and \(n_-\)). The alternative can be selected depending on the pattern you suspect:

\[ \begin{aligned} H_0 &: \text{The sequence order is random-like (in terms of runs)} \\ H_1 &: \text{The sequence is not random-like (two-sided)} \\ H_1 &: \text{Too few runs (clustering)} \\ H_1 &: \text{Too many runs (oscillation)} \end{aligned} \]

Counting the number of runs

The number of runs \(R\) can be computed by counting transitions:

\[ \begin{aligned} R &= 1 + \sum_{i=2}^{N} \mathbf{1}(S_i \ne S_{i-1}). \end{aligned} \]
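The transition formula translates directly into code. In this minimal sketch, `count_runs` is a hypothetical helper name and the sequence is assumed to be a string of `+` and `-` characters.

```python
def count_runs(sequence):
    """Count runs as 1 plus the number of positions where the symbol changes."""
    if not sequence:
        return 0
    # Pair each symbol with its predecessor and count the changes.
    transitions = sum(1 for prev, cur in zip(sequence, sequence[1:]) if cur != prev)
    return 1 + transitions


print(count_runs("++---+--+++"))  # 5
```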

Normal approximation (z method)

When the sample is moderately large (a common rule of thumb is that both \(n_+\) and \(n_-\) exceed about 10), a normal approximation is commonly used. Under \(H_0\), the expected number of runs and its variance are:

\[ \begin{aligned} \mu_R &= \frac{2n_+ n_-}{n_+ + n_-} + 1, \\ \sigma_R^2 &= \frac{2n_+ n_-(2n_+ n_- - n_+ - n_-)}{(n_+ + n_-)^2 (n_+ + n_- - 1)}. \end{aligned} \]

The standardized statistic is

\[ \begin{aligned} z &= \frac{R - \mu_R}{\sigma_R}. \end{aligned} \]
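To make the formulas concrete, the sketch below evaluates \(\mu_R\), \(\sigma_R^2\), and \(z\) for the earlier example sequence (\(n_+=6\), \(n_-=5\), \(R=5\)). The function names are illustrative. That sample is far too small for the normal approximation to be reliable, so it is used here only to show the arithmetic.

```python
from math import sqrt


def runs_moments(n_plus, n_minus):
    """Mean and variance of the run count under H0."""
    if n_plus < 1 or n_minus < 1:
        raise ValueError("both symbols must appear at least once")
    n = n_plus + n_minus
    prod = 2 * n_plus * n_minus
    mu = prod / n + 1
    var = prod * (prod - n) / (n ** 2 * (n - 1))
    return mu, var


def runs_z(runs, n_plus, n_minus):
    """Standardized run count, without continuity correction."""
    mu, var = runs_moments(n_plus, n_minus)
    if var <= 0:
        # Happens when n_plus == n_minus == 1: R is always 2.
        raise ValueError("variance is zero; z is undefined")
    return (runs - mu) / sqrt(var)


mu, var = runs_moments(6, 5)
print(round(mu, 4), round(var, 4))   # 6.4545 2.4298
print(round(runs_z(5, 6, 5), 4))     # -0.9331
```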

A continuity correction is sometimes applied because \(R\) is discrete. The calculator provides the common \(\pm 0.5\) adjustment for the selected tail.
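The text does not spell out how the \(\pm 0.5\) adjustment is applied, so the sketch below uses one common convention as an assumption: add 0.5 to \(R\) for the left tail, subtract 0.5 for the right tail, and shrink \(|R-\mu_R|\) by 0.5 (never past zero) for the two-sided case. The function name and the `alternative` labels are hypothetical, and the calculator may apply the correction differently.

```python
def corrected_z(runs, mu, sigma, alternative="two-sided"):
    """z statistic with a 0.5 continuity correction (convention assumed, see above)."""
    if alternative == "few":
        # Left tail: P(R <= r) is approximated at r + 0.5.
        return (runs + 0.5 - mu) / sigma
    if alternative == "many":
        # Right tail: P(R >= r) is approximated at r - 0.5.
        return (runs - 0.5 - mu) / sigma
    if alternative == "two-sided":
        diff = runs - mu
        shrunk = max(abs(diff) - 0.5, 0.0)   # move R toward mu, but not across it
        return (shrunk if diff >= 0 else -shrunk) / sigma
    raise ValueError("alternative must be 'two-sided', 'few', or 'many'")


# Illustrative numbers only: R = 5, mu = 6.5, sigma = 1.5
print(round(corrected_z(5, 6.5, 1.5, "few"), 4))        # -0.6667
print(round(corrected_z(5, 6.5, 1.5, "many"), 4))       # -1.3333
print(round(corrected_z(5, 6.5, 1.5, "two-sided"), 4))  # -0.6667
```

Whichever convention is used, the correction pulls the statistic toward the null value in the tested direction, which makes the approximate p-value slightly more conservative.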

The p-value is computed using the standard normal CDF \(\Phi(\cdot)\):

\[ \begin{aligned} p\text{-value} &= \begin{cases} 2\left(1-\Phi(|z|)\right), & \text{two-sided}, \\ \Phi(z), & \text{few runs (left-tail)}, \\ 1-\Phi(z), & \text{many runs (right-tail)}. \end{cases} \end{aligned} \]
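The three cases can be evaluated with only the standard library, since \(\Phi(z) = \tfrac{1}{2}\left(1 + \operatorname{erf}(z/\sqrt{2})\right)\). In this sketch the function names and the `alternative` labels (`"two-sided"`, `"few"`, `"many"`) are illustrative choices, not the calculator's identifiers.

```python
from math import erf, sqrt


def normal_cdf(z):
    """Standard normal CDF, Phi(z), via the error function."""
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))


def normal_p_value(z, alternative="two-sided"):
    """p-value of the runs test under the normal approximation."""
    if alternative == "two-sided":
        return 2.0 * (1.0 - normal_cdf(abs(z)))
    if alternative == "few":      # left tail: too few runs
        return normal_cdf(z)
    if alternative == "many":     # right tail: too many runs
        return 1.0 - normal_cdf(z)
    raise ValueError("alternative must be 'two-sided', 'few', or 'many'")


print(round(normal_p_value(-1.96, "two-sided"), 3))  # 0.05
print(round(normal_p_value(-1.96, "few"), 3))        # 0.025
print(round(normal_p_value(-1.96, "many"), 3))       # 0.975
```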

Exact method (small samples)

For smaller \(N\), an exact p-value can be obtained from the exact distribution of \(R\) given \(n_+\) and \(n_-\). Under \(H_0\), all sequences with \(n_+\) pluses and \(n_-\) minuses are equally likely, and the total number of such sequences is

\[ \begin{aligned} \#\{\text{sequences}\} &= \binom{N}{n_+}. \end{aligned} \]

The exact p-value is the probability (under \(H_0\)) of observing a run count as extreme as the observed \(R\), according to the chosen alternative:

\[ \begin{aligned} p\text{-value} &= \frac{\sum_{r \in \mathcal{T}} \#\{\text{sequences with } r \text{ runs}\}} {\binom{N}{n_+}}, \end{aligned} \] \[ \begin{aligned} \mathcal{T} &= \begin{cases} \{r: r \le R\}, & \text{few runs}, \\ \{r: r \ge R\}, & \text{many runs}, \\ \{r: |r-\mu_R| \ge |R-\mu_R|\}, & \text{two-sided}. \end{cases} \end{aligned} \]
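The counts in the numerator have a standard closed form that the text above does not state. For an even run count \(r=2k\) there are \(2\binom{n_+-1}{k-1}\binom{n_--1}{k-1}\) sequences, and for an odd count \(r=2k+1\) there are \(\binom{n_+-1}{k}\binom{n_--1}{k-1}+\binom{n_+-1}{k-1}\binom{n_--1}{k}\). The sketch below applies that formula. The function names are hypothetical, and the small tolerance in the two-sided comparison is an implementation choice to guard against floating-point rounding.

```python
from math import comb


def sequences_with_runs(r, n_plus, n_minus):
    """Number of arrangements of n_plus '+' and n_minus '-' with exactly r runs."""
    if r < 2:
        return 0
    k = r // 2
    if r % 2 == 0:
        # k runs of each symbol; the sequence may start with either one.
        return 2 * comb(n_plus - 1, k - 1) * comb(n_minus - 1, k - 1)
    # Odd r = 2k + 1: one symbol has k + 1 runs, the other has k.
    return (comb(n_plus - 1, k) * comb(n_minus - 1, k - 1)
            + comb(n_plus - 1, k - 1) * comb(n_minus - 1, k))


def exact_p_value(runs, n_plus, n_minus, alternative="two-sided"):
    """Exact p-value of the runs test for fixed n_plus and n_minus (both >= 1)."""
    n = n_plus + n_minus
    mu = 2 * n_plus * n_minus / n + 1
    support = range(2, n + 1)
    if alternative == "few":
        tail = [r for r in support if r <= runs]
    elif alternative == "many":
        tail = [r for r in support if r >= runs]
    elif alternative == "two-sided":
        tail = [r for r in support if abs(r - mu) >= abs(runs - mu) - 1e-12]
    else:
        raise ValueError("alternative must be 'two-sided', 'few', or 'many'")
    favorable = sum(sequences_with_runs(r, n_plus, n_minus) for r in tail)
    return favorable / comb(n, n_plus)


# Earlier example: n_plus = 6, n_minus = 5, R = 5
print(round(exact_p_value(5, 6, 5, "few"), 4))        # 0.2619  (121 / 462)
print(round(exact_p_value(5, 6, 5, "many"), 4))       # 0.8896  (411 / 462)
print(round(exact_p_value(5, 6, 5, "two-sided"), 4))  # 0.5238  (242 / 462)
```

As a sanity check, the counts for \(r = 2,\dots,N\) should sum to \(\binom{N}{n_+}\); for this example they total 462.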

Decision and interpretation

With significance level \(\alpha\):

\[ \begin{aligned} \text{Reject } H_0 &\text{ if } p\text{-value} \le \alpha, \\ \text{otherwise } &\text{fail to reject } H_0. \end{aligned} \]

Practical interpretation:

- Too few runs \(\Rightarrow\) values tend to cluster (long streaks), suggesting non-random grouping.
- Too many runs \(\Rightarrow\) values alternate frequently, suggesting oscillation or over-regularity.
- Near expected runs \(\Rightarrow\) no evidence against randomness based on runs.

Reminder: “Fail to reject” does not prove randomness; it means the run count is not unusually extreme at the chosen \(\alpha\).

Frequently Asked Questions

What is a run in the runs test for randomness?

A run is a maximal consecutive block of identical symbols in a +/− sequence. The test counts how many runs appear and compares that count to what randomness would typically produce given the numbers of pluses and minuses.

How does this calculator convert a numeric series into + and −?

It classifies each value relative to a cutoff c using either the sample median or a user-entered threshold. Values above c become +, values below c become −, and values equal to c can be dropped or forced into + or − based on your setting.

What do “too few runs” and “too many runs” mean?

Too few runs suggests clustering or long streaks of the same symbol, which can indicate non-random grouping. Too many runs suggests frequent alternation, which can indicate oscillation or over-regularity.

When should I use the exact p-value method instead of the normal approximation?

The exact method is most useful for small samples because it uses the exact distribution of the run count given the numbers of + and − symbols. The normal approximation is a common choice for moderate or large samples and can optionally use a continuity correction.

Why does the calculator require both + and − to appear?

The runs test compares patterns of alternation between two categories, so it needs at least one + and one − to define runs and compute the expected run behavior under randomness.
