Loading…

Law of Large Numbers (Meaning, Conditions, and Sample Mean Convergence)

What is the law of large numbers, and what does it guarantee about the behavior of the sample mean (or sample proportion) as the sample size increases?

Subject: Statistics Chapter: Sampling Distributions Topic: Population and Sampling Distributions Answer included
law of large numbers weak law of large numbers strong law of large numbers sample mean sample proportion convergence in probability almost sure convergence expectation
Accepted answer Answer included

Law of large numbers

The law of large numbers is a foundational result connecting population parameters to sample statistics. It formalizes the idea that averages computed from repeated sampling stabilize around the underlying expected value. In sampling-distribution language, it describes the concentration of the sampling distribution of the sample mean \(\bar{X}\) (and, in Bernoulli settings, the sample proportion \(\hat{p}\)) around the corresponding population parameter as the sample size grows.

With independent, identically distributed observations having a finite mean \(\mu\), the sample mean \(\bar{X}_n\) approaches \(\mu\) as \(n\) increases. Random fluctuation remains present, but large sustained departures from \(\mu\) become increasingly unlikely.

Mathematical statement

Let \(X_1, X_2, \dots\) be independent and identically distributed (i.i.d.) random variables with finite expectation \(E[X_i]=\mu\). Define the sample mean \[ \bar{X}_n=\frac{1}{n}\sum_{i=1}^{n} X_i \] The law of large numbers asserts that \(\bar{X}_n\) converges to \(\mu\) as \(n\to\infty\) in a precise probabilistic sense.

Weak and strong forms

Form Convergence statement Meaning in probability language
Weak law of large numbers (WLLN) \(\bar{X}_n \to \mu\) in probability For every \(\varepsilon>0\), \(P\!\left(\lvert \bar{X}_n-\mu\rvert>\varepsilon\right)\to 0\). Large deviations become rare as \(n\) increases.
Strong law of large numbers (SLLN) \(\bar{X}_n \to \mu\) almost surely With probability 1, a single infinite sequence of averages eventually stays arbitrarily close to \(\mu\). Randomness remains, but persistent divergence has probability 0.

Both forms express stabilization of averages. The strong form is a stronger mode of convergence than the weak form. Standard sufficient conditions include i.i.d. sampling with finite \(E[\lvert X\rvert]\) for the strong law and finite variance (or other moment conditions) for many common weak-law proofs.

Bernoulli trials and sample proportion

A central special case uses Bernoulli random variables. Let \(X_i\in\{0,1\}\) with \(P(X_i=1)=p\). The sample proportion of successes is \[ \hat{p}_n=\bar{X}_n=\frac{1}{n}\sum_{i=1}^{n}X_i \] The law of large numbers implies \(\hat{p}_n\to p\), meaning relative frequencies stabilize around the underlying probability \(p\) as the number of trials grows.

Visualization of convergence of a running mean

Law of large numbers: running mean stabilizes near the expected value Example run: Bernoulli trials with p = 0.5, running mean from n = 1 to 200 1.00 0.75 0.50 0.25 0.00 running mean 1 25 50 100 150 200 n μ = 0.50 ± 0.05 band n = 200 running mean expected value μ ±0.05 band
The blue curve shows one realized running mean \(\bar{X}_n\) for Bernoulli trials with \(p=0.5\). Early averages fluctuate widely because each new observation has a large influence. As \(n\) grows, changes in \(\bar{X}_n\) become smaller, and the curve tends to stay near the horizontal reference line at \(\mu=0.50\).

Finite-sample accuracy statements

The law of large numbers is an asymptotic guarantee; finite-sample behavior is described by probability bounds. Under i.i.d. sampling with finite variance \(\sigma^2=\operatorname{Var}(X)\), Chebyshev’s inequality implies \[ P\!\left(\lvert \bar{X}_n-\mu\rvert \ge \varepsilon\right)\le \frac{\sigma^2}{n\varepsilon^2} \] This bound exhibits the characteristic \(1/n\) shrinkage in tail probability for fixed \(\varepsilon\).

For bounded variables such as Bernoulli trials, sharper exponential bounds are available. For \(X_i\in[0,1]\) with \(E[X_i]=p\), Hoeffding’s inequality gives \[ P\!\left(\lvert \hat{p}_n-p\rvert \ge \varepsilon\right)\le 2e^{-2n\varepsilon^2} \] which quantifies how quickly relative frequency concentrates around \(p\).

Common misconceptions and pitfalls

Law of large numbers versus central limit theorem

The law of large numbers concerns convergence of the sample mean to \(\mu\). The central limit theorem concerns the approximate normal shape of the standardized sampling distribution of \(\bar{X}_n\) for large \(n\). Stabilization of \(\bar{X}_n\) and normal approximation address different questions.

Randomness does not disappear

Large samples reduce relative fluctuation of averages, but individual outcomes remain random. Long runs or short-term imbalance remain possible even when the long-run proportion converges.

Condition failures

Independence and identical distribution are structural assumptions in common formulations. Heavy-tailed models with infinite mean, strong dependence, or changing distributions can violate the usual conclusions for \(\bar{X}_n\).

Vote on the accepted answer
Upvotes: 0 Downvotes: 0 Score: 0
Community answers No approved answers yet

No approved community answers are published yet. You can submit one below.

Submit your answer Moderated before publishing

Plain text only. Your name is required. Links, HTML, and scripts are blocked.

Fresh

Most recent questions

109 questions · Sorted by newest first

Showing 1–10 of 109
per page
  1. Mar 5, 2026 Published
    Formula of the Variance (Population and Sample)
    Statistics Numerical Descriptive Measures Measures of Dispersion for Ungrouped Data
  2. Mar 5, 2026 Published
    Mean Median Mode Calculator (Formulas, Interpretation, and Example)
    Statistics Numerical Descriptive Measures Measures of Central Tendency for Ungrouped Data
  3. Mar 4, 2026 Published
    How to Calculate Standard Deviation in Excel (STDEV.S vs STDEV.P)
    Statistics Numerical Descriptive Measures Measures of Dispersion for Ungrouped Data
  4. Mar 4, 2026 Published
    Suppose T and Z Are Random Variables: How T Relates to Z in the t Distribution
    Statistics Estimation of the Mean and Proportion Estimation of a Population Mean σ Not Known the T Distribution
  5. Mar 4, 2026 Published
    What Does R Squared Mean in Statistics (Coefficient of Determination)
    Statistics Simple Linear Regression Coefficient of Determination
  6. Mar 3, 2026 Published
    Box and Plot Graph (Box Plot) Explained
    Statistics Numerical Descriptive Measures Box and Whisker Plot
  7. Mar 3, 2026 Published
    How to Calculate a Z Score
    Statistics Continuous Random Variables and the Normal Distribution Standardizing a Normal Distribution
  8. Mar 3, 2026 Published
    How to Calculate Relative Frequency
    Statistics Organizing and Graphing Data Organizing and Graphing Quantitative Data
  9. Mar 3, 2026 Published
    Is zero an even number?
    Statistics Numerical Descriptive Measures Measures of Central Tendency for Ungrouped Data
  10. Mar 3, 2026 Published
    Monty Hall Paradox (Conditional Probability Explained)
    Statistics Probability Marginal and Conditional Probabilities
Showing 1–10 of 109
Open the calculator for this topic