Population and Sampling Distributions

Statistics • Sampling Distributions

Written by STEM Calculators Team Published December 20, 2025 Updated February 24, 2026

Population data (paste numbers or CSV)

Accepted separators: commas, semicolons, tabs, spaces, or new lines. If you paste a CSV with a header row, it will be ignored automatically.

Import CSV

Sample size (n)

Sampling distribution will be built for the sample mean x̄.

Sampling method

Without replacement uses the finite population correction for the theoretical standard error.

Enumeration limit (max samples to list)

If the number of possible samples is huge, the calculator will switch to simulation automatically.

Ready

Paste a population, choose a sample size, then click Calculate.

Rate this calculator

0.0 /5 (0 ratings)

Be the first to rate.

Your rating

Name (optional) Review (optional)

You can update your rating any time.

Population and Sampling Distributions

This topic connects two ideas: (1) a probability model for the entire population (population distribution), and (2) the probability model for a statistic computed from repeated samples (sampling distribution).

Population distribution

A population distribution is the probability distribution of the population data. When the population is finite and fully known, probabilities are often built from relative frequencies.

If a value x appears f times in a population of size N, then:

\[ P(x) = \frac{f}{N}, \qquad \sum P(x) = 1 \]

Population mean and population variance are computed directly from the distribution:

\[ \mu = \sum x \cdot P(x), \qquad \sigma^{2} = \sum (x-\mu)^{2} \cdot P(x), \qquad \sigma = \sqrt{\sigma^{2}} \]

Value (x)	Frequency (f)	Relative frequency	Probability P(x)
70	1	1/5	0.20
78	1	1/5	0.20
80	2	2/5	0.40
95	1	1/5	0.20
N = 5		Σ P(x) = 1.00

Sampling distribution of the sample mean x̄

A sampling distribution describes how a statistic varies across repeated samples of the same size drawn from the same population. For this topic, the statistic is the sample mean:

\[ \bar{x} = \frac{x_{1}+x_{2}+\cdots+x_{n}}{n} \]

The sampling distribution of x̄ lists the possible values that x̄ can take and the probability of each value. In a finite population, you can build it by:

Listing all possible samples of size n (exact when feasible),
Computing x̄ for each sample,
Converting frequencies of x̄ values into probabilities.

Number of possible samples depends on the sampling method:

\[ \text{Without replacement: } \binom{N}{n}, \qquad \text{With replacement: } N^{n} \]

Key results for x̄ (mean and standard error)

The sampling distribution has its own mean and spread. Two results are especially important:

Mean of x̄ matches the population mean: \[ E(\bar{x}) = \mu \]
Standard error of x̄ measures typical sampling-to-sampling variation. \[ SE(\bar{x}) = \frac{\sigma}{\sqrt{n}} \quad \text{(with replacement)} \] \[ SE(\bar{x}) = \frac{\sigma}{\sqrt{n}} \cdot \sqrt{\frac{N-n}{N-1}} \quad \text{(without replacement)} \]

In practice, if the exact list of all samples is too large, a simulation (many random samples) provides a good approximation to the sampling distribution.

Probability properties used

Bounds: 0 ≤ P(·) ≤ 1
Total probability: Σ P(·) = 1
Expected value: \(E(X)=\sum x \cdot P(x)\)
Variance and standard deviation: \(Var(X)=\sum (x-E(X))^{2}\cdot P(x)\), \(SD(X)=\sqrt{Var(X)}\)

Tip: Use the calculator to paste your population (or import CSV), then compare the exact sampling distribution (when feasible) to a simulated one. The two should align closely when the number of simulation trials is large.

Frequently Asked Questions

What is the difference between a population distribution and a sampling distribution?

A population distribution describes probabilities for values in the full population (often from relative frequencies in a finite list). A sampling distribution describes how a statistic such as the sample mean (xbar) varies across repeated samples of the same size drawn from that population.

How is the sampling distribution of the sample mean (xbar) built in this calculator?

When possible, the calculator enumerates all possible samples of size n, computes xbar for each sample, and converts frequencies of xbar values into probabilities. If the number of possible samples is too large, it switches to simulation with many random samples.

How do you compute the standard error of the sample mean?

With replacement, SE(xbar) = sigma / sqrt(n). Without replacement from a finite population, SE(xbar) = (sigma / sqrt(n)) x sqrt((N - n) / (N - 1)), which applies the finite population correction.

Why does the calculator switch to simulation instead of listing every sample?

The number of possible samples grows very quickly (for example, N choose n without replacement or N^n with replacement). When exact enumeration would be too large to compute or list, simulation provides a practical approximation.

When should I use sampling without replacement versus with replacement?

Use without replacement when you sample from a finite set and you cannot select the same population element more than once in a sample. Use with replacement when repeated selection of the same element is allowed or when modeling independent draws from the same finite population values.

Population and Sampling Distributions

Calculation steps

Visualization

Rate this calculator

Frequently Asked Questions

Calculation steps

Visualization

Rate this calculator

Frequently Asked Questions

Related calculators

Questions related to this topic