How should I interpret a high standard deviation?

A high standard deviation means the observations are spread farther from the mean on average. Whether that spread is acceptable depends on the context: wide dispersion might signal risk in finance, instability in manufacturing, or genuine natural variation in scientific data.

Why do some articles mention n while others mention n-1?

The denominator reflects the difference between population and sample formulas. Population variance and population standard deviation use N because the full dataset is known. Sample variance and sample standard deviation often use n-1 because Bessel’s correction reduces bias when estimating population spread from a sample.

What is a statistical interpretation guide?

A statistical interpretation guide is a page that moves beyond arithmetic and explains meaning. It tells you what a metric is, when the formula applies, and how to describe the result in plain English without overstating certainty.

Can I cite this article in a report?

You should cite the underlying authoritative reference for formal work whenever possible. This page is best used as an explanatory bridge that helps you understand the concept before quoting the original standard or handbook.

Why include direct citations on every article page?

Direct citations give readers a route to verify the definition, notation, and assumptions. That improves trust and reduces the chance that a simplified explanation is mistaken for the entire technical standard.

Three Sigma Rule in Statistics: Formula, Example, and Limits

Quick Answer

The three sigma rule says that, for an approximately normal distribution, almost all routine values should fall within three standard deviations of the mean. In formula form, the usual band is mean - 3s to mean + 3s for sample data, or mu - 3sigma to mu + 3sigma for a known population.

Sample three sigma band

lower = xbar - 3s; upper = xbar + 3s

Under a normal model, about 99.73% of observations fall inside that band and about 0.27% fall outside it. That does not make every outside value an error. It makes the value unusual enough to investigate.

Use the Standard Deviation Calculator, Sample Standard Deviation Calculator, or Descriptive Statistics Calculator to compute the mean and standard deviation first. Use the Z-Score Calculator when you already know the mean, standard deviation, and value being checked.

Scenario: Applying the Rule

A student in a statistics course or an analyst on a quality team often has the same problem: the standard deviation has been calculated, but the next question is whether a new observation is routine variation or a signal.

As a statistics educator, I frame the three sigma rule as a screening rule, not a verdict. The objective is to create a defensible review band for a specific process: measurements inside the band are treated as expected variation, while measurements outside the band are checked for data entry mistakes, special causes, instrument drift, or a real process shift.

Key result

If the process is stable and roughly normal, a value with |z| greater than 3 is a practical investigation signal. If the data are skewed, heavy-tailed, seasonal, or already contaminated by extreme values, use the rule only as a rough screen and compare it with Empirical Rule vs Chebyshev's Theorem or Modified Z-Score Outlier Detection.

Formula

Standardized distance

z = (x - xbar) / s

Here, x is the observation being checked, xbar is the sample mean, and s is the sample standard deviation. A z-score of 3.0 means the observation is three sample standard deviations above the mean. A z-score of -3.0 means it is three sample standard deviations below the mean.

Calculate the baseline mean

Use historical data from a stable process. Do not include the suspicious new value if you are asking whether that value is unusual relative to the baseline.

Calculate the baseline standard deviation

Use sample standard deviation unless the baseline is the full population you intend to describe. See Sample vs. Population: Which Formula to Use? if the denominator choice is unclear.

Build the three sigma band

Compute xbar - 3s and xbar + 3s. Values outside the band get reviewed.

Check the distribution shape

The normal assumption matters. If the histogram is skewed or has long tails, the normal 99.73% statement is not reliable.

Worked Example: Fill Weights

A packaging analyst checks whether new bottle fill weights are consistent with a stable line. The recent baseline sample, in grams, is: `499.4, 499.6, 499.8, 499.9, 500.0, 500.1, 500.2, 500.3, 500.4, 500.5, 500.7, 500.9`.

For those 12 baseline fills, the sample mean is 500.150 g and the sample standard deviation is 0.442 g. The three sigma limits are `500.150 - 3(0.442) = 498.824 g` and `500.150 + 3(0.442) = 501.476 g`.

New fill weight	z-score	Inside 3 sigma band?	Decision
499.1 g	-2.38	Yes	Routine variation under this rule
501.7 g	3.51	No	Investigate the high fill
498.8 g	-3.05	No	Investigate the low fill

The 501.7 g bottle is above the upper limit by 0.224 g. That is a statistical signal, but the next action is operational: check whether the filler nozzle was adjusted, whether the scale was calibrated, whether the product temperature changed, and whether nearby bottles show the same pattern.

First-hand calculation note

When I teach this example, I keep the suspected values out of the baseline calculation. If 501.7 g is included before computing the standard deviation, the standard deviation inflates and the very value being tested helps hide itself. That is a common spreadsheet mistake in outlier screens.

Probability Table

The three sigma rule is a normal-distribution rule. The NIST normal-distribution reference gives the standard normal model behind these percentages. The exact values below are useful when deciding whether you need a quick screen or a formal probability calculation.

Band around the mean	Approximate share inside	Approximate share outside	Use
mean +/- 1s	68.27%	31.73%	Describe ordinary spread
mean +/- 2s	95.45%	4.55%	Create an early warning screen
mean +/- 3s	99.73%	0.27%	Flag rare observations for review

For a one-sided question, split the outside probability across the two tails. Under the normal model, values above mean + 3s account for about 0.135% of observations, and values below mean - 3s account for about 0.135%.

Decision Criteria

Use it:The baseline process is stable, the distribution is roughly bell-shaped, and the cost of reviewing a small number of signals is acceptable.
Use control charts:The data arrive over time and you need to distinguish routine variation from special-cause variation. Start with [Control Charts and Process Control](/learn/control-charts).
Use a z-score:You need to report how far a single value is from the mean in standard deviation units. See [Z-Score Explained](/learn/z-score-explained).
Use a robust method:The data are skewed, heavy-tailed, or already include extreme values. Compare [Robust Statistics](/learn/robust-statistics) and [Modified Z-Score Outlier Detection](/learn/modified-z-score-outlier-detection).
Use a tolerance limit:A customer, regulator, or engineering specification already defines the acceptable range. A three sigma signal can support investigation, but it does not replace the requirement.

Do not confuse a three sigma band with a specification limit

A three sigma band describes observed process variation. A specification limit describes what is acceptable. A process can be statistically stable and still fail a customer specification if the whole band sits too close to the limit.

Checklist

Is the baseline data from the same process, instrument, population, and time period as the value being checked?
Was the suspicious value kept out of the baseline mean and standard deviation calculation?
Does a histogram or normal probability plot look roughly normal?
Are there enough observations for the standard deviation to be stable?
Is the decision to investigate, quarantine, delete, or report the value defined before seeing the result?
Would a domain limit, control chart, or robust statistic answer the practical question better?

Pre-publish quality check

Yes: this article includes a real worked example with numbers. Yes: it uses H2 sections, a table, steps, and a checklist. Yes: it goes beyond a generic definition by separating normal probability, baseline construction, process-control use, and misuse cases.

Weakest Section Rewrite

Weak version: "The three sigma rule can identify unusual values."

Concrete substitution: "Use the three sigma rule when a stable fill-weight process has a baseline mean of 500.150 g and a sample standard deviation of 0.442 g. The review band is 498.824 g to 501.476 g, so a new 501.7 g bottle is not automatically bad product, but it is far enough from the baseline to check the scale, filler setting, and adjacent bottles."

Sources

References and further authoritative reading used in preparing this article.

← Центр обучения

Reading goal	What to focus on	Common mistake
Definition	What the metric is and what quantity it summarizes	Treating the formula as self-explanatory
Formula choice	Sample versus population assumptions and notation	Using n when n-1 is required or vice versa
Interpretation	Whether the result indicates concentration, spread, or risk	Calling a large value good or bad without context