Overview
One of the most common questions in statistics is: "Should I divide by n or n-1?" The answer depends on whether you're working with an entire population or just a sample.
Population (N)
Sample (n-1)
Population Standard Deviation (σ)
Population standard deviation is used when you have measurements from every single member of the group you're analyzing. This is relatively rare in practice.
Examples of True Populations:
- All 50 employees in a small company
- Every student in a specific class of 30
- All transactions in a closed fiscal year
- Complete census data for a country
Sample Standard Deviation (s)
Sample standard deviation is used when you're working with a subset of a larger population. This is the more common scenario in real-world analysis.
Examples of Samples:
- Surveying 1,000 voters to predict election results
- Testing 50 products from a production batch of 10,000
- Measuring blood pressure of 200 patients in a clinical study
- Analyzing 5 years of stock data to predict future volatility
Bessel's Correction Explained
Bessel's correction is the reason we use (n-1) instead of n when calculating sample standard deviation. Named after German mathematician Friedrich Bessel, this adjustment produces an unbiased estimate of the population variance.
Why (n-1) Works
Mathematical Intuition
Sample data points tend to cluster closer to the sample mean than to the true population mean. This causes the sum of squared deviations to be systematically smaller than it should be.
Dividing by (n-1) instead of n inflates the result slightly, compensating for this underestimation and producing an unbiased estimate.
When to Use Each
| Scenario | Use | Divide By |
|---|---|---|
| You have all data points in existence | Population SD (σ) | N |
| You're describing only the data you have | Population SD (σ) | N |
| You're estimating for a larger population | Sample SD (s) | n-1 |
| You'll use SD for inferential statistics | Sample SD (s) | n-1 |
Rule of Thumb