Skewness and Kurtosis: Beyond Standard Deviation

Beyond Mean and Standard Deviation

While mean and standard deviation describe center and spread, skewness and kurtosis describe the shape of distributions—asymmetry and tail heaviness.

In statistics, we describe distributions using "moments"—mathematical summaries that capture different aspects of shape:

1st moment:Mean (central tendency)
2nd moment:Variance/Standard Deviation (spread)
3rd moment:Skewness (asymmetry)
4th moment:Kurtosis (tail heaviness)

Two distributions can have identical means and standard deviations yet look completely different. Skewness and kurtosis capture these differences, providing a more complete picture of your data's distribution.

Skewness: Measuring Asymmetry

Skewness measures how asymmetric a distribution is. Positive skew means a longer right tail (e.g., income distributions), while negative skew means a longer left tail.

Sample Skewness

g₁ = [n/((n-1)(n-2))] × Σ[(xᵢ - x̄)/s]³

Skewness = 0:Symmetric distribution (normal, uniform)
Skewness > 0:Right-skewed—mean exceeds median (income, housing prices)
Skewness < 0:Left-skewed—median exceeds mean (age at retirement, exam scores with a ceiling)

Common Right-Skewed Data

Many real-world phenomena are right-skewed: income, wealth, company sizes, city populations, insurance claims, and waiting times. In these cases, the mean is pulled higher by extreme values, making median a better measure of "typical."

Interpretation guidelines:

|Skewness| < 0.5: Approximately symmetric
0.5 ≤ |Skewness| < 1: Moderately skewed
|Skewness| ≥ 1: Highly skewed

Kurtosis: Tail Heaviness

Kurtosis measures how heavy or light the tails are compared to a normal distribution. High kurtosis means more extreme values (fat tails), low kurtosis means fewer.

A common misconception is that kurtosis measures "peakedness." While related, kurtosis is fundamentally about tails. A distribution with high kurtosis has more probability mass in the tails and at the peak, but less in the "shoulders."

Excess Kurtosis

g₂ = [n(n+1)/((n-1)(n-2)(n-3))] × Σ[(xᵢ - x̄)/s]⁴ - 3(n-1)²/((n-2)(n-3))

Mesokurtic (k ≈ 0):Normal-like tails (baseline for comparison)
Leptokurtic (k > 0):Fat tails, more extreme values than normal (stock returns, earthquakes)
Platykurtic (k < 0):Thin tails, fewer extremes than normal (uniform distribution, bounded data)

Fat Tails in Finance

Financial returns famously exhibit high kurtosis ("fat tails"). Events that should be once-in-a-century based on normal distribution assumptions occur far more frequently. Ignoring kurtosis leads to underestimating risk—a lesson from many financial crises.

Practical Applications

Risk Management: High kurtosis means more frequent extreme outcomes. VaR and other risk measures that assume normality may drastically underestimate true risk when kurtosis is high.

Quality Control: Manufacturing data with high kurtosis suggests occasional extreme deviations from target, even if average performance is acceptable. This pattern may indicate process instability requiring investigation.

Data Transformation: Highly skewed data may benefit from transformation (log, square root) before analysis. The goal is often to achieve approximate normality for statistical tests that assume it.

Statistical Testing: Many tests assume normality. Significant skewness or kurtosis may indicate this assumption is violated, suggesting use of non-parametric alternatives or robust methods.

Interpretation Guidelines

Normality Testing: The Jarque-Bera test combines skewness and kurtosis to test for normality. It rejects normality when either metric deviates significantly from zero.

Sample Size Considerations: Small samples produce unreliable skewness and kurtosis estimates. With n < 50, these statistics have high sampling variability. With n < 20, they're essentially meaningless.

Robustness: Both skewness and kurtosis are sensitive to outliers. A single extreme value can dramatically affect these statistics, so always visualize your data alongside numerical summaries.

Sources

References and further authoritative reading used in preparing this article.

← Learning Center