Beyond Mean and Standard Deviation
While mean and standard deviation describe center and spread, skewness and kurtosis describe the shape of distributions—asymmetry and tail heaviness.
In statistics, we describe distributions using "moments"—mathematical summaries that capture different aspects of shape:
- 1st moment:Mean (central tendency)
- 2nd moment:Variance/Standard Deviation (spread)
- 3rd moment:Skewness (asymmetry)
- 4th moment:Kurtosis (tail heaviness)
Two distributions can have identical means and standard deviations yet look completely different. Skewness and kurtosis capture these differences, providing a more complete picture of your data's distribution.
Skewness: Measuring Asymmetry
Skewness measures how asymmetric a distribution is. Positive skew means a longer right tail (e.g., income distributions), while negative skew means a longer left tail.
Sample Skewness
- Skewness = 0:Symmetric distribution (normal, uniform)
- Skewness > 0:Right-skewed—mean exceeds median (income, housing prices)
- Skewness < 0:Left-skewed—median exceeds mean (age at retirement, exam scores with a ceiling)
Common Right-Skewed Data
Interpretation guidelines:
- |Skewness| < 0.5: Approximately symmetric
- 0.5 ≤ |Skewness| < 1: Moderately skewed
- |Skewness| ≥ 1: Highly skewed
Kurtosis: Tail Heaviness
Kurtosis measures how heavy or light the tails are compared to a normal distribution. High kurtosis means more extreme values (fat tails), low kurtosis means fewer.
A common misconception is that kurtosis measures "peakedness." While related, kurtosis is fundamentally about tails. A distribution with high kurtosis has more probability mass in the tails and at the peak, but less in the "shoulders."
Excess Kurtosis
- Mesokurtic (k ≈ 0):Normal-like tails (baseline for comparison)
- Leptokurtic (k > 0):Fat tails, more extreme values than normal (stock returns, earthquakes)
- Platykurtic (k < 0):Thin tails, fewer extremes than normal (uniform distribution, bounded data)
Fat Tails in Finance
Practical Applications
Risk Management: High kurtosis means more frequent extreme outcomes. VaR and other risk measures that assume normality may drastically underestimate true risk when kurtosis is high.
Quality Control: Manufacturing data with high kurtosis suggests occasional extreme deviations from target, even if average performance is acceptable. This pattern may indicate process instability requiring investigation.
Data Transformation: Highly skewed data may benefit from transformation (log, square root) before analysis. The goal is often to achieve approximate normality for statistical tests that assume it.
Statistical Testing: Many tests assume normality. Significant skewness or kurtosis may indicate this assumption is violated, suggesting use of non-parametric alternatives or robust methods.
Interpretation Guidelines
Normality Testing: The Jarque-Bera test combines skewness and kurtosis to test for normality. It rejects normality when either metric deviates significantly from zero.
Sample Size Considerations: Small samples produce unreliable skewness and kurtosis estimates. With n < 50, these statistics have high sampling variability. With n < 20, they're essentially meaningless.
Robustness: Both skewness and kurtosis are sensitive to outliers. A single extreme value can dramatically affect these statistics, so always visualize your data alongside numerical summaries.