進階進階主題·14 min

合併標準差：多組資料的整合方法

學習如何計算合併標準差，用於在 t 檢定和變異數分析 (ANOVA) 中整合多組資料。

什麼是合併標準差？

合併標準差將兩組或多組資料的變異數估計結合起來，得到一個單一的加權估計值。它在假設變異數相等的雙樣本 t 檢定中不可或缺。

概念很直觀：如果我們相信兩組資料來自具有相同內在變異性的母體，我們就可以合併它們的資料來獲得該共同變異性的更好估計。更多的資料意味著更精確的估計。

可以這樣想：如果你有 A 組 20 個觀測值和 B 組 30 個觀測值，且兩組有相同的真實變異數，你現在有 50 個觀測值來估計那個變異數，而不是從較小的樣本中分別估計。

何時合併

只有當你有理由相信各母體的變異數相等時，才應該合併標準差。合併之前，請先使用 Levene 檢定或 F 檢定來檢驗這個假設。

合併標準差公式

兩組資料的合併標準差為：

雙組合併標準差

sp = √[((n₁-1)s₁² + (n₂-1)s₂²) / (n₁+n₂-2)]

其中 n₁ 和 n₂ 是樣本數，s₁ 和 s₂ 是樣本標準差。

對於 k 組資料（如 ANOVA），公式推廣為：

多組合併標準差

sp = √[Σ(nᵢ-1)sᵢ² / Σ(nᵢ-1)]

注意公式中分子和分母都使用了 (n-1) 項。這種加權確保較大的樣本對合併估計值貢獻更多，因為較大的樣本提供更可靠的變異數估計。

基本假設

合併標準差假設變異數同質性——即所有組別共享相同的母體變異數。這個假設在以下情況最為重要：

樣本數不等（特別是當較大組別的變異數較小時問題更大）
最大與最小變異數的比值超過 2-3 倍
樣本數較小（大樣本對違反假設的情況更穩健）

當變異數不等時

如果變異數不等，請使用 Welch t 檢定而非合併 t 檢定，或使用各組自己的變異數估計。Welch 檢定不假設變異數相等，通常被建議作為預設方法。

計算範例

情境： 比較兩個班級的考試成績：

A 班：n₁ = 25，平均數 = 78，s₁ = 12
B 班：n₂ = 30，平均數 = 82，s₂ = 14

合併標準差計算：

sp = √[((25-1)(12)² + (30-1)(14)²) / (25+30-2)] sp = √[(24×144 + 29×196) / 53] sp = √[(3456 + 5684) / 53] sp = √[9140 / 53] = √172.45 = 13.13

合併標準差 13.13 落在兩組個別標準差（12 和 14）之間，且偏向樣本數較大的一方。這個合併值接下來會用於 t 檢定公式或 Cohen's d 的計算。

統計應用

獨立樣本 t 檢定： 合併標準差用於計算兩組平均數差異的標準誤差。
Cohen's d 效果量： 效果量用合併標準差來標準化：d = (M₁ - M₂) / sp
變異數分析 (ANOVA)： ANOVA 中的均方誤差 (MSE) 本質上就是所有組別的合併變異數估計。
統合分析： 合併多項研究的結果時，合併估計有助於在不同情境中標準化效果量。

How to Read This Article

A statistics tutorial is a practical interpretation guide, not just a formula dump. It refers to the assumptions, notation, and reporting language that analysts need when they explain a result to a teacher, manager, client, or reviewer. The article body covers the specific topic, while the sections below create a common interpretation frame that readers can reuse across related metrics.

Reading goal	What to focus on	Common mistake
Definition	What the metric is and what quantity it summarizes	Treating the formula as self-explanatory
Formula choice	Sample versus population assumptions and notation	Using n when n-1 is required or vice versa
Interpretation	Whether the result indicates concentration, spread, or risk	Calling a large value good or bad without context

Frequently Asked Questions

How should I interpret a high standard deviation?

A high standard deviation means the observations are spread farther from the mean on average. Whether that spread is acceptable depends on the context: wide dispersion might signal risk in finance, instability in manufacturing, or genuine natural variation in scientific data.

Why do some articles mention n while others mention n-1?

The denominator reflects the difference between population and sample formulas. Population variance and population standard deviation use N because the full dataset is known. Sample variance and sample standard deviation often use n-1 because Bessel’s correction reduces bias when estimating population spread from a sample.

What is a statistical interpretation guide?

A statistical interpretation guide is a page that moves beyond arithmetic and explains meaning. It tells you what a metric is, when the formula applies, and how to describe the result in plain English without overstating certainty.

Can I cite this article in a report?

You should cite the underlying authoritative reference for formal work whenever possible. This page is best used as an explanatory bridge that helps you understand the concept before quoting the original standard or handbook.

Why include direct citations on every article page?

Direct citations give readers a route to verify the definition, notation, and assumptions. That improves trust and reduces the chance that a simplified explanation is mistaken for the entire technical standard.

Authoritative References

These sources define the concepts referenced most often across our articles. Bessel's correction is a sample adjustment, variance is a squared measure of spread, and standard deviation is the square root of variance expressed in the same units as the data.