Lean guide for computing and interpreting standard scores (Z) so you can compare values across normal (or approximately normal) distributions and assess rarity, percentiles, and decision thresholds.
1. Core Formulas
Population: Z = (X − μ) / σ
Sample: Z = (X − x̄) / s (use only as an approximation; small n ⇒ prefer t)
Reverse (raw from Z): X = μ + Z·σ
One‑sample Z test: Z = (x̄ − μ₀) / (σ / √n)
Two means (known σ's): Z = (x̄₁ − x̄₂) / √(σ₁²/n₁ + σ₂²/n₂)
Percentile ≈ Φ(Z); Tail prob ≈ 1 − Φ(Z) (use normal CDF)
2. Quick Decision Bands (Reference Only)
|Z| < 1 ⇒ Typical (≈ 16th–84th pct)
1 ≤ |Z| < 2 ⇒ Notable but common (≈ 84th–97.5th or 2.5th–16th)
Negative Z? Value is below the mean; magnitude still counts distance.
When not to use Z? Highly skewed data with small n; ordinal scales; unknown σ with tiny sample.
Z vs t? t replaces Z when σ unknown & n small; converges to Z as df grows.
Can I compare different tests? Yes if each test’s distribution is (approx) normal & standardized separately.
9. Action Tip
Before interpreting an extreme Z (|Z| ≥ 3), re‑check data entry, unit consistency, and distribution shape—half of “signals” at this level in applied settings are data quality artifacts, not true phenomena.