What is the geometric mean?

The geometric mean of n values is the nth root of their product. It is the correct average when values compound multiplicatively, such as investment returns or growth rates.

When should I use geometric mean instead of arithmetic mean?

Use geometric mean whenever values multiply together to produce a result — compound interest, CAGR, population growth, viral R₀. For additive quantities like temperatures or test scores, use arithmetic mean.

Why does a +50% gain followed by a −50% loss not break even?

Because gains and losses compound asymmetrically: 1.5 × 0.5 = 0.75, a 25% net loss. The geometric mean of [1.5, 0.5] is √0.75 ≈ 0.866, reflecting the actual −13.4% annualised loss, whereas the arithmetic mean of 1.0 implies no change.

How do I compute geometric mean without floating-point overflow?

Use exp(mean(log(x))) instead of computing the raw product. This transforms multiplication into addition in log space, avoiding overflow or underflow for large datasets.

Glossary

Geometric mean

The right mean for compounding rates

By Buğra SözeriPublished May 16, 2026Updated May 31, 2026

The geometric mean is the nth root of the product of n values: GM = (x₁ × x₂ × ... × xₙ)^(1/n). Unlike the arithmetic mean, the geometric mean is the right tool for averaging ratios, percentages, and compounding growth rates.

Why it matters: imagine an investment that returns +50% in year 1 and −50% in year 2. The arithmetic mean of [1.5, 0.5] is 1.0 — suggesting the investment broke even. But $100 invested becomes $150 after year 1 and $75 after year 2 — a 25% loss. The geometric mean of [1.5, 0.5] is √(1.5 × 0.5) = √0.75 ≈ 0.866, correctly indicating a 13.4% annualised loss.

Standard use cases:

Compound annual growth rate (CAGR). The geometric mean of the year-over-year ratios.
Index numbers. Stock indices weighted by ratios use geometric means to avoid arithmetic-mean distortion.
Ratio-of-ratios comparisons. When the inputs are multiplicative rather than additive (gains compounding, dosing).

The geometric mean is always ≤ the arithmetic mean. Equality holds only when all values are identical. For widely varying values, the gap can be large — which is precisely why using the right one matters.

Worked example

You invest $10,000 in a fund. Returns: year 1 +30%, year 2 −20%, year 3 +25%, year 4 −10%, year 5 +15%. Arithmetic mean of [30, −20, 25, −10, 15] = 8% — what the marketing brochure would quote. Now compound the actual money: 10,000 × 1.30 × 0.80 × 1.25 × 0.90 × 1.15 = 10,000 × 1.3455 = $13,455. Total gain 34.55% over five years. The geometric mean: (1.30 × 0.80 × 1.25 × 0.90 × 1.15)^(1/5) = 1.3455^0.2 ≈ 1.0612 — a 6.12% annualised return. That’s the “CAGR” the SEC requires US mutual funds to report instead of the arithmetic mean, because reporting 8% would imply 1.08⁵ × 10,000 = $14,693 — overstating actual performance by 9%. The mismatch grows with return volatility: two assets with the same arithmetic-mean return can have very different CAGR if one is more volatile.

One more illustrative case: an algorithmic-trading strategy backtest reports returns of [+40%, +30%, −50%, +20%, +10%] across five years. Arithmetic mean: 10% per year. Geometric mean: (1.40 × 1.30 × 0.50 × 1.20 × 1.10)^(1/5) ≈ 1.2012^0.2 ≈ 1.037 — only 3.7% annualised. The −50% year dominates everything; a single deep drawdown crushes the long-run return regardless of subsequent gains. This is why hedge-fund disclosures emphasise “maximum drawdown” alongside annualised returns.

When and why it matters

Geometric mean matters in any context where the numbers compound: investment returns, biological growth rates, viral spreading (R₀), and depreciation. The arithmetic-mean error is largest when individual values vary widely — symmetric ±50% returns produce a net loss because gains and losses compound asymmetrically (a 50% loss requires a 100% gain to recover). The same lesson applies to weight loss percentages, bacterial colony counts, and any “average rate of change” reported across multiple time periods. The opposite mistake — using geometric mean for additive quantities like daily temperatures or test scores — produces meaningless numbers. The rule: if the values multiply to produce the final result, use geometric mean; if they add, use arithmetic. For mixed scenarios (Sharpe ratios, weighted averages), specialised formulas exist. Reference: Investopedia — Compound Annual Growth Rate.

The numerical-stability trick — work in log space: computing the product of many values directly overflows or underflows IEEE 754 doubles quickly (10 values of 0.1 each produces 10⁻¹⁰, fine; 50 values produces 10⁻⁵⁰, fine; 500 values produces 0.0, definitely not fine). The standard fix is to compute exp(mean(log(x))) instead. Logarithms turn the product into a sum, the sum stays within representable range, and the final exp recovers the geometric mean. Every statistical library (NumPy, SciPy, R) implements geometric mean this way internally; rolling your own naïvely is one of the more common ways to silently produce zeros for large samples.

Why the geometric mean works for ratios — the intuition: ratios live on a logarithmic scale, not a linear one. “Doubled” (×2) and “halved” (×0.5) are symmetric multiplicative changes — they should average to “no change” (×1), not to “1.25×” as the arithmetic mean would suggest. The geometric mean respects this symmetry because it computes the average logarithm of the ratios. This is why finance, biology (cell-growth rates), and physics (radioactive decay) all default to geometric means for time-series ratios. Related: harmonic mean, arithmetic mean, compound interest. Reference: NIST/SEMATECH e-Handbook — Geometric Mean.

Frequently asked questions

What is the geometric mean?: The geometric mean of n values is the nth root of their product. It is the correct average when values compound multiplicatively, such as investment returns or growth rates.
When should I use geometric mean instead of arithmetic mean?: Use geometric mean whenever values multiply together to produce a result — compound interest, CAGR, population growth, viral R₀. For additive quantities like temperatures or test scores, use arithmetic mean.
Why does a +50% gain followed by a −50% loss not break even?: Because gains and losses compound asymmetrically: 1.5 × 0.5 = 0.75, a 25% net loss. The geometric mean of [1.5, 0.5] is √0.75 ≈ 0.866, reflecting the actual −13.4% annualised loss, whereas the arithmetic mean of 1.0 implies no change.
How do I compute geometric mean without floating-point overflow?: Use exp(mean(log(x))) instead of computing the raw product. This transforms multiplication into addition in log space, avoiding overflow or underflow for large datasets.

Published May 16, 2026 · Last reviewed May 31, 2026