Skip to content

Geometric Distribution

Core Concept

The Geometric distribution models the number of independent Bernoulli trials until the first success.

  • A Bernoulli trial is an experiment with two outcomes: success (1) or failure (0).
  • Trials are independent (e.g., coin flips, dice rolls).
  • The distribution answers: “What is the probability that the first success occurs on the \(n\)-th trial?”

Definitions

Let:

  • \(p\) = probability of success on a single trial
  • \(q = 1 - p\) = probability of failure
  • \(X\) = number of trials until the first success

Then:

\[ X \sim \text{Geometric}(p) \]

The Probability Mass Function (PMF) is:

\[ \begin{aligned} P(\text{first success on the } n\text{th trial}) &= P(F_1 \cap F_2 \cap \dots \cap F_{n-1} \cap S_n) \\ &= P(F)^{n - 1} \cdot P(S) \quad \\ &= (1 - p)^{n - 1} \cdot p \\ &= q^{n - 1} \cdot p \\ \text{where } q &= 1 - p, \\ \text{and each } F_i &\sim \text{Bernoulli(0), and } S_n \sim \text{Bernoulli(1)}) \end{aligned} \]

Intuition

To succeed for the first time on the \(n\)-th trial:

  • First \(n - 1\) trials must be failures (\(q^{n - 1}\))
  • The \(n\)-th trial must be a success (\(p\))

So:

\[ P(X = n) = q^{n - 1} \cdot p = (1 - p)^{n - 1} \cdot p \]
Type Formula
Support \(x \in \{1, 2, 3, \dots\}\)
Mean \(\mathbb{E}[X] = \frac{1}{p}\)
Variance \(\text{Var}(X) = \frac{1 - p}{p^2}\)
Standard Deviation \(\sigma = \frac{\sqrt{1 - p}}{p}\)
PMF (Probability Mass Function) \(\mathbb{P}(X = k) = (1 - p)^{k - 1} p\)
CDF (Cumulative Distribution Function) \(\mathbb{P}(X \leq k) = 1 - (1 - p)^k\)
Memoryless Property \(\mathbb{P}(X > m + n \mid X > m) = \mathbb{P}(X > n)\)

Example

Question: What is the probability that it takes at least 10 rolls of a fair die to roll a 5?

  • Success = rolling a 5
  • \(p = \frac{1}{6}, q = \frac{5}{6}\)
  • \(X \sim \text{Geometric}\left(\frac{1}{6}\right)\)

We want: \(P(X \geq 10)\)

Method 1: Summing Infinite Tail

\[ P(X \geq 10) = \sum_{k=10}^{\infty} P(X = k) = \sum_{k=10}^{\infty} q^{k - 1} \cdot p \]

Factor out \(q^9 \cdot p\):

\[ P(X \geq 10) = q^9 \cdot p \cdot \left(1 + q + q^2 + \dots \right) \]

This is a geometric series, sum of an infinite geometric series where:

  • The first term is 1,
  • The common ratio is \(q\),
  • And \(∣q∣<1\) (so the series converges)
\[ p\sum_{k=0}^{\infty} q^k = \frac{a}{1 - r} = \frac{1}{1 - q} = \frac{1}{p} \]

So:

\[ P(X \geq 10) = q^9 \cdot p \cdot \frac{1}{p} = q^9 \]

Final answer:

\[ P(X \geq 10) = \left( \frac{5}{6} \right)^9 \]

Method 2: No Success in First 9 Trials

\[ P(X \geq 10) = P(\text{no success in first 9 trials}) = q^9 = \left( \frac{5}{6} \right)^9 \]

Same result, simpler reasoning.

When to Use the Poisson Distribution

  • Number of independent trials until the first success.
  • Each trial has the same success probability \(p\).
  • Trials are independent Bernoulli (success/failure) events.