Important Random Variables: The Geometric Distribution

Probability

The Analysis of Data, volume 1


$ \def\P{\mathsf{\sf P}} \def\E{\mathsf{\sf E}} \def\Var{\mathsf{\sf Var}} \def\Cov{\mathsf{\sf Cov}} \def\std{\mathsf{\sf std}} \def\Cor{\mathsf{\sf Cor}} \def\R{\mathbb{R}} \def\c{\,|\,} \def\bb{\boldsymbol} \def\diag{\mathsf{\sf diag}} $

3.3. The Geometric Distribution

The geometric RV, $X\sim\text{Geom}(\theta)$, where $\theta\in(0,1]$, is the number of failures we encounter in a sequence of independent Bernoulli experiments with parameter $\theta$ before encountering the first success. The pmf of the geometric RV $X\sim\text{Geom}(\theta)$ is \[p_X(x)=\begin{cases}\theta(1-\theta)^x & x\in\mathbb{N}\cup\{0\}\\ 0 &\text{otherwise}\end{cases}.\]
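The definition can be checked by simulation. The sketch below is not part of the original text: it runs Bernoulli trials until the first success, counts the failures, and compares the empirical frequencies with R's `dgeom`. The helper name `count_failures`, the seed, and the sample size are illustrative choices.

```r
set.seed(1)  # arbitrary seed, for reproducibility only
theta = 0.3

# hypothetical helper: number of failures before the first success
count_failures = function(theta) {
    failures = 0
    while (runif(1) >= theta) {  # each trial succeeds with probability theta
        failures = failures + 1
    }
    failures
}

samples = replicate(20000, count_failures(theta))
x = 0:5
empirical = sapply(x, function(k) mean(samples == k))
# rows: empirical frequency and the pmf theta * (1 - theta)^x
print(round(rbind(empirical, pmf = dgeom(x, theta)), 3))
```

With a sample of this size the two rows should agree to roughly two decimal places; the exact figures depend on the seed.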

Using the power series formula (see Section D.2) we can ascertain that $\P(X\in\Omega)=1$: \[\sum_{n=0}^{\infty} p_X(n)=\theta(1+(1-\theta)+(1-\theta)^2+\cdots)=\theta \frac{1}{1-(1-\theta)}=1.\] Using geometric series formulas (see Section D.2), which give $\sum_{n=0}^{\infty}nq^n=q/(1-q)^2$ and $\sum_{n=0}^{\infty}n^2q^n=2q/(1-q)^3-q/(1-q)^2$ for $|q|<1$, and setting $q=1-\theta$, we derive \begin{align*} \E(X)&=\theta \sum_{n=0}^{\infty}n(1-\theta)^n = \theta\left(\frac{1}{\theta^2}-\frac{1}{\theta}\right)= \theta\frac{1-\theta}{\theta^2} =\frac{1-\theta}{\theta}, \\ \Var(X) &= \E(X^2)-(\E(X))^2= \theta \sum_{n=0}^{\infty}n^2(1-\theta)^n - \frac{(1-\theta)^2}{\theta^2}\\ &=\theta\frac{2(1-\theta)}{\theta^3} - \theta\frac{1-\theta}{\theta^2} -\frac{(1-\theta)^2}{\theta^2} =\frac{2-2\theta-\theta+\theta^2-1+2\theta-\theta^2}{\theta^2} =\frac{1-\theta}{\theta^2}. \end{align*}
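As a numerical sanity check on these formulas, the following sketch (an addition, not from the original text) approximates the three infinite sums by truncating them at $n=1000$. The value $\theta=0.3$ and the truncation point are arbitrary assumptions; the omitted tail is negligible at this cutoff.

```r
theta = 0.3
n = 0:1000  # truncation point; the omitted tail mass is (1 - theta)^1001
p = theta * (1 - theta)^n

total = sum(p)          # should be close to 1
m = sum(n * p)          # should be close to (1 - theta) / theta
v = sum(n^2 * p) - m^2  # should be close to (1 - theta) / theta^2

print(c(total = total, mean = m, variance = v))
print(c(mean = (1 - theta) / theta, variance = (1 - theta) / theta^2))
```

The second printed line shows the closed-form mean and variance for comparison with the truncated sums.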

The R code below graphs the pmf of a geometric RV for three values of $\theta$. In accordance with our intuition, it shows that as $\theta$ increases, $X$ is less likely to take large values.

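library(ggplot2)  # provides qplot and geom_linerange

x = 0:9
# pmf values for each theta, stacked into long format (values, group label)
D = stack(list(`$\\theta=0.3$` = dgeom(x, 0.3), `$\\theta=0.5$` = dgeom(x,
    0.5), `$\\theta=0.7$` = dgeom(x, 0.7)))
names(D) = c("mass", "theta")
D$x = x  # x is recycled across the three stacked groups
# one panel per theta: points at the pmf values, vertical lines down to zero
qplot(x, mass, data = D, main = "Geometric pmf",
    geom = "point", facets = theta ~
        ., xlab = "$x$", ylab = "$p_X(x)$") + geom_linerange(aes(x = x,
    ymin = 0, ymax = mass))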
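The effect of $\theta$ on large values can also be read off the tail probability: summing the pmf from $k$ onward gives $\P(X\geq k)=(1-\theta)^k$, which decreases in $\theta$ for every $k\geq 1$. The short check below is an added sketch that uses base R's `pgeom`; the choice $k=5$ is arbitrary.

```r
k = 5
theta = c(0.3, 0.5, 0.7)
# P(X >= k) = P(X > k - 1), computed from the geometric cdf
tail_prob = pgeom(k - 1, theta, lower.tail = FALSE)
print(rbind(theta, tail_prob, closed_form = (1 - theta)^k))
```

The `tail_prob` and `closed_form` rows should coincide, and both shrink as $\theta$ grows.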