Important Random Variables: The Exponential Distribution

Probability

The Analysis of Data, volume 1

0
- Front Matter
- 0.1: Contents
- 0.2: Preface
1
2
- Random Variables
- 2.1: Basic Definitions
- 2.2: Functions of RVs
- 2.3: Expectation and Variance
- 2.4: Moments and MGF
- 2.5: RVs and Measure Theory
- 2.6: Notes
- 2.7: Exercises
3
4
5
- Important Vectors
- 5.1: Multinomial Vectors
- 5.2: Gaussian Vectors
- 5.3: Dirichlet Vectors
- 5.4: Mixture Vectors
- 5.5: Exponential Family
- 5.6: Notes
- 5.7: Exercises
6
- Random Processes
- 6.1: Basic Definitions
- 6.2: Marginals
- 6.3: Moments
- 6.4: Random Walk
- 6.5: Processes and Measure
- 6.6: Borell-Cantelli and Zero-One
- 6.7: Notes
- 6.8: Exercises
7
- Important RPs
- 7.1: Markov Chains
- 7.2: Poisson Process
- 7.3: Gaussian Process
- 7.4: Notes
- 7.5: Exercises
8
A
- Set Theory
- A.1: Basic Definition
- A.2: Functions
- A.3: Cardinality
- A.4: Limits of Sets
- A.5: Notes
- A.6: Exercises
B
- Metric Spaces
- B.1: Basic Definitions
- B.2: Limits
- B.3: Continuity
- B.4: Euclidean Space
- B.5: Growth of Functions
- B.6: Notes
- B.7: Exercises
C
- Linear Algebra
- C.1: Basic Definitions
- C.2: Rank
- C.3: Eigenvalues and Determinant
- C.4: Semidefinite Matrices
- C.5: SVD
- C.6: Notes
- C.7: Exercises
D
- Differentiation
- D.1: Scalar Differentiation
- D.2: Power and Taylor Series
- D.3: Notes
- D.4: Exercises
E
- Measure Theory
- E.1: Sigma Algebras
- E.2: Measure Function
- E.3: Extension Theorem
- E.4: Independence
- E.5: Important Measures
- E.6: Measurable Functions
- E.7: Notes
F

$ \def\P{\mathsf{\sf P}} \def\E{\mathsf{\sf E}} \def\Var{\mathsf{\sf Var}} \def\Cov{\mathsf{\sf Cov}} \def\std{\mathsf{\sf std}} \def\Cor{\mathsf{\sf Cor}} \def\R{\mathbb{R}} \def\c{\,|\,} \def\bb{\boldsymbol} \def\diag{\mathsf{\sf diag}} $

3.8. The Exponential Distribution

The exponential RV, $X\sim \text{Exp}(\lambda)$, where $\lambda > 0$, has the pdf \[f_X(x)=\begin{cases}\lambda e^{-\lambda x} & x > 0\\ 0 &\text{otherwise}\end{cases}.\] Since the pdf decreases exponentially as $x$ grows (for positive $x$), it is more probable that $X$ will receive a small positive value than a large positive value. The cdf is \[F_X(x)=\P(X\leq x)=\begin{cases} \int_0^x \lambda e^{-\lambda x}=-e^{-\lambda x}\Big|_0^x=1-e^{-\lambda x} & x > 0 \\ 0 & \text{otherwise}\end{cases}.\]

The exponential RV is the only continuous RV $X$ with the memoryless property: the probability that $X$ is larger than $s+t$ is the same as the probability that $X$ is larger than $s$ in one experiment and an independent copy of $X$ is larger than $t$ in an independent experiment \begin{align} \P(X > s+t) = \P(X > s)\P(X > t). \end{align} (The equation above holds for the exponential RV since $\P(X > t)=1-F_X(t)=e^{-\lambda x}$.) The term "memoryless" is motivated by noting that $\P( X >s+t) = \P(X > s)\P(X > t)$ implies the following lack of memory: \begin{align*} \P(X > t+h|X > t)&=\frac{\P(\{X > t+h\}\cap\{X > t\})}{\P(X > t)} =\frac{\P(\{X > t+h\})}{\P(X > t)}\\ &=\frac{e^{-\lambda(t+h)}}{e^{-\lambda t}}=e^{-\lambda h}=\P(X > h). \end{align*} A proof that no other continuous distribution has this property is available for example in (Feller, 1968). The memoryless property motivates the use of the exponential RV to model times between successive arrivals of customers at a store, cars at an intersection, or phone calls at a switchboard.

The mgf of an exponential RV is \begin{align*} m(t)&=\E(\exp(tX))=\lambda\int_0^{\infty} e^{-\lambda x} e^{tx}\,dx= \lambda\int_0^{\infty} e^{(t-\lambda)x} \,dx =\frac{\lambda}{t-\lambda} e^{(t-\lambda)x}\Big|_0^{\infty}\\ &=\frac{\lambda}{\lambda-t} \end{align*} for $t<\lambda$, implying that \begin{align*} \E(X)& = m'(0) = \frac{\lambda}{(\lambda-t)^2}\Big|_{t=0} = \lambda^{-1},\\ \Var(X) &= m''(0) = \frac{\lambda}{(\lambda-t)^3}\Big|_{t=0} = \lambda^{-2}. \end{align*}

The R code below graphs the pdf and cdf of exponential RVs with different parameter values.

x = seq(0, 3, length = 100)
y1 = dexp(x, 1)
y2 = dexp(x, 3)
y3 = pexp(x, 1)
y4 = pexp(x, 3)
D = data.frame(probability = c(y1, y2, y3, y4), x = x)
D$parameter[1:100] = "$\\lambda=1$"
D$parameter[101:200] = "$\\lambda=2$"
D$parameter[201:300] = "$\\lambda=1$"
D$parameter[301:400] = "$\\lambda=2$"
D$type[1:200] = "$f_X(x)$"
D$type[201:400] = "$F_X(x)$"
qplot(x, probability, data = D, geom = "area", facets = parameter ~
    type, xlab = "$x$", ylab = "", main = "Exponential pdf and cdf functions")