Important Random Variables: The Bernoulli Trial Distribution

Probability

The Analysis of Data, volume 1


$ \def\P{\mathsf{\sf P}} \def\E{\mathsf{\sf E}} \def\Var{\mathsf{\sf Var}} \def\Cov{\mathsf{\sf Cov}} \def\std{\mathsf{\sf std}} \def\Cor{\mathsf{\sf Cor}} \def\R{\mathbb{R}} \def\c{\,|\,} \def\bb{\boldsymbol} \def\diag{\mathsf{\sf diag}} $

3.1. The Bernoulli Trial Distribution

Below, we denote the distributions corresponding to the different random variables using abbreviations such as Ber or Bin. When a distribution is parameterized, we attach the parameter or parameters to the abbreviation, for example $\text{Bin}(n,\theta)$. We use the notation $\sim$ to denote "distributed according to"; for example, $X\sim\text{Ber}(\theta)$ means that the RV $X$ follows the $\text{Ber}(\theta)$ distribution.

The Bernoulli trial RV, $X\sim\text{Ber}(\theta)$, where $\theta\in[0,1]$, is characterized by the following pmf: \[p_X(x)=\begin{cases} \theta & x=1\\ 1-\theta & x=0\\0&\text{otherwise}\end{cases}, \qquad \theta\in [0,1].\] The Bernoulli trial RV models an experiment (or trial) that either succeeds, $X=1$, or fails, $X=0$, with probabilities $\theta$ and $1-\theta$, respectively. A popular example of such an experiment is flipping a potentially biased coin, with success corresponding to heads and failure corresponding to tails.
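The pmf above can be written directly as a small R function. This is a minimal sketch: the helper name `ber_pmf` is hypothetical (it is not part of R or of the text), and the comparison relies on the fact that $\text{Ber}(\theta)$ coincides with $\text{Bin}(1,\theta)$, so that `dbinom` with `size = 1` should return the same values.

```r
# Hypothetical helper (not from the text): the Ber(theta) pmf evaluated at x.
ber_pmf <- function(x, theta) {
  stopifnot(theta >= 0, theta <= 1)
  # theta at x = 1, 1 - theta at x = 0, and 0 everywhere else
  ifelse(x == 1, theta, ifelse(x == 0, 1 - theta, 0))
}

theta <- 0.3
x <- c(-1, 0, 1, 2)  # includes points outside the support {0, 1}
p <- ber_pmf(x, theta)

# Compare with R's built-in binomial pmf with a single trial
agrees <- isTRUE(all.equal(p, dbinom(x, size = 1, prob = theta)))
print(agrees)
```

Evaluating the function at points outside $\{0,1\}$ exercises the "otherwise" branch of the pmf, which is easy to overlook.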

The expectation and variance of $X\sim\text{Ber}(\theta)$ are: \begin{align*} \E(X)&=1\cdot\theta+0\cdot(1-\theta)=\theta\\ \Var(X)&=\E(X^2)-(\E(X))^2=1^2\cdot\theta+0^2\cdot(1-\theta)-\theta^2=\theta(1-\theta). \end{align*}
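As a rough numerical check of these two formulas, one can simulate Bernoulli trials and compare the empirical moments with $\theta$ and $\theta(1-\theta)$. The sketch below is illustrative only: the sample size, the seed, and $\theta=0.3$ are arbitrary choices, and the empirical values will only be close to the theoretical ones, not equal to them.

```r
set.seed(1)     # arbitrary seed, for reproducibility only
theta <- 0.3    # arbitrary parameter choice
n <- 100000     # number of simulated trials

# rbinom with size = 1 draws Bernoulli(theta) samples
x <- rbinom(n, size = 1, prob = theta)

emp_mean <- mean(x)
# Variance via E(X^2) - (E(X))^2, mirroring the derivation above
emp_var <- mean(x^2) - mean(x)^2

print(c(empirical = emp_mean, theoretical = theta))
print(c(empirical = emp_var, theoretical = theta * (1 - theta)))
```

Because $X$ takes only the values 0 and 1, $X^2=X$, so the empirical variance computed this way equals `emp_mean * (1 - emp_mean)` up to floating-point error, which is the sample analogue of $\theta(1-\theta)$.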

The R code below graphs the mass functions corresponding to three different $\theta$ parameters.

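library(ggplot2)  # provides qplot, aes, and geom_linerange
# support of the Bernoulli distribution
x = c(0, 1)
# Ber(theta) is Bin(1, theta), so dbinom with size 1 gives the pmf;
# the $...$ names are LaTeX markup and render as math only on a LaTeX-aware device
D = stack(list(`$\\theta=0.3$` = dbinom(x, 1, 0.3),
    `$\\theta=0.5$` = dbinom(x, 1, 0.5), `$\\theta=0.9$` = dbinom(x,
        1, 0.9)))
names(D) = c("mass", "theta")
D$x = x  # x is recycled across the three theta groups
# one panel per theta: a point at each mass value and a vertical line down to 0
qplot(x, mass, data = D, main = "Bernoulli pmf", geom = "point",
    facets = . ~ theta, xlab = "$x$",
    ylab = "$p_X(x)$") + geom_linerange(aes(x = x,
    ymin = 0, ymax = mass))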