4. Probability Distributions: Common Distributions
The Binomial Distribution
Binomial Setting
In a binomial setting:
- There are independent observations
- Each observation only has possible outcomes: 'success' or 'failure'.
- The probability of a success, denoted , is the same for each observation.
Binomial Distribution
Let denote the number of successes among observations in a binomial setting.
Then is a discrete random variable with as possible outcomes.
We say has the binomial distribution with parameters and , and we write .
Flip a coin times and define a success as the coin coming up Heads.
Let be the number of successes among the observations.
Then is binomially distributed with and , denoted
Suppose of people in a large population are smokers. Choose people at random and ask them: "Are you a smoker"? Define a person answering "Yes" as a success.
Let be the number of successes among the observations.
Then is binomially distributed with and , denoted
(This is approximately binomial because the total population is large relative to the sample size.)
Computation of Binomial Probabilities with Statistical Software
Let be a binomial random variable with parameters and .
To compute in Excel, make use of the following function:
BINOM.DIST(x, n, p, cumulative)
- x: The number of successes.
- n: The number of trials.
- p: The probability of success of each trial.
- cumulative: A logical value that determines the form of the function.
- TRUE - uses the cumulative distribution (at most x successes),
- FALSE - uses the probability mass function (exactly x successes),
To compute in R, make use of the following function:
dbinom(x, size, prob)
- x: The number of successes.
- size: The number of trials.
- prob: The probability of success of each trial.
Compute . Round your answer to decimal places.
There are a number of different ways we can calculate . Click on one of the panels to toggle a specific solution.
To calculate in Excel, make use of the following function:
Thus, to calculate , run the following command:BINOM.DIST(x, n, p, cumulative)
- x: The number of successes.
- n: The number of trials.
- p: The probability of success of each trial.
- cumulative: A logical value that determines the form of the function.
- TRUE - uses the cumulative distribution (at most x successes),
- FALSE - uses the probability mass function (exactly x successes),
This gives:
To calculate in R, make use of the following function:
Thus, to calculate , run the following command:dbinom(x, size, prob)
- x: The number of successes.
- size: The number of trials.
- prob: The probability of success of each trial.
This gives:
Cumulative Distribution
For any random variable , the cumulative distribution of is for any number .
Since binomial variables are discrete, the following properties regarding cumulative probabilities hold:
Additionally, for any values and in the range of (with ):
Computation of Cumulative Binomial Probabilities with Statistical Software
Let be a binomial random variable with parameters and .
To calculate cumulative probabilities for a binomial distribution in Excel, make use of the following function:
BINOM.DIST(x, n, p, cumulative)
- x: The number of successes.
- n: The number of trials.
- p: The probability of success of each trial.
- cumulative: A logical value that determines the form of the function.
- TRUE - uses the cumulative distribution (at most x successes),
- FALSE - uses the probability mass function (exactly x successes),
To calculate cumulative probabilities for a binomial distribution in R, make use of the following function:
pbinom(q, size, prob)
- q: The number of successes.
- size: The number of trials.
- prob: The probability of success of each trial.
There are a number of different ways we can calculate . Click on one of the panels to toggle a specific solution.
To calculate in Excel, make use of the following function:
BINOM.DIST(x, n, p, cumulative)
- x: The number of successes.
- n: The number of trials.
- p: The probability of success of each trial.
- cumulative: A logical value that determines the form of the function.
- TRUE - uses the cumulative distribution (at most x successes),
- FALSE - uses the probability mass function (exactly x successes),
Thus, to calculate , run the following command:
This gives:
To calculate in R, make use of the following function:
Thus, to calculate , run the following command:pbinom(q, size, prob)
- q: The number of successes.
- size: The number of trials.
- prob: The probability of success of each trial.
This gives: