Binomial Distribution
Summary
The binomial distribution models the probability of a specific number of successes in a fixed number of independent Bernoulli trials, each with the same probability of success.
Definition
A binomial distribution describes the number of successes in a sequence of independent trials, where each trial has two possible outcomes: success or failure. The probability of success in each trial is denoted by .
The binomial probability mass function (PMF) is given by:
where:
- is the number of trials,
- is the number of successes,
- is the probability of success on a single trial,
- is the binomial coefficient, representing the number of ways to choose successes from trials.
Properties
-
Mean: The expected value (mean) of a binomial distribution is given by:
-
Variance: The variance of a binomial distribution is:
-
Support: The possible values of range from to , where is the number of successes.
Example
Example: Tossing a Coin
Suppose you flip a fair coin () 10 times. What is the probability of getting exactly 6 heads?
Using the binomial formula:
This simplifies to:
Key Points
Key Characteristics of Binomial Distribution
- Trials are independent.
- Each trial has exactly two outcomes.
- Probability of success remains constant across trials.
Common Use Cases
- Quality control: Checking defective products in a batch.
- Clinical trials: Success rate of a treatment.
- Survey analysis: Probability of a certain number of people agreeing with a statement.
Cumulative Binomial Probability
In practice, it is often useful to compute the probability of being less than or equal to a certain value. The cumulative distribution function (CDF) for the binomial distribution is:
This can be used, for example, to determine the likelihood of obtaining up to a certain number of successes in trials.
Inverse Binomial Distribution
An inverse binomial distribution (or negative binomial distribution) models the number of failures needed before achieving successes, where is a fixed number of successes. Its probability mass function is:
Here, represents the number of failures before the success.
Conceptual Question
How many trials are needed, on average, to achieve successes if the probability of success on each trial is ?
The expected value for the total number of trials is:
Practical Considerations
-
The binomial distribution assumes independence between trials. If the trials are dependent (e.g., without replacement), consider using a hypergeometric distribution.
-
For large and small , the Poisson distribution can approximate the binomial distribution when .
-
The normal distribution can approximate the binomial distribution when is large, using the Central Limit Theorem:
This approximation is particularly useful for computationally expensive problems.