Binomial Distribution

Summary

The binomial distribution models the probability of a specific number of successes in a fixed number of independent Bernoulli trials, each with the same probability of success.


Definition

A binomial distribution describes the number of successes in a sequence of independent trials, where each trial has two possible outcomes: success or failure. The probability of success in each trial is denoted by .

The binomial probability mass function (PMF) is given by:

where:

  • is the number of trials,
  • is the number of successes,
  • is the probability of success on a single trial,
  • is the binomial coefficient, representing the number of ways to choose successes from trials.

Properties

  • Mean: The expected value (mean) of a binomial distribution is given by:

  • Variance: The variance of a binomial distribution is:

  • Support: The possible values of range from to , where is the number of successes.


Example

Example: Tossing a Coin

Suppose you flip a fair coin () 10 times. What is the probability of getting exactly 6 heads?

Using the binomial formula:

This simplifies to:


Key Points

Key Characteristics of Binomial Distribution

  • Trials are independent.
  • Each trial has exactly two outcomes.
  • Probability of success remains constant across trials.

Common Use Cases

  • Quality control: Checking defective products in a batch.
  • Clinical trials: Success rate of a treatment.
  • Survey analysis: Probability of a certain number of people agreeing with a statement.

Cumulative Binomial Probability

In practice, it is often useful to compute the probability of being less than or equal to a certain value. The cumulative distribution function (CDF) for the binomial distribution is:

This can be used, for example, to determine the likelihood of obtaining up to a certain number of successes in trials.


Inverse Binomial Distribution

An inverse binomial distribution (or negative binomial distribution) models the number of failures needed before achieving successes, where is a fixed number of successes. Its probability mass function is:

Here, represents the number of failures before the success.

Conceptual Question

How many trials are needed, on average, to achieve successes if the probability of success on each trial is ?

The expected value for the total number of trials is:


Practical Considerations

  • The binomial distribution assumes independence between trials. If the trials are dependent (e.g., without replacement), consider using a hypergeometric distribution.

  • For large and small , the Poisson distribution can approximate the binomial distribution when .

  • The normal distribution can approximate the binomial distribution when is large, using the Central Limit Theorem:

This approximation is particularly useful for computationally expensive problems.