4. Statistics and Probability

Mean And Variance Of A Distribution

Mean and Variance of a Distribution πŸ“Š

students, imagine you are tracking the number of messages a student receives each day or the time it takes buses to arrive. Some days the value is small, some days it is large, and the pattern matters more than any single result. In statistics, a distribution describes how values are spread out. In this lesson, you will learn how the mean and variance summarize the center and spread of a distribution, and why these ideas are essential in probability, data analysis, and decision-making.

What you will learn 🎯

By the end of this lesson, students, you should be able to:

  • explain the meaning of the mean and variance of a distribution,
  • calculate the mean and variance for discrete probability distributions,
  • interpret what the values tell you about the data,
  • connect these ideas to the wider IB topic of statistics and probability.

These ideas appear everywhere in the syllabus: from probability distributions to modelling random events, and they help you describe uncertainty in a clear mathematical way.

Understanding a distribution

A probability distribution lists all possible values of a random variable and the probability of each value. A random variable is a numerical outcome of a chance process. For example, let $X$ be the number of heads when flipping a coin twice. Then $X$ can take values $0$, $1$, or $2$.

A distribution is not just a list of numbers. It tells a story about how likely each result is. If one value is much more likely than the others, the distribution is concentrated. If many values have similar probabilities, the distribution is more spread out.

For a discrete random variable, the probabilities must satisfy two rules:

  • each probability is between $0$ and $1$,
  • the total probability is $1$.

That means if $P(X=x)$ is the probability that $X$ equals $x$, then:

$$\sum P(X=x)=1$$

This is one of the basic checks for a valid distribution.

The mean of a distribution

The mean of a distribution is its expected value. It gives the long-run average result if the random process is repeated many times. In IB Mathematics, the mean of a discrete random variable $X$ is written as $\mu$ or $E(X)$.

For a discrete distribution, the mean is calculated using:

$$\mu=E(X)=\sum x\,P(X=x)$$

This formula means: multiply each value by its probability, then add the results.

Example 1: Coin flips

Suppose $X$ is the number of heads in two fair coin flips. The possible values are $0$, $1$, and $2$.

The distribution is:

  • $P(X=0)=\frac{1}{4}$,
  • $P(X=1)=\frac{1}{2}$,
  • $P(X=2)=\frac{1}{4}$.

Now calculate the mean:

$$\mu=0\left(\frac{1}{4}\right)+1\left(\frac{1}{2}\right)+2\left(\frac{1}{4}\right)$$

$$\mu=0+\frac{1}{2}+\frac{1}{2}=1$$

So the mean number of heads is $1$. This does not mean you always get exactly one head. It means that over many repeated trials, the average approaches $1$.

Interpreting the mean

The mean is a balance point. If the distribution were drawn like a bar chart, the mean would be near the center of the probabilities. In real life, this can represent average daily rainfall, average number of customers, or average exam score.

For example, if a school game gives players a random number of points each round, the mean tells you the typical score in the long run. This helps compare different games or strategies.

The variance of a distribution

The variance measures how spread out the values are around the mean. A distribution with a small variance has values close to the mean. A distribution with a large variance has values far from the mean more often.

The variance of a random variable $X$ is written as $\operatorname{Var}(X)$ or $\sigma^2$. It is defined by:

$$\operatorname{Var}(X)=E\left((X-\mu)^2\right)$$

For a discrete distribution, this becomes:

$$\sigma^2=\sum (x-\mu)^2P(X=x)$$

There is also a very useful shortcut formula:

$$\sigma^2=E(X^2)-\mu^2$$

where

$$E(X^2)=\sum x^2P(X=x)$$

The square is important because it ensures that positive and negative deviations do not cancel out.

Example 2: Variance for the coin-flip distribution

Using the same distribution as before, first find $E(X^2)$:

$$E(X^2)=0^2\left(\frac{1}{4}\right)+1^2\left(\frac{1}{2}\right)+2^2\left(\frac{1}{4}\right)$$

$$E(X^2)=0+\frac{1}{2}+1=\frac{3}{2}$$

We already found $\mu=1$, so:

$$\sigma^2=E(X^2)-\mu^2=\frac{3}{2}-1^2=\frac{1}{2}$$

So the variance is $\frac{1}{2}$.

If you want the standard deviation, it is the square root of the variance:

$$\sigma=\sqrt{\sigma^2}$$

Here that gives:

$$\sigma=\sqrt{\frac{1}{2}}$$

The variance and standard deviation both describe spread, but standard deviation is in the same units as the original data, which makes it easier to interpret.

Why variance matters

Variance tells you more than the mean alone. Two distributions can have the same mean but very different spreads.

Imagine two buses:

  • Bus A usually arrives around $10$ minutes after departure, with times close to $10$ minutes.
  • Bus B also has an average arrival time of $10$ minutes, but sometimes it comes in $2$ minutes and sometimes in $18$ minutes.

Both buses have the same mean arrival time, but Bus B has a larger variance. That means Bus B is less predictable.

In statistics and probability, variance is important because it measures risk, consistency, and uncertainty. In finance, a set of investment returns with high variance is more volatile. In sports, a player with a consistent scoring pattern may have lower variance than one who scores very high one game and very low the next.

Steps for calculating mean and variance

When IB questions ask you to find the mean and variance of a discrete distribution, use this method:

  1. List the values of the random variable.
  2. List the probabilities.
  3. Check that the probabilities add to $1$.
  4. Find the mean using:

$$\mu=\sum xP(X=x)$$

  1. Find $E(X^2)$ using:

$$E(X^2)=\sum x^2P(X=x)$$

  1. Use the shortcut formula:

$$\sigma^2=E(X^2)-\mu^2$$

  1. If needed, take the square root to find the standard deviation:

$$\sigma=\sqrt{\sigma^2}$$

This procedure is efficient and reduces calculation errors.

Example 3: A simple table

Suppose $X$ has the following distribution:

  • $P(X=1)=0.2$
  • $P(X=2)=0.5$
  • $P(X=3)=0.3$

First, check the total probability:

$$0.2+0.5+0.3=1$$

Now find the mean:

$$\mu=1(0.2)+2(0.5)+3(0.3)$$

$$\mu=0.2+1.0+0.9=2.1$$

Next, find $E(X^2)$:

$$E(X^2)=1^2(0.2)+2^2(0.5)+3^2(0.3)$$

$$E(X^2)=0.2+2.0+2.7=4.9$$

Then the variance is:

$$\sigma^2=4.9-(2.1)^2=4.9-4.41=0.49$$

So the standard deviation is:

$$\sigma=\sqrt{0.49}=0.7$$

This tells you that the values are fairly close to the mean of $2.1$.

Linking mean and variance to the wider syllabus

students, these ideas are not isolated. They connect strongly to the rest of statistics and probability.

  • In data collection and statistical description, the mean and spread help summarize samples and populations.
  • In regression and correlation, understanding variation is important because data points do not sit exactly on a line.
  • In conditional probability and Bayes’ theorem, random variables and distributions help model uncertainty in real situations.
  • In discrete and continuous probability distributions, the mean and variance describe the expected behavior of random outcomes.

For continuous distributions, the same ideas exist, but summation is replaced by integration. The mean and variance are still about center and spread, just adapted to a continuous setting.

These concepts are especially important in HL work because they support mathematical modelling and interpretation. A good solution is not just about calculation; it is about explaining what the numbers mean in context.

Conclusion

The mean of a distribution gives the long-run average, while the variance measures how spread out the outcomes are around that mean. Together, they give a compact summary of a random variable. In IB Mathematics: Analysis and Approaches HL, you will use them to analyze probability distributions, compare different random processes, and interpret uncertainty in real-world contexts. Understanding these ideas helps you move from simply computing numbers to making meaningful mathematical conclusions. πŸ“˜

Study Notes

  • A distribution shows the possible values of a random variable and their probabilities.
  • For a discrete random variable $X$, the mean is $\mu=\sum xP(X=x)$.
  • The mean is the expected long-run average, not necessarily a value seen in one trial.
  • The variance is $\sigma^2=\sum (x-\mu)^2P(X=x)$.
  • A useful shortcut is $\sigma^2=E(X^2)-\mu^2$.
  • Standard deviation is $\sigma=\sqrt{\sigma^2}$.
  • Small variance means values are clustered near the mean.
  • Large variance means values are more spread out.
  • Mean and variance are central tools in probability, data analysis, and modelling.
  • These ideas connect directly to discrete and continuous distributions, correlation, regression, and decision-making under uncertainty.

Practice Quiz

5 questions to test your understanding

Mean And Variance Of A Distribution β€” IB Mathematics Analysis And Approaches HL | A-Warded