Calculations with the Normal Distribution
Welcome, students! 😊 In this lesson, you will learn how the normal distribution is used to calculate probabilities for real data, such as heights, test scores, and measurement errors. The normal distribution is one of the most important models in statistics because many natural and human-made measurements cluster around an average value. By the end of this lesson, you should be able to explain the key ideas, use the correct language, and solve probability problems involving the normal distribution.
What is the normal distribution?
A normal distribution is a continuous probability distribution with a bell-shaped curve. It is symmetric about its mean, so the left side mirrors the right side. The mean, median, and mode are all equal in a perfectly normal distribution. In IB Mathematics Analysis and Approaches SL, the normal distribution is usually written as $X \sim N(\mu, \sigma^2)$, where $\mu$ is the mean and $\sigma$ is the standard deviation.
The mean $\mu$ tells us the center of the data, while the standard deviation $\sigma$ tells us how spread out the values are. A small $\sigma$ means the data are tightly grouped near the mean, and a larger $\sigma$ means the data are more spread out. This matters in real life. For example, if the heights of students in a school are modeled by a normal distribution, most students will be close to the average height, while only a few will be very short or very tall.
A normal curve is continuous, so the probability that $X$ is exactly equal to one value, such as $P(X=70)$, is $0$. Instead, we find probabilities over intervals, such as $P(X<70)$ or $P(65<X<80)$. This is an important idea in probability because areas under the curve represent probability.
Standardization and the $z$-score
To calculate probabilities, it is often helpful to convert any normal variable into the standard normal distribution. The standard normal distribution has mean $0$ and standard deviation $1$, written as $Z \sim N(0,1)$. The conversion uses the $z$-score formula:
$$z=\frac{x-\mu}{\sigma}$$
This formula shows how many standard deviations a value $x$ is above or below the mean. If $z$ is positive, the value is above the mean. If $z$ is negative, the value is below the mean.
For example, suppose exam scores are normally distributed with $\mu=70$ and $\sigma=8$. If a student scores $86$, then
$$z=\frac{86-70}{8}=2$$
This means the score is $2$ standard deviations above the mean. That is quite high, because values far from the mean are less common in a normal distribution.
Standardization is useful because tables and technology are usually based on the standard normal distribution. Once a value is converted to a $z$-score, we can find probabilities using calculator functions, software, or a normal distribution table. In IB exams, you should be comfortable moving between $X$ and $Z$ and interpreting what the probabilities mean.
Finding probabilities from the normal distribution
One of the main tasks is calculating probabilities such as $P(X<a)$, $P(X>b)$, and $P(a<X<b)$. These probabilities are areas under the normal curve.
If $X \sim N(\mu, \sigma^2)$, then to find $P(X<a)$, first standardize using
$$z=\frac{a-\mu}{\sigma}$$
and then use the standard normal distribution to find $P(Z<z)$. Similarly, to find $P(a<X<b)$, calculate both $z$-scores and find the area between them.
Example: Let $X \sim N(100,15^2)$. Find $P(X<85)$.
First, standardize:
$$z=\frac{85-100}{15}=-1$$
So,
$$P(X<85)=P(Z<-1)$$
Using technology or a table, this is approximately $0.1587$. That means about $15.87\%$ of values are below $85$.
Now try an interval. Find $P(90<X<120)$ for the same distribution.
Standardize both values:
$$z_1=\frac{90-100}{15}=-\frac{2}{3}$$
$$z_2=\frac{120-100}{15}=\frac{4}{3}$$
Then the probability is
$$P\left(-\frac{2}{3}<Z<\frac{4}{3}\right)$$
Using technology, this is approximately $0.6334$. So about $63.34\%$ of values lie between $90$ and $120$.
A useful check is that probabilities must always be between $0$ and $1$. If your answer is negative or greater than $1$, something has gone wrong.
Using inverse normal calculations
Sometimes the question gives a probability and asks for a value. This is the reverse process, called inverse normal. In other words, you know the area and must find the corresponding $x$-value.
Suppose $X \sim N(50,6^2)$ and you want the value $a$ such that $P(X<a)=0.90$. You are looking for the 90th percentile.
First, find the $z$-value with left-tail area $0.90$. The corresponding standard normal value is approximately $z=1.282$. Then use the transformation backwards:
$$x=\mu+z\sigma$$
So,
$$x=50+(1.282)(6)$$
which gives
$$x\approx 57.7$$
So $90\%$ of values are below about $57.7$.
This type of calculation is very useful in real life. For example, a company may want to know the score that separates the top $10\%$ of applicants, or a school may want to find the grade needed to be in the highest group.
Remember that the direction of the inequality matters. If the question asks for the value such that $P(X>a)=0.10$, then the area to the left is $0.90$, so the same percentile logic applies.
Interpreting normal distribution results in context
Calculations are only part of the task. You also need to interpret the results in context. This is a major IB skill because statistics is about meaning, not just computation.
For example, if a manufacturer says the mass of packets of cereal is normally distributed with mean $500$ g and standard deviation $12$ g, and you find $P(X<480)=0.0478$, the interpretation is that about $4.78\%$ of packets weigh less than $480$ g. In context, that could mean a small but important number of packets are underweight.
If the question asks for a probability involving a real scenario, always write the answer as a sentence. For example: “There is a $0.1587$ probability that a randomly chosen student scores below $85$.” If percentages are more natural, you can say “about $15.87\%$.”
In many situations, the normal model is an approximation. It works well when the data are roughly symmetric and bell-shaped. If a distribution is strongly skewed, the normal distribution may not be appropriate. In IB, you should be able to recognize when a normal model is sensible and when it may not fit the data well.
Common exam skills and pitfalls
There are a few important habits to develop when working with the normal distribution.
First, always identify the parameters correctly. If $X \sim N(60,9^2)$, then the mean is $60$ and the standard deviation is $9$, not $9^2$. The variance is $9^2=81$.
Second, make sure your calculator settings match the problem. You must know whether you are using a normal cumulative distribution function or an inverse normal function. If you use technology, enter the mean, standard deviation, and bounds carefully.
Third, pay attention to the wording. The phrase “at least” means $\geq$, “at most” means $\leq$, “more than” means $>$, and “less than” means $<$. These details affect how you set up the probability.
Fourth, use symmetry when helpful. Because the normal curve is symmetric, probabilities on one side can often be found from the other side. For instance,
$$P(Z>1.5)=1-P(Z<1.5)$$
This can make calculations faster and reduce mistakes.
Fifth, know the empirical rule as a rough guide. For a normal distribution, about $68\%$ of data lie within $1\sigma$ of the mean, about $95\%$ within $2\sigma$, and about $99.7\%$ within $3\sigma$. This is not exact, but it helps you estimate whether an answer seems reasonable.
Why this topic matters in statistics and probability
Calculations with the normal distribution connect directly to the wider topic of statistics and probability. Data collection gives us measurements. Statistical description helps us summarize those measurements using mean, standard deviation, and graphs. Probability then lets us make predictions about future values or assess how unusual a result is.
The normal distribution is also connected to the idea of modeling. A model is a simplified mathematical description of real data. In statistics, a good model helps us answer questions such as: How likely is a certain result? What score is needed to be in the top group? Is a value unusual enough to deserve attention?
This topic also supports later work in hypothesis testing and inference, where the normal distribution often appears again. That is why it is important to be confident with probabilities, percentiles, and interpretation now, students.
Conclusion
The normal distribution is a powerful tool for describing and analyzing data that cluster around a mean. To work with it successfully, you need to understand the notation $X \sim N(\mu, \sigma^2)$, use the $z$-score formula, find probabilities for intervals and tails, and work backwards with inverse normal calculations. Most importantly, you must interpret answers in context and check whether the normal model makes sense. With practice, these calculations become a reliable way to solve real-world problems in IB Mathematics Analysis and Approaches SL. ✅
Study Notes
- A normal distribution is a continuous, bell-shaped, symmetric distribution.
- The notation $X \sim N(\mu, \sigma^2)$ means the mean is $\mu$ and the variance is $\sigma^2$.
- The standard normal distribution is $Z \sim N(0,1)$.
- Use $z=\frac{x-\mu}{\sigma}$ to standardize values.
- For probabilities, areas under the curve represent chance.
- $P(X=a)=0$ for a continuous random variable.
- To find probabilities, convert to $Z$ and use technology or a table.
- To find a value from a probability, use inverse normal and then $x=\mu+z\sigma$.
- Read inequality words carefully: “at least,” “at most,” “more than,” and “less than.”
- Interpret answers in context, using sentences and percentages when appropriate.
- The empirical rule gives a quick estimate: $68\%$, $95\%$, and $99.7\%$.
- The normal distribution is widely used in statistics because many real measurements are approximately normal.
