4. Statistics and Probability

Frequency Tables

Frequency Tables πŸ“Š

Introduction: why frequency tables matter

students, imagine your school wants to know how students travel to class each day πŸšΆβ€β™€οΈπŸšŒπŸš². If the teacher asks 200 students one by one, the raw answers can feel messy and hard to read. A frequency table organizes those answers so the data becomes easier to understand, compare, and use. This is one of the first and most useful tools in statistics because it helps turn a list of values into clear information.

In this lesson, you will learn how to:

  • explain the main ideas and vocabulary behind frequency tables,
  • build and interpret frequency tables,
  • connect frequency tables to graphs and statistical summaries,
  • use frequency tables as a foundation for later topics in statistics and probability.

Frequency tables are important in IB Mathematics: Analysis and Approaches HL because they help you describe data before moving on to more advanced ideas such as correlation, regression, probability distributions, and inference.

What is a frequency table?

A frequency table is a table that shows how often each value, category, or group appears in a data set. The word frequency means β€œhow many times something occurs.” If a value appears $5$ times, its frequency is $5$.

There are different kinds of frequency tables:

  • Simple frequency table: used for categories or individual values.
  • Grouped frequency table: used for numerical data spread across intervals.
  • Cumulative frequency table: shows running totals of frequencies.

For example, suppose $20$ students were asked how many hours they studied last week. Their answers might be:

$$1, 2, 2, 3, 3, 3, 4, 4, 4, 4, 5, 5, 5, 6, 6, 6, 6, 7, 8, 8$$

A simple frequency table for this data is:

| Hours studied $x$ | Frequency $f$ |

|---|---:|

| $1$ | $1$ |

| $2$ | $2$ |

| $3$ | $3$ |

| $4$ | $4$ |

| $5$ | $3$ |

| $6$ | $4$ |

| $7$ | $1$ |

| $8$ | $2$ |

The total frequency is $20$, which matches the number of students. That total is an important check because all frequencies in a table should add up to the full number of observations.

Key vocabulary and parts of a frequency table

To read and create frequency tables confidently, students, you should know these terms:

  • Data set: the full collection of values or responses.
  • Observation: one piece of data.
  • Value: a number or category in the data set.
  • Frequency: the number of times a value appears.
  • Relative frequency: the proportion of the total for a category or value.
  • Cumulative frequency: the running total of frequencies up to a certain point.

Relative frequency is often written as

$$\text{relative frequency} = \frac{f}{n}$$

where $f$ is the frequency for one class or value and $n$ is the total number of observations.

If $5$ out of $20$ students studied $5$ hours, then the relative frequency is

$$\frac{5}{20} = 0.25$$

This means $25\%$ of the students studied $5$ hours. Relative frequency is useful because it lets you compare data sets of different sizes.

Building a frequency table from raw data

When data is listed in a messy way, the first step is usually to sort it. Then you count how often each value appears. This is a common procedure in statistics.

Suppose a class recorded the number of books read in a month:

$$0, 1, 1, 2, 2, 2, 3, 3, 4, 4, 4, 4, 5$$

A frequency table can be built by listing the values from smallest to largest and counting each one.

| Books read $x$ | Frequency $f$ |

|---|---:|

| $0$ | $1$ |

| $1$ | $2$ |

| $2$ | $3$ |

| $3$ | $2$ |

| $4$ | $4$ |

| $5$ | $1$ |

A quick check is to add the frequencies:

$$1 + 2 + 3 + 2 + 4 + 1 = 13$$

Since there are $13$ data values, the table is consistent βœ…

This same process works for categories too. If students are asked their favorite sport, the values may be words instead of numbers. The table still shows frequency, but the values are categories such as basketball, soccer, or tennis.

Grouped frequency tables for numerical data

When a data set has many values or a wide range, listing every single value may not be helpful. In that case, data is grouped into intervals. This is called a grouped frequency table.

For example, if test scores range from $0$ to $100$, you might group them into intervals such as $0\text{–}9$, $10\text{–}19$, $20\text{–}29$, and so on. Grouping makes patterns easier to see.

Suppose a class test produced these scores:

$$12, 18, 19, 23, 25, 27, 31, 34, 39, 41, 44, 46, 52, 57, 63$$

A grouped frequency table might look like this:

| Score interval | Frequency $f$ |

|---|---:|

| $10\text{–}19$ | $3$ |

| $20\text{–}29$ | $3$ |

| $30\text{–}39$ | $3$ |

| $40\text{–}49$ | $3$ |

| $50\text{–}59$ | $2$ |

| $60\text{–}69$ | $1$ |

Grouped tables are useful, but they also hide some detail. For instance, you can see that scores are spread out, but you cannot tell the exact original values inside each interval. This is why grouped frequency tables are a summary tool, not a replacement for raw data.

When creating intervals, they should usually be equal in width and not overlap. Overlapping intervals would make the table confusing or incorrect.

Cumulative frequency and what it tells us

A cumulative frequency table adds the frequencies step by step. It shows how many data values are less than or equal to a given value or interval. This is especially helpful for finding medians, quartiles, and percentiles later on.

Using the books-read example:

| Books read $x$ | Frequency $f$ | Cumulative frequency $F$ |

|---|---:|---:|

| $0$ | $1$ | $1$ |

| $1$ | $2$ | $3$ |

| $2$ | $3$ | $6$ |

| $3$ | $2$ | $8$ |

| $4$ | $4$ | $12$ |

| $5$ | $1$ | $13$ |

The cumulative frequency for $4$ books is $12$, meaning $12$ students read $4$ books or fewer.

Cumulative frequency is often shown using a graph called an ogive. An ogive helps you estimate the median, quartiles, and other positions in the data. This connects frequency tables to graphical representation, which is a major part of statistics in the IB course.

Interpreting frequency tables and avoiding mistakes

A frequency table is useful only if it is interpreted correctly. students, here are common things to check:

  • Does the total frequency equal the number of observations?
  • Are the intervals clear and non-overlapping?
  • Are labels and units included?
  • Does the table match the raw data?

Example interpretation: if a grouped frequency table shows that the interval $40\text{–}49$ has frequency $3$, then $3$ students scored in that range. It does not mean the average score was $45$, and it does not tell you the exact scores.

Another important idea is that frequency tables can be used to estimate measures such as the mean for grouped data, but the result is an estimate because the original values inside each class interval are unknown. That is why frequency tables are powerful, but they also involve some loss of detail.

Why frequency tables connect to the rest of statistics and probability

Frequency tables are not just a separate topic. They are a foundation for the whole statistics and probability section.

In data collection and statistical description, frequency tables summarize raw data. In correlation and regression, you may begin by organizing paired data into a table before looking for patterns. In conditional probability and Bayes’ theorem, frequency tables can display outcomes and help count favorable cases. In discrete and continuous probability distributions, frequencies often become probabilities after division by the total number of observations.

For example, if a survey shows that $18$ out of $60$ students prefer online learning, then the relative frequency is

$$\frac{18}{60} = 0.3$$

This can be read as a probability estimate of $0.3$ based on observed data. That is one reason frequency tables are so important: they help connect real-world data to probability models.

Frequency tables also support decision-making in everyday life. A coach might use them to count training times, a company might use them to summarize customer feedback, and a scientist might use them to organize experimental results. In each case, the table helps turn data into information that can be understood quickly.

Conclusion

Frequency tables are one of the most important tools in statistics because they organize raw data into a clear form. They help you count values, compare categories, calculate relative and cumulative frequencies, and prepare for graphs and further analysis. In IB Mathematics: Analysis and Approaches HL, they are a starting point for many later ideas in statistics and probability. If you can build and interpret frequency tables accurately, students, you are building a strong foundation for the rest of the topic βœ…

Study Notes

  • A frequency table shows how often each value, category, or interval appears.
  • Frequency means the number of times something occurs.
  • The total of all frequencies should equal the number of observations.
  • Relative frequency is given by $\frac{f}{n}$.
  • Cumulative frequency is a running total of frequencies.
  • Simple frequency tables are used for individual values or categories.
  • Grouped frequency tables are used when data covers a wide range.
  • Grouped tables reduce detail but make patterns easier to see.
  • Cumulative frequency tables help with medians, quartiles, and ogives.
  • Frequency tables connect directly to graphs, data description, probability, and later statistical methods.

Practice Quiz

5 questions to test your understanding