9. Topic 9(COLON) Descriptive Statistics and Data Handling

Lesson 9.2: Representing Data

Official syllabus section covering Lesson 9.2: Representing data within Topic 9: Descriptive Statistics and Data Handling: Frequency tables, bar charts, histograms and pie charts.; Cumulative-frequency curves and box plots..

Lesson 9.2: Representing Data

Introduction

In this lesson, we will explore the various methods of representing data through visual aids. Understanding how to represent data accurately is crucial for interpreting quantitative information efficiently. After this lesson, you will be able to construct and interpret frequency tables, bar charts, histograms, pie charts, cumulative-frequency curves, and box plots. You will also learn to choose the most appropriate diagram for a given data set. By developing your skills in data representation, you'll be better equipped to communicate your findings and insights effectively.

Lesson Objectives

  • Understand frequency tables, bar charts, histograms, and pie charts.
  • Learn about cumulative-frequency curves and box plots.
  • Select the diagram that best suits a data set.
  • Construct appropriate diagrams for given data sets.
  • Read and interpret information from cumulative-frequency curves and box plots.

Frequency Tables

What is a Frequency Table?

A frequency table is a way to organize data to show how often each value occurs. It is particularly useful for categorical data, where each category's frequency is counted. A frequency table helps to identify patterns and trends in the data.

Example of a Frequency Table

Let’s say we have the following data representing the number of pets owned by a group of 20 students:

  • 1, 2, 1, 3, 2, 0, 2, 1, 4, 1, 2, 3, 2, 1, 0, 1, 0, 3, 2, 3

To create a frequency table, we first list each unique number of pets and count how many times each occurs:

Number of PetsFrequency
03
16
26
34
41

How to Create a Frequency Table

  1. Identify the Data: Gather the data you want to represent.
  2. List Unique Values: Determine the unique values or categories in the data set.
  3. Count Frequencies: Count how many times each unique value appears in the data.
  4. Organize into a Table: Create a table with two columns: one for the unique values and the other for their corresponding frequencies.

Bar Charts

What is a Bar Chart?

A bar chart is a graphical representation of categorical data using bars of different heights or lengths. It is used to compare different categories against one another. Bar charts can be either vertical or horizontal.

Constructing a Bar Chart

Using the frequency table from the previous example, we can now create a bar chart to visualize the data.

  1. Draw the Axes: Label the x-axis (categories) and the y-axis (frequencies).
  2. Choose the Bar Width: Decide the width of each bar based on the data.
  3. Plot the Bars: For each category, draw a bar that corresponds to its frequency.

Worked Example

Let’s construct the bar chart based on the frequency table:

  • For zero pets, draw a bar up to 3 on the y-axis.
  • For one pet, draw a bar up to 6, and so on.

![Bar Chart Example]

Common Misconceptions

Many students believe that the length of the bar in a bar chart does not have to be proportional to the frequency. This is incorrect; the height of each bar must accurately represent the data being depicted.

Histograms

What is a Histogram?

A histogram is similar to a bar chart but is used for continuous data divided into intervals, or bins. The height of each bin represents the frequency of data points within that range.

Constructing a Histogram

To create a histogram, follow these steps:

  1. Choose the intervals: Determine appropriate intervals for the data.
  2. Count the frequencies within each interval: Just like with a frequency table, count how many data points fall into each bin.
  3. Draw the Histogram: On the x-axis, place the intervals, and on the y-axis, the frequencies. Each bar should touch the next, as they represent continuous data.

Worked Example

Consider the following continuous data representing the heights (in cm) of 20 students:

  • 150, 152, 153, 154, 155, 155, 157, 158, 160, 160, 162, 163, 165, 165, 167, 169, 170, 170, 172, 175
  1. Choose Intervals: For example, we can choose intervals of 5 cm (150-155, 156-160, etc.).
  2. Count Frequencies:
  • 150-155: 5
  • 156-160: 6
  • 161-165: 5
  • 166-170: 3
  • 171-175: 1
  1. Plotting the Histogram:
  • The base of each bar represents the range of heights, and the height represents frequency.

Common Misconceptions

A common misconception is that histograms can be used interchangeably with bar charts. While both display frequency, histograms are specifically designed for continuous data, whereas bar charts are used for categorical data.

Pie Charts

What is a Pie Chart?

A pie chart is a circular graph divided into slices to illustrate numerical proportions. Each slice represents a category's contribution to the total.

How to Draw a Pie Chart

To create a pie chart:

  1. Calculate Total: Find the total sum of frequencies.
  2. Calculate Angle for Each Slice: The measurement for each slice in degrees is given by:

$$ \text{Angle} = \left( \frac{\text{Frequency of the category}}{\text{Total frequency}}

ight) $\times 360^{\circ}$ $$

  1. Draw the Chart: Use a protractor to measure the angles and fill in the slices according to their proportions.

Worked Example

Using our pets data:

$- Total frequency = 20$

  • Calculating angles:
  • For 0 pets: $ $\left($ $\frac{3}{20}

ight) $\times 360^{\circ}$ = 54^{$\circ$}

  • For 1 pet: $ $\left($ $\frac{6}{20}

ight) $\times 360^{\circ}$ = 108^{$\circ$}

  • For 2 pets: $ $\left($ $\frac{6}{20}

ight) $\times 360^{\circ}$ = 108^{$\circ$}

  • For 3 pets: $ $\left($ $\frac{4}{20}

ight) $\times 360^{\circ}$ = 72^{$\circ$}

  • For 4 pets: $ $\left($ $\frac{1}{20}

ight) $\times 360^{\circ}$ = 18^{$\circ$}

Common Misconceptions

A common misunderstanding is that pie charts are suitable for showing changes over time. In fact, pie charts are best for showing parts of a whole at a single point in time, not for displaying trends.

Cumulative-Frequency Curves

What is a Cumulative-Frequency Curve?

A cumulative-frequency curve, or ogive, is a graph that represents the cumulative frequency of a data set. It helps to understand the distribution of data and can show how many data points fall below a particular value.

Constructing a Cumulative-Frequency Curve

  1. Create a cumulative frequency table: Add the frequencies cumulatively as you progress through the data points.
  2. Plot Points: For each data point, plot the cumulative frequency against the upper boundary of the interval.
  3. Draw the Curve: Connect the points with a smooth curve.

Worked Example

Using the height data:

  1. Cumulative Frequency Table:
IntervalFrequencyCumulative Frequency
150-15555
156-160611
161-165516
166-170319
171-175120
  1. Plot the Points: For example, the point for the interval 150-155 is plotted at (155, 5).
  2. Draw the Curve: Connect the points smoothly.

Common Misconceptions

Students often think the curve must always start at the origin. However, the start point depends on the data's range and does not need to begin at zero.

Box Plots

What is a Box Plot?

A box plot, or whisker plot, is a standardized way of displaying data based on a five-number summary: minimum, first quartile (Q1), median (Q2), third quartile (Q3), and maximum. It is excellent for visualizing the spread and skewness of data.

Constructing a Box Plot

  1. Find the Five-Number Summary: Calculate the minimum, Q1, median, Q3, and maximum.
  2. Draw the Box: Create a box from Q1 to Q3, and divide it with a line at the median.
  3. Add Whiskers: Extend lines (whiskers) from the box to the minimum and maximum values.

Worked Example

Using the height data:

  1. Five-Number Summary:

$ - Minimum = 150$

$ - Q1 = 156$

$ - Median = 162$

$ - Q3 = 169$

$ - Maximum = 175$

  1. Plotting the Box: Draw a box from 156 to 169 and a line at 162.
  2. Adding Whiskers: Draw lines from the box to 150 and 175.

Common Misconceptions

A misconception is that box plots cannot show distribution shapes. While box plots do not convey detailed shapes, they effectively show variations and comparisons in medians and quartiles between different data sets.

Conclusion

In this lesson, you learned various methods for representing data, including frequency tables, bar charts, histograms, pie charts, cumulative-frequency curves, and box plots. Each method has its strengths and is suited to particular types of data. It is essential to choose the diagram that best fits the dataset to communicate your findings clearly and accurately. With practice, you'll become proficient in both creating and interpreting these data representations.

Study Notes

  • Frequency Tables: Organize data by counting occurrences.
  • Bar Charts: Compare categorical data using bars.
  • Histograms: Represent continuous data in intervals.
  • Pie Charts: Show proportions of a whole at a single point in time.
  • Cumulative-Frequency Curves: Show how data is distributed across a range.
  • Box Plots: Visualize data spread and identify outliers using a five-number summary.
  • Choosing Diagrams: Always select the most appropriate representation for clarity and accuracy.

Practice Quiz

5 questions to test your understanding