4. Topic 4(COLON) Averages and Measures of Location

Lesson 4.2: The Median

Official syllabus section covering Lesson 4.2: The median within Topic 4: Averages and Measures of Location: Finding the median by ordering the data and locating the middle value.; Handling an even number of values by averaging the middle two..

Lesson 4.2: The Median

Introduction

In this lesson, students, we will explore the concept of the median, an essential measure of central tendency in statistics. The median represents the middle value in a dataset when it is ordered, and it helps us understand the data's distribution without being skewed by extreme values. By the end of this lesson, you will be able to:

  • Find the median by ordering the data and identifying the middle value.
  • Handle a dataset with an even number of values by averaging the two middle values.
  • Estimate the median from cumulative frequency distributions.
  • Understand why the median is a robust measure that resists the effect of extreme values.
  • Calculate the median for datasets containing both odd and even counts.

The Median

The median is a measure that indicates the middle point of a dataset, where half the values are above it, and half the values are below it. Unlike the mean, which can be affected by outliers, the median offers a more reliable summary for skewed distributions. To find the median, we follow a systematic approach:

  1. Order the Data: First, we need to sort the dataset in either ascending or descending order.
  2. Locate the Middle Value:
  • If the number of observations (n) is odd, the median is the value located at position $\frac{n + 1}{2}$.
  • If the number of observations is even, the median is the average of the two middle values located at positions $\frac{n}{2}$ and $\frac{n}{2} + 1$.

Example 1: Finding the Median for Odd Count

Consider the dataset: 3, 7, 1, 4, 9.

Step 1: Order the Data

Ordered dataset: 1, 3, 4, 7, 9.

Step 2: Count the Observations

Here, $n = 5$, which is odd.

Step 3: Find the Median

The median position is $\frac{5 + 1}{2} = 3$.

The median is the 3rd value: 4.

Example 2: Finding the Median for Even Count

Now consider the dataset: 2, 5, 1, 8.

Step 1: Order the Data

Ordered dataset: 1, 2, 5, 8.

Step 2: Count the Observations

Here, $n = 4$, which is even.

Step 3: Find the Median

The two middle positions are $\frac{4}{2} = 2$ and $\frac{4}{2} + 1 = 3$.

The median is the average of the 2nd and 3rd values:

$$\text{Median} = \frac{2 + 5}{2} = \frac{7}{2} = 3.5$$

Cumulative Frequency and the Median

Cumulative frequency is a useful tool when dealing with large datasets or when data is presented in a grouped format. Cumulative frequency helps us understand how many observations fall below a certain value, allowing us to estimate the median effectively.

Example 3: Estimating the Median from Cumulative Frequency

Consider the following cumulative frequency table:

Value (x)Cumulative Frequency
13
28
312
415
520

Step 1: Find the Total Observations

The total observations is 20, thus $n = 20$ (even).

Step 2: Determine the Median Position

The median position is:

$$\frac{20}{2} = 10$$

Step 3: Identify the Median from the Cumulative Frequency

Looking at the cumulative frequency, we see:

  • The value at position 10 falls between 2 (cumulative frequency of 8) and 3 (cumulative frequency of 12).
  • Thus, the median is 3.

The Robustness of the Median

One notable characteristic of the median is its resilience against outliers. While the mean can be significantly affected by extreme values, the median remains an accurate measure of central tendency regardless of them. For instance:

Example 4: Effect of Outliers

Consider the dataset: 1, 2, 3, 100.

  • Mean:

$$\text{Mean} = \frac{1 + 2 + 3 + 100}{4} = \frac{106}{4} = 26.5$$

  • Median:

Ordered dataset: 1, 2, 3, 100.

Here, $n = 4$ (even), thus:

$$\text{Median} = \frac{2 + 3}{2} = 2.5$$

The mean is greatly affected by the outlier (100), resulting in a misleading central value. In contrast, the median provides a more accurate representation of the data's central tendency, which is indicated by most of the values being concentrated around 1, 2, and 3.

Conclusion

In summary, students, the median provides a valuable summary of a dataset, particularly when the data contains outliers or is not symmetrically distributed. By learning how to compute the median correctly and understanding its significance, you can effectively summarize and interpret data in various contexts.

Study Notes

  • Definition of Median: The middle value of an ordered dataset.
  • Procedure for Finding Median: 1) Order the dataset, 2) Identify the middle value or average the two middle values based on whether n is odd or even.
  • Cumulative Frequency: Can be used to estimate the median in grouped data.
  • Robustness: The median is not affected by extreme values in the dataset, making it a reliable measure of central tendency.

Practice Quiz

5 questions to test your understanding