Topic 3: Probability Distributions

Lesson 3.1: Discrete And Continuous Random Variables

Official syllabus section covering Lesson 3.1: Discrete and continuous random variables within Topic 3: Probability Distributions: Terms for variability: random, discrete, continuous, dependent and independent variables.; Calculating probabilities and finding expected values, variances and standard deviations for discrete distributions defined in words, in a table or by a probability function..

Lesson 3.1: Discrete and Continuous Random Variables

Introduction

Welcome to Lesson 3.1 of A-Level Statistics, where we will explore the fundamental concepts of discrete and continuous random variables. This lesson aims to provide a thorough understanding of these concepts, leading you to confidently calculate probabilities, expected values, variances, and standard deviations for different kinds of distributions. By the end of this lesson, you should be able to classify variables, perform calculations, and apply your knowledge to real-world scenarios.

Learning Objectives

  • Understand terms related to variability: random, discrete, continuous, dependent, and independent variables.
  • Calculate probabilities, expected values, variances, and standard deviations for discrete distributions.
  • Recognize properties of continuous distributions, specifically the rectilinear (uniform) distribution.
  • Find $E(X)$, $Var(X)$, and the standard deviation for discrete random variables given by tables or functions.
  • Model real-world situations with discrete random variables.

Understanding Random Variables

A random variable is a numeric outcome of a random phenomenon. It is a function that assigns a real number to each possible outcome of a random process. Random variables can be classified into two main types: discrete and continuous.

Discrete Random Variables

A discrete random variable is one that can take on a countable number of distinct values. Examples include the number of students in a classroom, the result of rolling a die, or the number of heads obtained when flipping a coin multiple times.

Example of Discrete Random Variable

Consider the random variable $X$ representing the number of red balls drawn from a bag containing 3 red balls and 7 blue balls when drawing without replacement. The possible outcomes for $X$ are $0, 1, 2, or $3$, making it a discrete random variable.

Calculating Probabilities for Discrete Random Variables

To calculate probabilities for a discrete random variable, you can create a probability mass function (PMF). This function gives the probabilities of each possible value of the discrete random variable.

Worked Example: PMF Calculation

Suppose we roll a fair six-sided die. Let $X$ be the outcome of the roll. The PMF is as follows:

  • $P(X = 1) = \frac{1}{6}$
  • $P(X = 2) = \frac{1}{6}$
  • $P(X = 3) = \frac{1}{6}$
  • $P(X = 4) = \frac{1}{6}$
  • $P(X = 5) = \frac{1}{6}$
  • $P(X = 6) = \frac{1}{6}$

We can see that each outcome has an equal probability of occurring.

Continuous Random Variables

A continuous random variable, in contrast, can take on any value within a given range or interval. It often represents measurements like height, weight, or temperature. Because the number of possible values is infinite, we work with probability density functions (PDFs) instead of PMFs.

Properties of Continuous Distributions

  1. Probability Density Function (PDF): For a continuous random variable, the probability of the variable falling within a specific interval is determined by the area under the curve of its PDF. The total area under the curve equals 1.
  2. No Individual Probabilities: Unlike discrete variables, we cannot assign probabilities to exact values. Instead, we consider the probability over an interval. For example, the probability that $X$ is between $a$ and $b$ can be expressed as:

$$P(a < X < b) = \int_{a}^{b} f(x) \, dx$$

  1. Cumulative Distribution Function (CDF): The CDF of a continuous random variable $X$ is defined as the probability that $X$ takes on a value less than or equal to $x$:

$$F(x) = P(X \leq x) = \int_{-\infty}^{x} f(t) \, dt$$

Worked Example: Uniform Distribution

Consider a uniform distribution where a continuous random variable $X$ can take values from $0$ to $10$. The PDF for a uniform distribution is constant within the interval and is given by:

$$f(x) = \begin{cases} \frac{1}{b-a} & \text{for } a \leq x \leq b \ 0 & \text{otherwise} \end{cases}$$

For our example, $a = 0$ and $b = 10$, thus:

$$f(x) = \begin{cases} \frac{1}{10} & \text{for } 0 \leq x \leq 10 \ 0 & \text{otherwise} \end{cases}$$

To find the probability that $X$ falls between 3 and 7, we compute:

$$P(3 < X < 7) = \int_{3}^{7} \frac{1}{10} \, dx = \frac{1}{10} \cdot (7 - 3) = \frac{4}{10} = 0.4$$

Expected Value, Variance, and Standard Deviation

The expected value $E(X)$ of a random variable provides a measure of the center of its distribution. The variance $Var(X)$ measures the spread of the variable around the expected value, and the standard deviation is the square root of the variance.

Calculating Expected Value, Variance, and Standard Deviation for Discrete Random Variables

If $X$ is a discrete random variable with possible values $x_1, x_2, ..., x_n$ and corresponding probabilities $p_1, p_2, ..., p_n$, the formulas are as follows:

$$E(X) = \sum_{i=1}^{n} x_i \cdot p_i$$

$$Var(X) = E(X^2) - (E(X))^2$$

Where:

$$E(X^2) = \sum_{i=1}^{n} x_i^2 \cdot p_i$$

Worked Example: Expectation Calculation for $X$

Using the earlier example of rolling a die:

  • Possible values: $1, 2, 3, 4, 5, 6
  • Corresponding probabilities: $p = \frac{1}{6}$ for each value.

First, we would calculate:

$$E(X) = \sum_{i=1}^{6} i \cdot \frac{1}{6} = \frac{1}{6}(1 + 2 + 3 + 4 + 5 + 6) = \frac{21}{6} = 3.5$$

Next, we calculate $E(X^2)$:

$$E(X^2) = \sum_{i=1}^{6} i^2 \cdot \frac{1}{6} = \frac{1}{6}(1 + 4 + 9 + 16 + 25 + 36) = \frac{91}{6}$$

Finally, we find the variance:

$$Var(X) = E(X^2) - (E(X))^2 = \frac{91}{6} - (3.5)^2 = \frac{91}{6} - \frac{49}{6} = \frac{42}{6} = 7$$

And the standard deviation:

$$SD(X) = \sqrt{Var(X)} = \sqrt{7}$$

Summary of Key Concepts

  1. Random Variables: Numeric outcomes of random phenomena categorized as discrete or continuous.
  2. Discrete Variables: Countable possible outcomes with a defined PMF.
  3. Continuous Variables: Uncountable outcomes with a defined PDF.
  4. Calculating Moments: Expectation, variance, and standard deviation provide insight into the behavior of random variables.

Conclusion

In this lesson, we have delved deeply into discrete and continuous random variables, their properties, how to calculate probabilities using PMFs and PDFs, and how to find the expected value, variance, and standard deviation. Understanding these concepts is essential for further studies in statistics and for applying probability theory to real-world situations.

Study Notes

  • Random Variable: A numeric outcome from a random process.
  • Discrete Random Variable: A variable with countable outcomes.
  • Continuous Random Variable: A variable with uncountable outcomes within an interval.
  • Probability Mass Function (PMF): Function for discrete random variables indicating the probability of each outcome.
  • Probability Density Function (PDF): Function for continuous random variables representing probabilities over intervals.
  • Expected Value $E(X)$: Average or mean of random variable values.
  • Variance $Var(X)$: Measure of the spread of the random variable around its expected value.
  • Standard Deviation $SD(X)$: Square root of the variance indicating the dispersion of data points.

Practice Quiz

5 questions to test your understanding

Lesson 3.1: Discrete And Continuous Random Variables — A-Level Statistics | A-Warded