1. Descriptive Statistics
Data Cleaning — Quiz
Test your understanding of data cleaning with 5 practice questions.
Practice Questions
Question 1
Which of the following is a key step in data cleaning that involves identifying and addressing inconsistencies in data formats or values?
Question 2
When dealing with a dataset where a numerical variable has values that are clearly outside the expected range (e.g., negative ages), what is the most appropriate initial action during data cleaning?
Question 3
A dataset contains a variable 'Temperature' with entries in both Celsius and Fahrenheit, but without a clear unit indicator. What data cleaning technique is most suitable to ensure consistency?
Question 4
If a dataset has a column 'Customer ID' where some entries are missing, and these missing values are not random but systematically absent for a specific group of customers, what type of missing data is this likely to be?
Question 5
When preparing data for statistical analysis, what is the primary benefit of converting categorical variables into numerical representations (e.g., using one-hot encoding or label encoding)?
