4. Health Data Analytics
Data Quality — Quiz
Test your understanding of data quality with five practice questions.
Practice Questions
Question 1
In a patient dataset containing 12,000 records, 300 records are identified as exact duplicates. What is the duplication rate as a percentage?
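As a hedged illustration of the arithmetic behind a question like this, the sketch below computes a duplication rate on a small toy dataset rather than on the figures in the question. It assumes that the rate is the number of redundant copies (every occurrence after the first) divided by the total number of records; other definitions exist, so treat this as one possible convention. The function name `duplication_rate` and the sample records are hypothetical and used only for illustration.

```python
from collections import Counter


def duplication_rate(records):
    """Return the percentage of records that exactly repeat an earlier record.

    Assumption: a "duplicate" is any copy beyond the first occurrence,
    and the denominator is the total number of records.
    """
    if not records:
        return 0.0
    counts = Counter(records)
    # Each distinct record contributes (occurrences - 1) redundant copies.
    redundant = sum(n - 1 for n in counts.values())
    return 100.0 * redundant / len(records)


# Hypothetical toy data: (patient_id, birth_date, sex)
toy_records = [
    ("P001", "1980-02-14", "F"),
    ("P002", "1975-11-03", "M"),
    ("P001", "1980-02-14", "F"),  # exact repeat of the first record
    ("P003", "1990-07-22", "F"),
    ("P004", "1968-01-09", "M"),
    ("P002", "1975-11-03", "M"),  # exact repeat of the second record
    ("P005", "2001-05-30", "F"),
    ("P006", "1959-12-17", "M"),
]

print(f"{duplication_rate(toy_records):.1f}%")
```

The same ratio applies to the question's numbers: divide the count of duplicate records by the total record count and multiply by 100.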
Question 2
Comparing diagnosis codes in a dataset against the SNOMED CT controlled vocabulary assesses which data quality dimension?
Question 3
In a boxplot used for detecting outliers in numeric health data, which rule defines the whiskers?
Question 4
Lab test values are missing more frequently in sicker patients, and missingness depends on other observed variables. Which imputation method is most appropriate?
Question 5
Which key performance indicator (KPI) would best measure the completeness dimension in a data quality dashboard for patient records?
