1. Data Collection
Data Ethics — Quiz
Test your understanding of data ethics with 5 practice questions.
Practice Questions
Question 1
A researcher is conducting a study on the effectiveness of a new drug. They collect extensive patient data, including medical history, genetic information, and treatment responses. To ensure patient privacy while allowing for future research, the team decides to pseudonymise the data. Which of the following statements best describes the nature of pseudonymised data?
Question 2
A social media platform collects vast amounts of user data, including posts, likes, and location information. The platform's terms of service state that this data will be used to 'improve user experience.' However, the company later decides to sell this data, in an aggregated and anonymised form, to third-party advertisers for targeted marketing. Which ethical principle is most directly challenged by this action?
Question 3
In a longitudinal study tracking the health of individuals over several decades, researchers collect highly sensitive medical and lifestyle data. To mitigate the risk of re-identification over such a long period, which advanced anonymisation technique would be most effective in maintaining privacy while still allowing for meaningful statistical analysis?
Question 4
A government agency is developing a new algorithm to predict the likelihood of individuals defaulting on loans. The algorithm is trained on historical data that disproportionately represents certain demographic groups as higher risk, even when controlling for other financial factors. If this algorithm is deployed, which ethical principle is most likely to be violated?
Question 5
A researcher is conducting a study that involves collecting genetic data from participants. Before collecting any samples, the researcher provides a detailed consent form explaining the purpose of the study, the types of data to be collected, how the data will be stored, and the potential risks and benefits. However, the consent form is written in highly technical jargon, making it difficult for the average participant to understand. Which aspect of informed consent is primarily compromised in this situation?
