Question 1
Which of the following best describes a key advantage of Apache Spark over Hadoop MapReduce in terms of in-memory processing?
Question 2
In a distributed Spark application, what is the role of the driver program?
Question 3
Which of the following is an example of a situation where Hadoop MapReduce is more suitable than Apache Spark?
Question 4
Which cloud-managed service is most closely aligned with providing a fully managed Hadoop and Spark ecosystem?
Question 5
In a Spark application, what is the purpose of the RDD lineage graph?