Question 1
Which of the following best describes the concept of convexity in optimization?
Question 2
In the context of gradient descent, what is the effect of a low learning rate?
Question 3
What is the primary purpose of using momentum in gradient descent?
Question 4
Which of the following describes stochastic gradient descent (SGD)?
Question 5
What is the effect of using a high learning rate in gradient descent?