Which of the following statements is false about gradient descent algorithms?

Question

Tresor Garcia · Accepted Answer

When GPUs are used for parallel computing, the mini-batch gradient descent (MBGD) takes less time than the stochastic gradient descent (SGD) to complete an epoch.

Tresor Garcia · Answer

Each time the global gradient updates its weight, all training samples need to be calculated.

Tresor Garcia · Answer

The global gradient descent is relatively stable, which helps the model converge to the global extremum.

Tresor Garcia · Answer

When there are too many samples and GPUs are not used for parallel computing, the convergence process of the global gradient algorithm is time-consuming.

Question list

List of questions

Question 1

(0)

Question 2

(0)

Question 3

(0)

Question 4

(0)

Question 5

(0)

Question 6

(0)

Question 7

(0)

Question 8

(0)

Question 9

(0)

Question 10

(0)

Related questions

Question 45 - H13-311_V3.5 discussion

Suggested answer: B

0 comments