Training examples with normalized features can speed up gradient descent compared to unnormalized features.
Increasing the number of training examples can improve the accuracy of prediction of a learning model.
Question:
By increasing the number of training examples with normalized features, will the gradient descent be speeding up or slowing down when finding for the minimum point of cost function?