I see that the compute function is not used in the training.
I understand that compute_gradient() computes the derivative of the loss function. Does that mean we don't need to calculate the loss at all?
Hi salih-g,
For the model to be trained, only the gradient needs to be computed: the parameter update uses the gradient, and the loss value itself never enters it. It is still informative to calculate and print the loss to see how the training is progressing.
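To make this concrete, here is a minimal sketch, assuming a simple linear model with a squared-error loss (not necessarily the model from the course). The names compute_loss, train, lr, and log_every are hypothetical and chosen for illustration; only compute_gradient() comes from the question. Note that the update step never reads the loss, so switching logging off does not change the learned parameters.

```python
def compute_loss(w, b, xs, ys):
    """Halved mean squared error of the linear model w*x + b (hypothetical helper)."""
    n = len(xs)
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / (2 * n)


def compute_gradient(w, b, xs, ys):
    """Partial derivatives of the loss above with respect to w and b."""
    n = len(xs)
    dw = sum((w * x + b - y) * x for x, y in zip(xs, ys)) / n
    db = sum((w * x + b - y) for x, y in zip(xs, ys)) / n
    return dw, db


def train(xs, ys, lr=0.1, steps=1000, log_every=0):
    """Plain gradient descent; the loss is only recorded for monitoring."""
    w, b = 0.0, 0.0
    history = []
    for step in range(steps):
        # The update needs the gradient only.
        dw, db = compute_gradient(w, b, xs, ys)
        w -= lr * dw
        b -= lr * db
        # Optional: record the loss to watch progress (assumed logging scheme).
        if log_every and step % log_every == 0:
            history.append(compute_loss(w, b, xs, ys))
    return w, b, history


if __name__ == "__main__":
    xs = [0.0, 1.0, 2.0, 3.0]
    ys = [1.0, 3.0, 5.0, 7.0]  # generated from y = 2x + 1
    w, b, history = train(xs, ys, log_every=100)
    print(w, b)
    print(history)
```

Calling train() with log_every=0 skips compute_loss entirely and returns the same w and b, which is exactly why the loss is optional for training but useful for checking that it is decreasing.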