During the fit call, I see that the loss value decreases at each step. Is it correct to assume that backprop happens after every step (one batch, usually 32 samples)? If so, isn't that inefficient? I would have expected the update to happen once at the end of the epoch, using the average of the losses from all the steps.
The loss value not only changes, it actually starts decreasing as the step counter increases within the same epoch.
There is a backprop step for every mini-batch. However, the mini-batch arrangement and the per-step backprop are NOT there to compute many loss values and then average them at the end. The loss shown at each step is only a monitoring output; it is not the reason mini-batches are used.
Here are two videos on mini-batch gradient descent (Video 1, Video 2) from the Deep Learning Specialization, Course 2 Week 2, that discuss why we train mini-batch-wise.
Calculating many losses in one epoch is an effect of mini-batch training, not the reason for it.
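To make the per-step behaviour concrete, here is a minimal sketch of what a mini-batch training loop does, written as a custom TensorFlow/Keras loop over a toy random dataset (the model, data shapes, and hyperparameters are assumptions for illustration, not what `fit()` literally runs internally): gradients are computed and the weights are updated on every batch, and the loss is only accumulated into a running mean for display.

```python
import numpy as np
import tensorflow as tf

# Hypothetical toy regression data, just to make the loop runnable.
x_train = np.random.rand(256, 20).astype("float32")
y_train = np.random.rand(256, 1).astype("float32")

model = tf.keras.Sequential([
    tf.keras.layers.Dense(64, activation="relu"),
    tf.keras.layers.Dense(1),
])
optimizer = tf.keras.optimizers.Adam()
loss_fn = tf.keras.losses.MeanSquaredError()
dataset = tf.data.Dataset.from_tensor_slices((x_train, y_train)).batch(32)

for epoch in range(2):
    running_sum, steps = 0.0, 0
    for x_batch, y_batch in dataset:
        with tf.GradientTape() as tape:
            y_pred = model(x_batch, training=True)
            loss = loss_fn(y_batch, y_pred)            # loss of THIS mini-batch only
        grads = tape.gradient(loss, model.trainable_variables)
        # Backprop + weight update happen on every step, not once per epoch.
        optimizer.apply_gradients(zip(grads, model.trainable_variables))
        running_sum += float(loss)
        steps += 1
        # The per-step number fit() prints is essentially a running mean like this,
        # kept purely for monitoring; it does not drive the updates.
        print(f"epoch {epoch} step {steps}: running mean loss = {running_sum / steps:.4f}")
```

Because the weights change after every batch, later batches in the same epoch are evaluated by an already-improved model, which is why the displayed loss tends to fall within a single epoch.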