Please explain why we divide a3 by keep_prob in this slide.
If possible, please explain this with an example.
Also, when we implement inverted dropout as in the slide, if we think about the 3rd layer individually for each training example, the number of units dropped out might be different, resulting in a different neural network for each training example.
I think this might be a problem, since we have to end up with a single neural network eventually.
Please explain if this is wrong.
Prof Ng discusses that point in the lecture. If you missed that, I suggest you rewind and watch it again. You can use the interactive transcript to find the relevant part of the lecture.
Here’s a previous thread about this point as well. You can read from that post forward through the thread. Interestingly, in the original paper by Geoff Hinton’s group, they don’t do it that way, and it makes things quite a bit more complicated.
Here’s another thread about it.
And here’s one that actually shows the effect on the L2-norm of the activation output.
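To sketch the basic idea with a toy example (the names a3 and keep_prob and the shapes here are just made up for illustration, not taken from the assignment): dividing by keep_prob rescales the surviving units so that the expected value of a3 stays roughly the same as it would be without dropout, which keeps the scale of the next layer's input from shrinking.

```python
import numpy as np

np.random.seed(0)
keep_prob = 0.8                               # probability of keeping each unit
a3 = np.random.rand(5, 1000)                  # hypothetical layer-3 activations

d3 = np.random.rand(*a3.shape) < keep_prob    # dropout mask
dropped  = a3 * d3                            # units zeroed, no compensation
inverted = (a3 * d3) / keep_prob              # inverted dropout

print("mean of a3 (no dropout): ", a3.mean())
print("mean after dropout only: ", dropped.mean())   # shrinks by roughly keep_prob
print("mean after / keep_prob:  ", inverted.mean())  # back near the original
```

The exact numbers will vary with the random seed, but the rescaled mean should land close to the value without dropout, whereas the unscaled version shrinks by roughly a factor of keep_prob.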
You’re right to observe that not every sample within a given batch (or minibatch) is treated the same in any given iteration. Here’s a thread that talks about that point; if you read all the way through, it also shows some experiments demonstrating that it doesn’t make much difference if you treat all the samples in the batch the same.
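To make that second point concrete, here is another small sketch (again with hypothetical shapes): the mask d3 has one column per training example, so each example in the batch does get its own pattern of dropped units on that iteration, just as you observed; the thread above discusses why that turns out not to matter much in practice.

```python
import numpy as np

np.random.seed(1)
keep_prob = 0.8
m = 4                                     # hypothetical number of examples in the batch
a3 = np.random.rand(3, m)                 # 3 hidden units, one column per example

d3 = np.random.rand(*a3.shape) < keep_prob
print(d3.astype(int))                     # columns generally differ: each example
                                          # sees a differently "thinned" network

a3 = (a3 * d3) / keep_prob                # inverted dropout applied per example
```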