In inverted dropout, we want some units to be zeroed out so that the effective complexity of the neural network decreases.
After you multiply a3 by d3 (a random boolean matrix whose entries are True where a uniform random number is less than keep_prob), you get a matrix a3 with some elements randomly zeroed out; a zero at a given position means that particular hidden unit has been eliminated.
But the reason behind the scaling (a3 /= 0.8, i.e. a3 /= keep_prob) is that zeroing out units lowers the expected value of a3, so every value computed from it downstream (e.g. z4 = W4·a3 + b4) would shrink as well, and that should not be the case; dividing by keep_prob restores the expected value of the activations.
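To make sure I have the mechanics right, here is a minimal NumPy sketch of the two steps (the 4x5 shape, the seed, and the Monte Carlo check at the end are my own additions for illustration; keep_prob = 0.8 as above):

```python
import numpy as np

np.random.seed(0)            # illustrative, for reproducibility
keep_prob = 0.8              # probability of keeping a hidden unit

# a3: activations of layer 3; the 4x5 shape is made up for this example
a3 = np.random.rand(4, 5)
a3_original = a3.copy()

# d3: boolean mask, True (unit kept) with probability keep_prob
d3 = np.random.rand(*a3.shape) < keep_prob

a3 = a3 * d3                 # zero out roughly (1 - keep_prob) of the units
a3 = a3 / keep_prob          # scale the surviving units by 1/keep_prob

# Monte Carlo check: averaged over many random masks, the dropped-and-scaled
# activations have (approximately) the same mean as the original activations.
means = [
    (a3_original * (np.random.rand(4, 5) < keep_prob) / keep_prob).mean()
    for _ in range(10_000)
]
print(a3_original.mean())    # mean without dropout
print(np.mean(means))        # should be close to the value above
```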
Please correct me if I am wrong!