C2W3 How does Dropout layer work?

Hi,

This week introduced the concept of dropout as a way to address overfitting. In the practical example C2_W3_Lab_1_transfer_learning.ipynb, the network builds a classifier head on top of the mixed7 layer of InceptionV3.

The network code is the following; this is the original, unmodified lab code.
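For readers without the notebook open, here is a sketch of the head being discussed. The layer sizes and input shape follow the lab's usual setup (Dense(1024) → Dropout(0.2) → Dense(1) on a 150×150 input), so treat them as assumptions rather than an exact copy:

```python
import tensorflow as tf
from tensorflow.keras import layers, Model
from tensorflow.keras.optimizers import RMSprop

# Frozen InceptionV3 base, cut at the mixed7 layer (as set up earlier in the lab)
pre_trained_model = tf.keras.applications.InceptionV3(
    input_shape=(150, 150, 3), include_top=False, weights='imagenet')
for layer in pre_trained_model.layers:
    layer.trainable = False
last_output = pre_trained_model.get_layer('mixed7').output

x = layers.Flatten()(last_output)
x = layers.Dense(1024, activation='relu')(x)
x = layers.Dropout(0.2)(x)                    # the dropout layer in question
x = layers.Dense(1, activation='sigmoid')(x)  # binary output (cats vs. dogs)

model = Model(pre_trained_model.input, x)
model.compile(optimizer=RMSprop(learning_rate=1e-4),
              loss='binary_crossentropy',
              metrics=['accuracy'])
```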

The accuracy curves look like this:

So this matches what I expected.

But if I comment out the dropout layer:
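That is, the same head with the Dropout line disabled (a sketch of the one-line change):

```python
x = layers.Flatten()(last_output)
x = layers.Dense(1024, activation='relu')(x)
# x = layers.Dropout(0.2)(x)                  # dropout layer commented out
x = layers.Dense(1, activation='sigmoid')(x)
```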

My expectation was that the validation accuracy would go down, but instead I see this picture:

What am I missing? What is the purpose of the dropout layer here?


Hi @Anatoliy_Elsukov,
It seems your model is overfitting. When you add dropout with a 20% rate, it introduces a regularization effect by randomly dropping neurons during training. This encourages the network to learn more robust and generalizable representations. Dropout can also cause fluctuations in the accuracy values, since the model learns different patterns or focuses on different features across training iterations.
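To see the mechanism concretely: the Keras Dropout layer only acts in training mode, where it zeroes random units and rescales the survivors so the expected activation stays the same (a minimal sketch with a toy input):

```python
import tensorflow as tf

tf.random.set_seed(1)
drop = tf.keras.layers.Dropout(0.2)
x = tf.ones((1, 10))

# Training mode: each unit is zeroed with probability 0.2 and the survivors
# are scaled by 1 / (1 - 0.2) = 1.25, so the expected activation is unchanged.
print(drop(x, training=True).numpy())   # e.g. [[1.25 1.25 0. 1.25 ...]]

# Inference mode (the default): dropout is an identity op.
print(drop(x, training=False).numpy())  # [[1. 1. 1. 1. 1. 1. 1. 1. 1. 1.]]
```

This is why dropout affects the training curves but never the forward pass at evaluation time.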

Hi @carlosrl,
Thank you for your response, but it is hard to call this model overfitted: the validation accuracy is higher than the training accuracy. In any case, the two pictures are very close, and I don't see any benefit from dropout in this model.

A difference of 2% is not generally considered significant.

@Anatoliy_Elsukov, you are right. Looking at the second figure, we can see that the two graphs are very close at the end.

Hello @Anatoliy_Elsukov! You are right that dropout doesn't help much in this case. But try using dropout with a rate of 0.5 instead of 0.2, then comment out the dropout layer and compare the results (see the sketch below). You may see a significant difference.

PS: Do this experiment after submitting your assignment.
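A minimal version of the experiment, assuming the same head as above (only the rate changes):

```python
x = layers.Dropout(0.5)(x)  # heavier regularization: on average half the units dropped per step
# Train and record the curves, then comment this line out, retrain, and compare.
```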