There was a concept of a bottleneck layer to reduce the computation in Inception nets. The example used a 1x1 conv to reduce n_C from 192 to 16, then a 5x5 conv with 32 filters to bring it back up to n_C = 32. Why did he use n_C = 16 rather than 32 or any other number? I get that he started at 192 and the end goal was 32, so that's fine by me, but the bottleneck n_C could be anything. What does the choice depend on? Thank you.
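For concreteness, here is the cost arithmetic as I understand it, assuming the 28x28 spatial size from the lecture example (the channel numbers are the ones above):

```python
# Multiply counts for the example, assuming a 28x28 spatial size
# (taken from the lecture; adjust H, W if your input differs).
H, W = 28, 28

def conv_mults(h, w, n_filters, k, n_c_in):
    # Each output value needs k * k * n_c_in multiplications.
    return h * w * n_filters * k * k * n_c_in

direct = conv_mults(H, W, 32, 5, 192)        # 5x5 conv straight from 192 channels
bottleneck = (conv_mults(H, W, 16, 1, 192)   # 1x1 conv: 192 -> 16
              + conv_mults(H, W, 32, 5, 16)) # 5x5 conv: 16 -> 32

print(f"direct:     {direct:,}")       # 120,422,400
print(f"bottleneck: {bottleneck:,}")   # 12,443,648, roughly a 10x saving
```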
It's an interesting question. Normally you would choose a power of 2, since that maps nicely onto digital (binary) hardware. It's a bottleneck, so it has to be less than 32; it could be 16, 8, or 4, but the lower this number, the fewer trainable parameters you have, so I would guess 16 is a reasonable compromise… (these are just my thoughts).
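To put rough numbers on that, here is a quick sketch of the trainable parameter count of the 1x1 → 5x5 pair for different bottleneck widths, using the 192 input channels and 32 output filters from the example (bias terms included):

```python
# Trainable parameters of the 1x1 -> 5x5 conv pair for several
# bottleneck widths (192 input channels, 32 output filters).
for b in (4, 8, 16, 32):
    p_1x1 = 1 * 1 * 192 * b + b   # 1x1 conv: 192 -> b, plus b biases
    p_5x5 = 5 * 5 * b * 32 + 32   # 5x5 conv: b -> 32, plus 32 biases
    print(f"bottleneck {b:>2}: {p_1x1 + p_5x5:,} parameters")
# 4 -> 4,004   8 -> 7,976   16 -> 15,920   32 -> 31,808
```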
Well, maybe the point is that lowering computational cost is not the only goal here. You lose information if you make the bottleneck layer too small. If you make it super cheap, but the "reinflation" layer doesn't have enough information to learn all the details it needs, then you don't get a good enough result.
That would be my interpretation of Gent’s statement here:
Like all hyperparameter choices in ML/DL, you have to experiment to find the “Goldilocks” values. They must have tried some experiments and decided that 16 is “just right” or “close enough”.
But we are doing science here, so if you are ever using an Inception Network to solve a real problem, you can run this experiment yourself and see what happens with your datasets if you use 8 or 4 as the number of filters in the bottleneck layer instead of 16.
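If you want to try that, here is a minimal Keras sketch of the 1x1 → 5x5 pair, assuming a 28x28x192 input as in the lecture example; `bottleneck_block` is just a name I made up for illustration:

```python
import tensorflow as tf
from tensorflow.keras import layers

def bottleneck_block(x, n_bottleneck=16, n_out=32):
    """1x1 'bottleneck' conv followed by a 5x5 conv, as in the lecture."""
    x = layers.Conv2D(n_bottleneck, 1, padding="same", activation="relu")(x)
    x = layers.Conv2D(n_out, 5, padding="same", activation="relu")(x)
    return x

inputs = tf.keras.Input(shape=(28, 28, 192))
outputs = bottleneck_block(inputs, n_bottleneck=16)  # swap in 8 or 4 here
model = tf.keras.Model(inputs, outputs)
model.summary()  # compare parameter counts across bottleneck widths
```

Training this block inside your own network with 4, 8, and 16 bottleneck filters and comparing accuracy is exactly the kind of experiment that would tell you whether 16 is "just right" for your dataset.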