It appears that depth-wise separable convolution saves computational cost by avoiding redundant/repeated dot products, without any trade-off in performance. My question, then, is: why don't we use depth-wise separable convolutions in general, in any architecture that employs CNNs? Isn't it the most cost-efficient option? Also, why the specific MobileNet v1 architecture, since depth-wise separable convolution is seemingly not limited to or affected by the choice of architecture?
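To make the cost saving concrete, here is a quick back-of-the-envelope sketch counting multiplications for both kinds of convolution (the function names are my own, just for illustration; sizes match a 10x10x3 input, 3x3 kernels, and 256 output channels giving an 8x8 output):

```python
def standard_conv_mults(h_out, w_out, k, c_in, c_out):
    # Each of the c_out filters does a k*k*c_in dot product
    # at every one of the h_out*w_out output positions.
    return h_out * w_out * k * k * c_in * c_out

def separable_conv_mults(h_out, w_out, k, c_in, c_out):
    # Depth-wise step: one k*k filter per input channel.
    depthwise = h_out * w_out * k * k * c_in
    # Point-wise step: a 1x1 conv that mixes the c_in channels
    # into c_out output channels.
    pointwise = h_out * w_out * c_in * c_out
    return depthwise + pointwise

std = standard_conv_mults(8, 8, 3, 3, 256)   # 442368
sep = separable_conv_mults(8, 8, 3, 3, 256)  # 50880
print(std / sep)                              # roughly 8.7x cheaper
```

The ratio works out to roughly 1/c_out + 1/k^2, so the saving grows with the number of output channels and the kernel size.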
I may be wrong, but here is what I think. Let's say we convolve a 10x10x3 image with 3x3x3 filters to get an 8x8x256 output.
In a normal conv, we have 256 different 3x3x3 filters. In the depth-wise separable conv, we have only one 3x3x3 filter (applied channel-wise) and 256 filters of size 1x1x3.
That means the normal conv has many more parameters to train than the separable one. In many image-processing applications we do need more parameters (weights) to capture different features, even though that makes training take longer.
Yes, I think the same. Since MobileNet has fewer parameters, and the channel mixing is done with just a 1x1xn_c filter, I feel it could put a performance ceiling on the neural network.
But aren’t those the same parameters being reused repeatedly across the image?