C4 W2 MobileNet Architecture

Is it right to say that in this MobileNet v2 Bottleneck, the Expansion and Pointwise operations are 1x1 Convolution Operations?

Also, can the same be said for Depthwise Separable Convolution, where the first step is a Depthwise Convolution and the second step is a 1x1 Convolution, a.k.a. Pointwise Convolution?

Is this right?

Hey @Ammar_Jawed,

Yes, you are correct!

In MobileNetV2:

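  • Expansion and Pointwise: Both are 1x1 convolutions. The expansion step increases the number of channels, and the pointwise (projection) step at the end of the bottleneck reduces it again.
  • Depthwise Separable Convolution: It involves a Depthwise Convolution (e.g., 3x3 or 5x5), which filters each channel spatially on its own, followed by a Pointwise Convolution (1x1 Convolution), which mixes information across channels.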
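
If it helps to see this in numbers, here is a rough sketch in plain Python that counts the weights in each step. The function names are made up for illustration, the channel sizes (32, 64, 24) and the expansion factor of 6 are just example values, and biases and batch norm are ignored. The only point is that the expansion and pointwise/projection steps are ordinary 1x1 convolutions, so they can be counted with the same formula as any other convolution.

```python
def conv_params(k, c_in, c_out):
    """Weights in a standard k x k convolution (bias ignored)."""
    return k * k * c_in * c_out


def depthwise_params(k, channels):
    """Weights in a depthwise convolution: one k x k filter per channel."""
    return k * k * channels


def pointwise_params(c_in, c_out):
    """A pointwise convolution is just a 1x1 convolution."""
    return conv_params(1, c_in, c_out)


def separable_params(k, c_in, c_out):
    """Depthwise separable convolution: depthwise step, then pointwise step."""
    return depthwise_params(k, c_in) + pointwise_params(c_in, c_out)


def bottleneck_params(k, c_in, c_out, expansion):
    """MobileNetV2-style bottleneck: 1x1 expansion, depthwise, 1x1 projection."""
    c_mid = c_in * expansion                  # channels after expansion
    expand = pointwise_params(c_in, c_mid)    # 1x1 conv, more channels
    depthwise = depthwise_params(k, c_mid)    # k x k filter per channel
    project = pointwise_params(c_mid, c_out)  # 1x1 conv, fewer channels
    return expand + depthwise + project


# Example values only (assumed for illustration).
standard = conv_params(3, 32, 64)
separable = separable_params(3, 32, 64)
bottleneck = bottleneck_params(3, 24, 24, 6)

print("standard 3x3 conv:      ", standard)
print("depthwise separable:    ", separable)
print("bottleneck (expansion 6):", bottleneck)
```

With these example sizes, the depthwise separable version needs far fewer weights than the standard 3x3 convolution, which is the main reason MobileNet uses it.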