C1W1_Difference between GlobalAveragePooling and AveragePooling

Maverick06 · December 20, 2021, 5:54pm

In the DenseNet architecture we used GlobalAveragePooling2D. Can someone explain the difference between GlobalAveragePooling2D and AveragePooling2D?

Thanks.

ai_curious · December 20, 2021, 6:22pm

You can see the entire list of pooling layers here: [Pooling layers]

If you compare the arguments accepted by AveragePooling2D [AveragePooling2D layer] with those accepted by GlobalAveragePooling2D [GlobalAveragePooling2D layer], you will see that the former accepts a pool_size argument:
pool_size: integer or tuple of 2 integers, factors by which to downscale

GlobalAveragePooling2D does not.

Those linked pages also give the output shapes for the two layers.

Average:
Output shape

If data_format='channels_last': 4D tensor with shape (batch_size, pooled_rows, pooled_cols, channels).
If data_format='channels_first': 4D tensor with shape (batch_size, channels, pooled_rows, pooled_cols).

Global:
Output shape

If keepdims=False: 2D tensor with shape (batch_size, channels).
If keepdims=True:
- If data_format='channels_last': 4D tensor with shape (batch_size, 1, 1, channels)
- If data_format='channels_first': 4D tensor with shape (batch_size, channels, 1, 1)
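
To check those shapes yourself, here is a minimal sketch. It assumes TensorFlow 2.6 or later (where keepdims is available) and the default channels_last data format; the input size and the variable names are made up for illustration.

```python
import tensorflow as tf

# Hypothetical input: batch of 2 feature maps, 4 x 4 spatial size, 3 channels.
x = tf.random.normal((2, 4, 4, 3))

# AveragePooling2D needs a pool_size; strides default to pool_size.
avg_out = tf.keras.layers.AveragePooling2D(pool_size=2)(x)

# GlobalAveragePooling2D takes no pool_size: it averages over all rows and columns.
gap_out = tf.keras.layers.GlobalAveragePooling2D()(x)

# keepdims=True keeps the two spatial axes, each with length 1.
gap_keep_out = tf.keras.layers.GlobalAveragePooling2D(keepdims=True)(x)

print(avg_out.shape)       # (batch_size, pooled_rows, pooled_cols, channels)
print(gap_out.shape)       # (batch_size, channels)
print(gap_keep_out.shape)  # (batch_size, 1, 1, channels)
```

With this input the three shapes should come out as (2, 2, 2, 3), (2, 3) and (2, 1, 1, 3).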

bharathikannan · December 21, 2021, 9:27am

@Maverick06, the main use of a pooling layer is to reduce the number of features. In average pooling, the average is taken over small blocks of the input. In global average pooling, it is taken over the whole input.

For a single image, ignoring the batch size, consider an input of dimension 4 (height) x 4 (width) x 4 (channels). Global average pooling takes the average over the whole 4 (height) x 4 (width) area: it averages all 16 values, and does so for each of the 4 channels separately. The result is 1 x 4 (channels). If you need to preserve the number of dimensions, set keepdims to True and you will get 1 x 1 x 4. Now you can see why it is called global. It is mainly used in place of the fully connected layers at the end of a CNN, since it reduces each feature map to a single value.

In average pooling, you do not average over the whole height and width. Instead you average over small blocks, which you define with strides and pool_size. So with strides of 2 and a pool_size of 2 x 2, this example gives 2 (pooled_height) x 2 (pooled_width) x 4 (channels) as output.
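
The 4 x 4 x 4 example above can be reproduced by hand. This is only a NumPy sketch of the arithmetic, not how Keras implements the layers; the image values are made up, and the reshape trick only works because the stride equals the pool size, so the blocks do not overlap.

```python
import numpy as np

# One channels-last image: 4 (height) x 4 (width) x 4 (channels), made-up values.
image = np.arange(4 * 4 * 4, dtype=np.float32).reshape(4, 4, 4)

# Global average pooling: average all 16 spatial values, separately per channel.
global_avg = image.mean(axis=(0, 1))                      # shape (4,)
global_avg_keep = image.mean(axis=(0, 1), keepdims=True)  # shape (1, 1, 4)

# Average pooling with a 2 x 2 pool and stride 2: split height and width into
# 2 x 2 blocks, then average inside each block.
blocks = image.reshape(2, 2, 2, 2, 4)  # (out_h, pool_h, out_w, pool_w, channels)
avg_pooled = blocks.mean(axis=(1, 3))  # shape (2, 2, 4)

print(global_avg.shape, global_avg_keep.shape, avg_pooled.shape)
```

Because the blocks are equal in size and do not overlap, averaging avg_pooled over its two spatial axes gives global_avg again.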

Maverick06 · December 30, 2021, 11:58am

Thanks @bharathikannan, got it.
