Pooling operations: why not adjust parameters through gradient descent?

Zijun_Liu · August 1, 2024, 8:49pm

In the video, Pooling Layers, Prof Ng says that there are no parameters to learn. Why is that? Is it

technically impossible
or
a design choice for computational efficiency which is the purpose of pooling?

gent.spah · August 2, 2024, 10:22am

Probably this, they just condense information in a smaller data size!

paulinpaloalto · August 2, 2024, 3:41pm

There are literally no trainable parameters in a pooling layer: all it does is apply the chosen algorithm to the inputs (either average or max) in a fixed way based on the filter size and stride that is specified. There are no “weights” in the pooling layer. But that does not mean that backward propagation does not pass through that layer: the gradients from the later steps project backward through the pooling layer. They are either applied as averages to all the inputs or only to the max elements of the input layer, depending on the definition of the pooling layer. That will be covered in the first assignment for C4 W1.

Topic		Replies	Views
Gradient descent with Max Pooling (DSL 4 - Week 1) Convolutional Neural Networks coursera-platform	2	740	September 29, 2022
The Basics of ConvNets Convolutional Neural Networks coursera-platform	3	940	May 15, 2021
Question on the benefit of CNN: sparsity? Convolutional Neural Networks coursera-platform	3	585	December 12, 2022
Why convolution operation? Convolutional Neural Networks coursera-platform	5	723	September 28, 2022
A doubt on Pooling layer Convolutional Neural Networks coursera-platform	1	377	September 19, 2023

Pooling operations: why not adjust parameters through gradient descent?

Related topics