I'm not clear on why the backprop for the max function works. It seems to simply broadcast each value in dA across the window, creating an array with the same value in every cell, and then add that to the dA_prev slice. Why is a max mask even needed in the first place?
The max value is computed over each window (i.e. each smaller portion) of the input, not across the entire input. So when backpropagating, the gradient should flow only to the entry that held the maximum within that window. Please see create_mask_from_window, which returns a mask with 1s only at the positions where the value equals the window's maximum.
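To make this concrete, here is a minimal sketch of what such a helper can look like (the exact implementation in the assignment may differ; the example window values are illustrative):

```python
import numpy as np

def create_mask_from_window(x):
    """Return a mask that is True only where x equals its maximum.

    Comparing against np.max(x) marks every position holding the
    max, so ties are all marked.
    """
    return x == np.max(x)

window = np.array([[1.0, 3.0],
                   [2.0, 3.0]])
mask = create_mask_from_window(window)
# mask is True at the two positions holding the value 3.0
```

Because the mask is boolean, multiplying it by a scalar treats True as 1 and False as 0, which is exactly what the backward pass exploits.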
Thanks very much. Yes, I realized later that the elementwise multiply by the mask ensures the gradient ends up only at the position the mask marks as the max. I'm still not entirely clear why that is sufficient, but I can see that knowing the position and value of the max in each window helps determine which inputs actually produced the outputs, which is useful for training. Will research more.
I may just be misinterpreting your wording there, but note that there are no dot products involved here; the multiplication is elementwise. Look carefully at how the operation between the mask Balaji points out and the gradient value works: we multiply the gradient for that output position (a scalar) by the mask, then add the result to the region of the input that was the source of that output on forward propagation. The mask is zero at every position that does not correspond to the maximum input, so only the input elements equal to the maximum receive any gradient.
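The step described above can be sketched as follows, for a single window. This is an illustration, not the assignment's code; the array values and the names a_prev_slice, dA_value, and dA_prev_slice are hypothetical:

```python
import numpy as np

def create_mask_from_window(x):
    # True only where x equals the window's maximum
    return x == np.max(x)

# A hypothetical 3x3 input slice and the scalar gradient for the
# single max-pool output it produced on forward propagation.
a_prev_slice = np.array([[1.0, 2.0, 0.0],
                         [4.0, 9.0, 3.0],
                         [5.0, 6.0, 7.0]])
dA_value = 2.0                          # gradient for this output
dA_prev_slice = np.zeros_like(a_prev_slice)

mask = create_mask_from_window(a_prev_slice)
# Elementwise product: the scalar gradient is routed only to the
# position of the max (here, the 9.0); all other positions get 0.
dA_prev_slice += mask * dA_value
```

So even though dA_value is a single scalar "broadcast" over the window, the zeros in the mask kill it everywhere except at the max position, which is why the mask is needed.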