I understand the first part, but why is a^{[l+2]} = a^{[l]} when we apply the ReLU activation function? I understand that ReLU returns a non-negative value, but I still do not see how we can say that this is true. Please help!
Prof Ng mentions this a bit earlier in the video: he is assuming that the network uses ReLU at all layers, so all values of a^{[l]} \geq 0. Since ReLU(x) = \max(0, x), it returns x unchanged whenever x \geq 0, so applying ReLU to that input gives back the same values. ReLU only changes the values that are negative, right?
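A quick way to convince yourself numerically (a minimal sketch, assuming NumPy, not code from the course):

```python
import numpy as np

def relu(z):
    # ReLU(z) = max(0, z), applied elementwise
    return np.maximum(0, z)

# a_l plays the role of a^{[l]}: since it came out of a ReLU layer,
# all of its entries are already >= 0
a_l = relu(np.array([-1.5, 0.0, 2.3, 4.1]))   # -> [0. , 0. , 2.3, 4.1]

# Applying ReLU again leaves it unchanged: ReLU(a^{[l]}) = a^{[l]}
print(np.array_equal(relu(a_l), a_l))          # True
```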