Hello,
I don't understand why ReLU works well as an activation function for hidden layers.
Even though the function itself is non-linear, when I tried to carry out the computations using ReLU I ended up with a linear regression model, exactly like what Sir Andrew demonstrates when he explains why we should not use linear activations.
I'm very confused about this; if someone could clarify it for me I would be very grateful.
Hi @abdou_brk,
You can think of the ReLU function as a kind of "filter" that passes positive numbers through unchanged and blocks everything else to zero. The ability of the neural net to describe and learn non-linear characteristics and cause-and-effect relationships emerges from the combination of many neurons, where the non-linearity comes from the "transition to the negative part" of ReLU (the kink at zero). During training, the "best" parameters (or weights) are learned to minimize a cost function; see also this thread:
Source: Choice of activation function - #8 by Christian_Simonis
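To make this concrete, here is a minimal NumPy sketch (the weights are random and purely illustrative, not taken from the course): with a linear "activation" the two layers collapse into one linear map, which is why you end up with plain linear regression, whereas with ReLU in between the output becomes a piecewise-linear, i.e. non-linear, function of the input.

```python
import numpy as np

def relu(z):
    # Passes positive values through, blocks everything else to zero.
    return np.maximum(0, z)

rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(4, 1)), rng.normal(size=(4, 1))   # hidden layer (illustrative weights)
W2, b2 = rng.normal(size=(1, 4)), rng.normal(size=(1, 1))   # output layer

x = np.linspace(-3, 3, 7).reshape(1, -1)                    # a few sample inputs

# With a linear activation, the two layers collapse into one linear map:
# W2 @ (W1 @ x + b1) + b2 == (W2 @ W1) @ x + (W2 @ b1 + b2)
linear_out = W2 @ (W1 @ x + b1) + b2
collapsed  = (W2 @ W1) @ x + (W2 @ b1 + b2)
print(np.allclose(linear_out, collapsed))                   # True -> still just linear regression

# With ReLU in between, each hidden unit "switches off" for part of the input
# range, so the composed function bends where units cross zero:
relu_out = W2 @ relu(W1 @ x + b1) + b2
print(relu_out.round(2))
```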
Best regards
Christian