Differences between ReLU and linear for positive values

Thrasso00 · November 26, 2022, 6:59am

Hi,

I have had some questions regarding the activation functions when viewing the lecture: ‘Why do we need activation functions?’ from week2.
My main doubt may come from the fact that I don’t see the full breakdown of all the calculations underneath each unit of each layer.

The first thing I don’t understand is what is the difference of a ReLU and a linear function in the hypothetical case that all z values are positive. Wouldn’t we be in the same case?
Would we be at this moment in a linear regression case?

balaji.ambresh · November 26, 2022, 8:28am

Please see this link

Christian_Simonis · November 26, 2022, 8:20pm

Hi there,

in case you are in the linear (positive) part of the ReLU function, this just means that the output axon of this neuron is „firing“ and allows the input of the activation b+ \sum_i w_i x_i to pass through as output.

You can consider the ReLU as some kind of „filter“ which passes through positives numbers but blocks everything else to zero. The ability of the neural net to describe and learn non-linear characteristics and cause effects is enabled due the combination of many neurons where the non-linearity is emerging from the negative part of the ReLU function. During the training the „best“ parameters (or weights) can be learned to minimize a cost function.

Since many (really a lot!) neurons are assigned with bias and weights, linked with an activation function, by combination of multiple neurons (as the neural net in total does) this allows to learn highly nonlinear behavior, although the activation function of one neuron itself possesses only a piecewise linear activation function in case of ReLU.
(see also Choice of activation function - #8 by Christian_Simonis)

Best
Christian

Thrasso00 · November 29, 2022, 7:19am

Hi,

Thank you all for your help. Now I understand better what is the purpose of the ReLU function. I have some new questions emanating from the optional Lab - ReLU activation.
In this case, is it better to create a new post or continue in this one?

Kic · January 16, 2023, 7:59pm

Excellent explanation, thanks Christian.

Topic		Replies	Views
RELU vs linear activation Supervised ML: Regression and Classification week-3	4	644	February 15, 2023
Why do we need an activation function? \| ReLU activation Advanced Learning Algorithms week-2	4	622	August 2, 2022
Why do we need Activation function Neural Networks and Deep Learning coursera-platform	4	544	February 16, 2023
Understanding RELU deeply Neural Networks and Deep Learning coursera-platform	6	896	February 5, 2023
Choice of activation function Advanced Learning Algorithms week-2	7	683	November 21, 2022

Differences between ReLU and linear for positive values

Related topics