Why do we need Activation function

Med-akraou · February 16, 2023, 5:12pm

If we have positive values, the ReLU function is identical to Linear Function.
Then, in cases where our inputs are all positive features, and all intial values of weights W and bias b are positive deep neural network is going still linear model isn’t it ?
Even we have very small inputs negative Relu is a semi linear function, the majority of calculs will still linear. Could you explain more in this topic?

TRAN_KHANH1 · February 16, 2023, 5:28pm

The ReLU function is non-linear so a linear combination of those functions will expected to be non-linear.

In training, when the learned function is still linear (but the desired function is non-linear), then the gradient descent (or any other optimization algorithm) will help to move it close to that function.

AbdElRhaman_Fakhry · February 16, 2023, 5:44pm

Hi @Med-akraou

The Relu Acitvation function isn’t linear functions, it’s simple non-linear functions, and if we assume that all the features , and intial values of weights are positive:

The optimization algorithm will adjust and tune these values to be more closer to the output
we didn’t built the linear regression we built abig complex model to fit complex data like images , sounds with more than 1 hidden layer so the combination of these layer must lead to negative values and the output of relu function will be equal 0

Why we use the simple non-linear activation function this was discussed in this thread by Mentor @paulinpaloalto

Cheers,
Abdelrahman

Med-akraou · February 16, 2023, 6:10pm

Thank you @AbdElRhaman_Fakhry

Ji_Huang · February 16, 2023, 8:44pm

There is a theorem called universal approximation theorem stating that with the combination of affine mappings (W*A_prev + b) and nonlinear activation functions, such as sigmoid or relu, then the forward network can approximate almost any (some sort of continuous) functions. Without these nonlinear activation functions, it’s hard.

Formal proof needs some maths such as “measure” theory and functional analysis.

Topic		Replies	Views
C1_W3-Non-Linear_Activation_Function Neural Networks and Deep Learning coursera-platform	1	550	May 18, 2021
Relu activation NLP with Probabilistic Models week-module-2	1	565	March 14, 2023
Understanding RELU deeply Neural Networks and Deep Learning coursera-platform	6	902	February 5, 2023
Choice of activation function Advanced Learning Algorithms week-module-2	7	687	November 21, 2022
Why do we need an activation function? \| ReLU activation Advanced Learning Algorithms week-module-2	4	622	August 2, 2022

Why do we need Activation function

Related topics