Differences between ReLU and linear for positive values

Christian_Simonis · November 26, 2022, 8:20pm

Hi there,

in case you are in the linear (positive) part of the ReLU function, this just means that the output axon of this neuron is „firing“ and allows the input of the activation b+ \sum_i w_i x_i to pass through as output.

You can consider the ReLU as some kind of „filter“ which passes through positives numbers but blocks everything else to zero. The ability of the neural net to describe and learn non-linear characteristics and cause effects is enabled due the combination of many neurons where the non-linearity is emerging from the negative part of the ReLU function. During the training the „best“ parameters (or weights) can be learned to minimize a cost function.

Since many (really a lot!) neurons are assigned with bias and weights, linked with an activation function, by combination of multiple neurons (as the neural net in total does) this allows to learn highly nonlinear behavior, although the activation function of one neuron itself possesses only a piecewise linear activation function in case of ReLU.
(see also Choice of activation function - #8 by Christian_Simonis)

Best
Christian

Topic		Replies	Views
Relu activation NLP with Probabilistic Models week-module-2	1	565	March 14, 2023
Isn't Relu just a lineer regression function for z>=0 Supervised ML: Regression and Classification week-module-3	6	686	December 24, 2022
RELU vs linear activation Supervised ML: Regression and Classification week-module-3	4	646	February 15, 2023
Understanding RELU deeply Neural Networks and Deep Learning coursera-platform	6	901	February 5, 2023
How non linear is ReLU? Neural Networks and Deep Learning coursera-platform	4	799	March 17, 2023

Differences between ReLU and linear for positive values

Related topics