Linear Activation Function Hidden Layer

Anbu · May 18, 2021, 2:13pm

Hi Sir,

In the lecture video (Why do we need non-linear activation functions?), I could not understand the statement below. Could you please explain what it means?

But other than that, using a linear activation function in the hidden layer
except for some very special circumstances relating to compression that we’re
going to talk about using the linear activation function is extremely rare

jonaslalin · May 20, 2021, 1:19pm

Hi @Anbu,

I am not sure what Andrew is referring to when he mentions compression in passing. The key takeaway from that lecture is to use non-linear activation functions, except for regression problems. In that case, it makes sense to use the identity activation function in the output layer. However, if predicting housing prices, as Andrew mentions, it might make sense to use the ReLU again, if prices never go below 0.

Anbu · May 25, 2021, 11:10am

Hi @jonaslalin, you mentioned using non-linear activation functions, except for regression problems. But Prof. Andrew Ng says in the lecture to use a non-linear activation function in the hidden layers for regression problems. Is there a specific reason for that?

jonaslalin · May 25, 2021, 4:30pm

Yes, that is correct. I am only referring to the last layer, i.e., the output layer. Every layer before that should be non-linear.
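
To make the reason concrete, here is a minimal sketch in plain Python. The weights, the input, and the helper names (`matvec`, `matmul`, `forward`) are made up for illustration and are not from the lecture. It shows that a hidden layer with a linear (identity) activation followed by a linear output layer collapses into a single linear layer, while a ReLU hidden layer does not.

```python
import math

def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def matmul(A, B):
    """Multiply matrix A (m x k) by matrix B (k x n)."""
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]

def add(u, v):
    """Element-wise sum of two vectors."""
    return [a + b for a, b in zip(u, v)]

def identity(v):
    """Linear activation: returns its input unchanged."""
    return v

def relu(v):
    """ReLU activation applied element-wise."""
    return [max(0.0, a) for a in v]

def forward(x, W1, b1, W2, b2, hidden_activation):
    """One hidden layer, then an identity (linear) output layer for regression."""
    a1 = hidden_activation(add(matvec(W1, x), b1))
    return add(matvec(W2, a1), b2)

# Illustrative (hypothetical) parameters: 2 inputs, 2 hidden units, 1 output.
W1 = [[1.0, -1.0], [0.5, 2.0]]
b1 = [0.0, -1.0]
W2 = [[1.0, 1.0]]
b2 = [0.5]
x = [1.0, 2.0]

# With a linear hidden layer, W2 (W1 x + b1) + b2 = (W2 W1) x + (W2 b1 + b2),
# so the whole network equals one linear layer with these parameters:
W_eq = matmul(W2, W1)
b_eq = add(matvec(W2, b1), b2)

y_linear_hidden = forward(x, W1, b1, W2, b2, identity)
y_single_layer = add(matvec(W_eq, x), b_eq)
y_relu_hidden = forward(x, W1, b1, W2, b2, relu)

print(y_linear_hidden, y_single_layer)  # same value: the hidden layer added nothing
print(y_relu_hidden)                    # differs: ReLU zeroed a negative hidden unit
```

So with linear hidden layers the network can only ever represent a linear function of the input, no matter how many layers you stack. That is why the hidden layers stay non-linear even when the output layer is linear for regression.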
