According to the tf.keras.layers.Dense docs (TensorFlow v2.13.0), if no activation is applied, then a(x) = x. Then why bother having this layer with no activation function at all?

And when calculating the params for the model, the final layer without an activation function clearly still has W and b values. So is some activation function applied after all?

What function is used when no activation function is specified?

When no activation function is specified, the layer output is simply w^T \cdot X + b.

A linear activation is useful when the output layer is required to predict a regression target in the range (-\infty, \infty) (e.g. the temperature in a desert as a function of the number of hours since midnight).

So when no activation function is specified, the linear (identity) activation function is used by default.
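To make this concrete, here is a minimal sketch in plain Python (no TensorFlow; the function name and the single-output-unit setup are illustrative assumptions, not the real Keras implementation): with no activation, a dense unit's output is just the affine transformation w^T \cdot X + b.

```python
# Sketch: a single "dense" unit with no activation function.
# With no activation, the output is just the affine transformation
# w^T x + b -- nothing else is applied.

def dense_no_activation(w, b, x):
    """w: weight vector, b: scalar bias, x: input vector (one output unit)."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b

w = [0.5, -1.0]
b = 2.0
x = [4.0, 1.0]

print(dense_no_activation(w, b, x))  # 0.5*4 + (-1.0)*1 + 2.0 = 3.0
```

The layer is still useful without an activation precisely because it learns W and b, i.e. a trainable affine map.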

Then how should one understand this:

from the TensorFlow docs? Doesn't it mean that when no activation function is specified, y = x?

The activation is applied on top of the affine transformation, i.e. w^T \cdot X + b. Think of it as the following pseudocode:

```
class Dense:
    def forward_pass(self, X):
        # Affine transformation first
        output = self.w @ X + self.b
        # Activation (if any) is applied on top of the affine output
        if self.activation is not None:
            output = self.do_activation(output)
        return output
```
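The pseudocode above can be turned into a self-contained runnable sketch (plain Python, one output unit; `TinyDense` and `relu` here are hypothetical helpers, not the Keras classes), showing that the activation acts on the already-computed affine output:

```python
# Self-contained version of the pseudocode above (assumption: single
# output unit, plain Python lists instead of tensors).

def relu(z):
    return max(0.0, z)

class TinyDense:
    def __init__(self, w, b, activation=None):
        self.w, self.b, self.activation = w, b, activation

    def forward_pass(self, x):
        # Affine transformation: w^T x + b
        out = sum(wi * xi for wi, xi in zip(self.w, x)) + self.b
        # Activation, if any, is applied on top
        if self.activation is not None:
            out = self.activation(out)
        return out

x = [1.0, -2.0]
print(TinyDense([0.5, 1.0], 0.25).forward_pass(x))        # -1.25 (raw affine)
print(TinyDense([0.5, 1.0], 0.25, relu).forward_pass(x))  # 0.0 (ReLU clips it)
```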


Thank you so much!

So if I specify the linear activation function, it's effectively the same as no activation function?
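Yes: the linear activation is just the identity, so applying it changes nothing. A small sketch in plain Python (the `linear` and `dense` functions here are illustrative stand-ins mirroring the behavior of the identity activation, not the real Keras code):

```python
# Sketch: a linear activation is the identity function a(x) = x,
# so a dense unit with activation=linear equals one with no activation.

def linear(z):
    return z

def dense(w, b, x, activation=None):
    out = sum(wi * xi for wi, xi in zip(w, x)) + b
    if activation is not None:
        out = activation(out)
    return out

w, b, x = [1.0, 2.0], 0.5, [3.0, -1.0]
# Both paths produce the same affine output: 1.0*3 + 2.0*(-1) + 0.5 = 1.5
print(dense(w, b, x))                     # 1.5
print(dense(w, b, x, activation=linear))  # 1.5
```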