Can Linear Regression Use Neural Networks?

Weiming_Xu · October 20, 2021, 5:43pm

If so, what’s the activation function?
If the activation function is simply linear, multiple hidden layers become a simple linear function

paulinpaloalto · October 20, 2021, 7:34pm

But the point is that the hidden layers are not linear, right? So this is not “linear regression”: it is applying a DNN to solve a regression problem. This is perfectly possible: you just have to choose an appropriate output layer activation (as you say) and then a cost function. Depending on the nature of your output values (e.g. can they be negative) you can either just use the linear output or apply ReLU at the output layer. Then you’ll probably want to used a distance based loss function like MSE.

Weiming_Xu · October 21, 2021, 3:00am

Please show me some examples of the activation functions for linear regression.

paulinpaloalto · October 21, 2021, 3:31am

You can just use either ReLU if the values need to be positive or just use no activation function at all at the output layer if negative values are meaningful for whatever the quantity is that you are trying to predict.

Weiming_Xu · October 21, 2021, 9:58pm

If no activation function at all at the output layer, NN becomes linear regression, no matter how many hidden layers.

paulinpaloalto · October 21, 2021, 11:12pm

That is not true: we are only talking about the output layer here, right? The point is that there are non-linear activations in all the hidden layers. You can choose the functions to use, e.g. sigmoid, tanh, swish, ReLU, Leaky ReLU etc. The point is that it is only at the output layer that we would consider just using the linear output in this type of case.

Weiming_Xu · October 22, 2021, 12:36am

sigmoid, tanh, swish, ReLU, Leaky ReLU etc. restricts output to (-1, or) or positive.

do we have activation functions that output both large positive and large negative?

paulinpaloalto · October 22, 2021, 2:18am

The range of Leaky ReLU is (-\infty, \infty). Or if you simply don’t use an activation function at the output layer, then the range is also (-\infty, \infty).

Topic		Replies	Views
Neural Network Linear Regression Neural Networks and Deep Learning coursera-platform	1	591	May 24, 2021
Neural Network with linear regression Supervised ML: Regression and Classification week-module-2	2	511	August 17, 2022
Why do you need Non-Linear Activation Functions? Neural Networks and Deep Learning coursera-platform	3	717	March 15, 2022
Linear Activation Function Hidden Layer Neural Networks and Deep Learning coursera-platform	3	605	May 25, 2021
In NN are activation function alway logistic regesstions? Advanced Learning Algorithms week-module-1	2	489	February 14, 2023

Can Linear Regression Use Neural Networks?

Related topics