Polynomial regression using Neural network

amitsubhashchejara · September 3, 2024, 11:57am

How can a neural network be used for polynomial regression?
Let’s consider a NN with 4 layers and one unit within each layer. The computation looks something like this:

x->g(wx+b)->g[w(g(wx+b))]->…

If my data is of a quadratic form, how can this NN learn to fit a quadratic curve since it never computes x^2?

Nevermnd · September 3, 2024, 12:33pm

@amitsubhashchejara I am going to let one of the Mentors whom is better with maths give you an improved explanation of this-- But, in my mind it is better just to consider that your data assumes a polynomial form.

So, if in your case your data is created with a quadratic equation, then that is the form the data takes. However, what the NN actually produces will not be that exact same equation-- instead it finds the series of weights which, in conjunction and combination with those of other nodes, ‘fits it’.

And, I know you are offering a theoretical (i.e. one node per layer-- basically linear, thus asking how do I get non-linear behavior out of something linear)… Well… Not to avoid a full answer, but you’d never have only one node per layer.

It just doesn’t work like that.

amitsubhashchejara · September 3, 2024, 1:58pm

Thank you for the clarification, but let’s say that we have a NN with all the activations set to ReLU, which is linear. Now the output will be a function of x no matter the number of nodes and layers. So the line that fits the data will change its slope but never bend to become a quadratic curve. Please help with this!!

nadtriana · September 3, 2024, 2:15pm

You’re right that ReLU is a piecewise linear activation function, which means that the network’s output remains a piecewise linear function of the input. This can make it difficult for the network to directly fit a quadratic curve, especially if ReLU is the only activation function used.

Even with ReLU, the network can approximate a quadratic function by creating a series of linear segments that piece together to resemble a curve. However, this requires a sufficiently deep network or many units per layer to capture the necessary breakpoints where the slope changes. The more layers and units, the better the approximation, but it’s still fundamentally a piecewise linear function.

Consider using other activation functions, such as sigmoid or tanh, which are inherently nonlinear and can bend to fit curves, to more effectively model a quadratic curve. Another approach might be to mix ReLU with other activations in the network to combine the strengths of both linear and nonlinear modeling. This combination would help the network work better with quadratic and other nonlinear types of relationships.

Hope it helps!

TMosh · September 3, 2024, 2:38pm

I am curious why you have applied this constraint. That is a very minimal NN architecture.

Nevermnd · September 3, 2024, 2:43pm

@TMosh I think they are imagining being able to recreate the exact original form of the quadratic equation (i.e. one term each layer/node).

amitsubhashchejara · September 3, 2024, 3:11pm

Yeah, I applied this constraint to keep things simple

TMosh · September 3, 2024, 5:23pm

In practice it’s a lot simpler if you have one hidden layer with a few ReLU units.

Give it a try, please report back your results.

TMosh · September 3, 2024, 5:25pm

There is a forum thread where I detailed how this process works (e.g. modeling the equation of a parabola), if you’re interested I can try to find a link.

But I recommend you work up this example yourself. It’s very educational.

Topic		Replies	Views
Neural network with polynomial features Advanced Learning Algorithms week-3	5	422	August 30, 2023
Polynomial Feature as Hidden Unit Neural Network Advanced Learning Algorithms week-2	3	604	March 4, 2023
Non linearity of neural networks Advanced Learning Algorithms week-2	1	31	August 19, 2024
Neural Network Linear Regression Neural Networks and Deep Learning	1	569	May 24, 2021
Neural Network with linear regression Supervised ML: Regression and Classification week-2	2	497	August 17, 2022

Polynomial regression using Neural network

Related topics