Help with developing a deep learning model

Hello, I am currently working on a new project which consists of developing a model that can predict the price of insurance claims based on information about clients, contracts, and claims data. The training set has over 5400 examples. I used a neural network model and I got overfitting: low MSE on the training set and high MSE on the CV and test sets. I've tried different methods, like feature engineering, tuning the regularization term, adjusting the number of epochs... But I still get the same thing: low MSE on the training set and a big difference on the CV and test sets.

Please, I need help, because it's an important project for me!

Did you try just adding regular old L2 regularization?

As an experiment, keep increasing the lambda value until you get a much higher training cost. Then check the validation cost and compare the two.
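A minimal sketch of that experiment, assuming a scikit-learn-style workflow (here `MLPRegressor`'s `alpha` parameter plays the role of lambda; the dataset and network sizes are made up for illustration):

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPRegressor

# Synthetic stand-in for the insurance data (real features would come
# from the client/contract/claim tables).
X, y = make_regression(n_samples=500, n_features=10, noise=5.0, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=0)

# Sweep increasing L2 strengths (alpha ~ lambda) and compare the
# training MSE against the validation MSE at each value.
results = {}
for alpha in [1e-4, 1e-2, 1.0, 100.0]:
    model = MLPRegressor(hidden_layer_sizes=(32, 32), alpha=alpha,
                         max_iter=1000, random_state=0)
    model.fit(X_train, y_train)
    results[alpha] = (mean_squared_error(y_train, model.predict(X_train)),
                      mean_squared_error(y_val, model.predict(X_val)))

for alpha, (tr, va) in results.items():
    print(f"alpha={alpha:g}: train MSE={tr:.2f}, val MSE={va:.2f}")
```

Once the training MSE rises noticeably at large alpha, compare it to the validation MSE: a shrinking gap means the regularization is doing its job.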


Also, I think your dataset seems small and probably has a lot of variability (meaning the examples are quite different from one another). You might need a bigger dataset, which would reduce the variability in the data.

In the case of small datasets, I think K-fold cross-validation can help improve performance too!
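A sketch of K-fold cross-validation on a small dataset (synthetic stand-in data, and a simple `Ridge` model just to keep the example fast; the same pattern works with a neural network):

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import Ridge
from sklearn.model_selection import KFold, cross_val_score

# Synthetic stand-in; the real features would come from the claims dataset.
X, y = make_regression(n_samples=300, n_features=8, noise=10.0, random_state=0)

# 5-fold CV: every example is used for validation exactly once, which gives
# a more stable error estimate than a single small validation split.
cv = KFold(n_splits=5, shuffle=True, random_state=0)
scores = cross_val_score(Ridge(alpha=1.0), X, y,
                         scoring="neg_mean_squared_error", cv=cv)
mse_per_fold = -scores
print("MSE per fold:", mse_per_fold.round(2))
print("Mean CV MSE:", mse_per_fold.mean().round(2))
```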


I looped through different values of lambda, trying to find the value of lambda that gives a balance between the training MSE and the CV and test MSE.

For the different values of lambda, I reached the balance point with high errors (around 1.xxx) for the training, CV, and test MSE.

To be more precise, I've scaled both x_train and y_train, so the error should be close to 0...

You’ll only get a cost near zero if the model is complex enough to give a good fit.

Your results suggest the model isn’t complex enough, or it isn’t training until convergence.

What optimizer are you using?

If that doesn’t help, then maybe try adding more hidden layer units, or more hidden layers.
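A small experiment illustrating that point, with a deliberately nonlinear synthetic target (sizes are illustrative; a 2-unit network is too simple to fit it, while a larger one can):

```python
import numpy as np
from sklearn.datasets import make_friedman1
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPRegressor

# Nonlinear synthetic regression target, so a tiny network will underfit.
X, y = make_friedman1(n_samples=400, noise=0.5, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=0)

train_mse = {}
for hidden in [(2,), (64, 64)]:  # tiny network vs. larger network
    model = MLPRegressor(hidden_layer_sizes=hidden, max_iter=3000,
                         random_state=0)
    model.fit(X_train, y_train)
    train_mse[hidden] = mean_squared_error(y_train, model.predict(X_train))

print(train_mse)
```

If even the training MSE stays high, the model lacks capacity (or hasn't converged); only once training error is low does the train-vs-validation gap become the thing to fix.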


I am using the Adam optimizer.
Normally, if the data are scaled for training, shouldn't a good algorithm return an error near 0?

In practical use, the minimum cost is never going to be zero.

And that’s not the goal - the goal is to find the weights that give the minimum cost - it does not have to be a small value.

What parameters are you setting?

I was training a neural network.
If I use a simple network with a low number of layers, I get high error on both the training and CV sets.
If I try to go further by adding layers to make the model fit well, I get overfitting. I tried applying feature engineering, L2 regularization, and adjusting the number of epochs and batch size, but I still get the overfitting issue.

Adding layers has nothing to do with the Adam optimizer settings.
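For reference, these are the Adam settings that are typically tunable, separate from the architecture (shown here via scikit-learn's `MLPRegressor`, whose `solver="adam"` exposes them; the values below are just Adam's usual defaults, and the parameter names differ in Keras/PyTorch):

```python
from sklearn.neural_network import MLPRegressor

# Adam's own hyperparameters, distinct from the number of layers/units:
model = MLPRegressor(
    solver="adam",
    learning_rate_init=0.001,  # step size
    beta_1=0.9,                # decay rate for the first-moment estimate
    beta_2=0.999,              # decay rate for the second-moment estimate
    epsilon=1e-8,              # numerical-stability term
    hidden_layer_sizes=(32,),
    max_iter=200,
)
print(model.get_params()["learning_rate_init"])
```

The learning rate (`learning_rate_init`) is usually the first of these worth tuning; the betas and epsilon rarely need changing.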

increase the number of training samples
reduce the number of features
implement regularization, as others have suggested
some people implement prediction based on historical data
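Putting a few of those suggestions together, a hedged sketch combining scaling, L2 regularization, and early stopping (scikit-learn's `MLPRegressor` supports early stopping on an internal validation split; all sizes and values here are illustrative, not tuned for the real claims data):

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.neural_network import MLPRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the claims data.
X, y = make_regression(n_samples=500, n_features=12, noise=8.0, random_state=0)

# Scaling + L2 (alpha) + early stopping in one pipeline: training stops
# when the score on the internal validation split stops improving.
model = make_pipeline(
    StandardScaler(),
    MLPRegressor(hidden_layer_sizes=(64,), alpha=0.1,
                 early_stopping=True, validation_fraction=0.2,
                 n_iter_no_change=10, max_iter=2000, random_state=0),
)
model.fit(X, y)

mlp = model.named_steps["mlpregressor"]
print("stopped after", mlp.n_iter_, "iterations")
```

Early stopping in particular attacks the symptom described in this thread directly: it halts training at the point where validation error stops improving, before the network starts memorizing the training set.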