I’m trying to use an AI model to predict a variable that has 12 classes. I have 14 features and 144,470 training examples. The model starts right away at an AUC of 0.65 and doesn’t improve beyond that. Is there any rule of thumb that I’m not considering?


Did you try a simpler model first? Like one Dense layer and no Dropouts?

Yes. I started with one dense layer with 32 units and it reaches the ceiling of 0.65 AUC very fast. I ran it for more than 1000 epochs and it stops learning around 0.65 AUC. I also tried different batch sizes; now I’m using 128, which made the NN reach its best AUC fastest. That’s why I started testing with more layers and dropouts.

When you only have 14 input features, I think any model is going to struggle to train well when you are trying to learn 3 million parameters from only 144,000 examples.
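As a rough illustration (assuming plain fully connected layers with biases — the exact architecture wasn’t posted), you can count how many parameters a dense network has to learn and compare that against the 144,470 examples:

```python
def dense_param_count(layer_sizes):
    """Total weights + biases in a fully connected network.

    layer_sizes: e.g. [14, 32, 12] for 14 inputs, one hidden
    layer of 32 units, and 12 output classes.
    """
    return sum(n_in * n_out + n_out
               for n_in, n_out in zip(layer_sizes, layer_sizes[1:]))

# The simple baseline: 14 -> 32 -> 12
print(dense_param_count([14, 32, 12]))              # 876

# A wider/deeper net quickly dwarfs the training set
print(dense_param_count([14, 1024, 1024, 1024, 12]))  # 2126860
```

The baseline model has fewer than a thousand parameters, so capacity is unlikely to be the bottleneck; stacking wide layers is what blows the count up into the millions.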

Dropout is used to avoid overfitting. You don’t have overfitting (you’re struggling to get high training accuracy), so I recommend not adding any Dropout layers until you see some overfitting.

Making the model more complicated isn’t getting you better predictions: you get the same AUC from both a simple model and a complex one.

You didn’t mention what activation function you’re using. Hopefully you remembered to one-hot encode the output labels.
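For reference, one-hot encoding integer class labels looks like this (a minimal NumPy sketch; most frameworks also ship a helper for it, e.g. Keras’s `to_categorical`):

```python
import numpy as np

def one_hot(labels, num_classes):
    """Convert integer class labels to one-hot row vectors."""
    encoded = np.zeros((len(labels), num_classes))
    encoded[np.arange(len(labels)), labels] = 1.0
    return encoded

y = np.array([0, 3, 11])       # integer labels for a 12-class problem
print(one_hot(y, 12).shape)    # (3, 12)
```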

Rules of thumb:
Start with one hidden layer, using sigmoid() activation.
The size of the hidden layer could be either:

  • the square root of the number of input features,
  • or the average of the number of input features and the number of output labels.

Once you get this working as well as you can, then try adding one more hidden layer (with both hidden layers having the same number of units).
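Applied to this problem (14 features, 12 classes), those two rules of thumb give roughly:

```python
import math

n_features = 14
n_classes = 12

# Rule 1: square root of the number of input features
rule1 = round(math.sqrt(n_features))

# Rule 2: average of input features and output labels
rule2 = (n_features + n_classes) // 2

print(rule1, rule2)   # 4 13
```

So a hidden layer somewhere in the range of roughly 4 to 13 units would be the starting point under these rules — notably smaller than the 32 units already tried, which is further evidence the bottleneck isn’t model capacity.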