Stuck, need help

I was trying a Kaggle exercise for practice: predicting insurance premiums.

However, I’m stuck in a local minimum. No matter what I do (increasing or decreasing the learning rate, feature scaling, feature engineering), the cost stays stuck around 750,000.

My work:

import tensorflow as tf
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Dropout
from tensorflow.keras.regularizers import l2

model = Sequential([
    Dense(50, activation='relu', input_shape=(X_train.shape[1],), kernel_regularizer=l2(0.01)),
    Dropout(0.2),
    Dense(30, activation='relu', kernel_regularizer=l2(0.01)),
    Dropout(0.2),
    Dense(20, activation='relu', kernel_regularizer=l2(0.01)),
    Dropout(0.2),
    Dense(40, activation='relu', kernel_regularizer=l2(0.01)),
    Dropout(0.2),
    Dense(10, activation='relu', kernel_regularizer=l2(0.01)),
    Dropout(0.2),
    Dense(1, activation='linear')
])

Got similar results with or without the regularization.

Tried playing with learning rate and momentum as well:

model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=0.008, beta_1=0.9, beta_2=0.999), loss='mse', metrics=['mae']) 
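If a fixed learning rate keeps stalling, one alternative (my suggestion, not something tried in the thread) is a decaying schedule; the initial rate and decay constants below are assumptions:

```python
import tensorflow as tf

# Suggestion: decay the learning rate over training instead of fixing it
# at 0.008. All the constants here are assumed, not tuned.
lr_schedule = tf.keras.optimizers.schedules.ExponentialDecay(
    initial_learning_rate=1e-3,   # starting LR (assumed)
    decay_steps=10_000,           # steps between decays (assumed)
    decay_rate=0.9)               # multiply the LR by 0.9 every decay_steps

optimizer = tf.keras.optimizers.Adam(
    learning_rate=lr_schedule, beta_1=0.9, beta_2=0.999)
```

You would then pass this optimizer to model.compile in place of the fixed-rate Adam above.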

Tried different batch size as well:

history = model.fit(
    X_train_scaled, y_train,
    epochs=50,  # Adjust based on performance
    batch_size=128,  # Larger batch sizes for faster training - model will process 128 samples at a time before updating its weights
    verbose=1)

One thing to note: the dataset is huge, about 1 million training rows.

For this practice exercise, did you intend to use five layers of ReLU units, with fewer units in the middle layers of the model? That seems an unusual design.

Did you try a very simple model first?
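For illustration, a "very simple" first model could be a single Dense unit with no hidden layers, i.e. plain linear regression. The synthetic data below is a made-up stand-in for the real features:

```python
import numpy as np
import tensorflow as tf

# Made-up stand-in data: 256 rows, 8 standardized features, linear target
rng = np.random.default_rng(0)
X = rng.normal(size=(256, 8)).astype('float32')
y = (X @ rng.normal(size=(8, 1)) + 5.0).astype('float32')

# One linear unit, no hidden layers: a linear-regression baseline
baseline = tf.keras.Sequential([tf.keras.layers.Dense(1)])
baseline.compile(optimizer='adam', loss='mse')
history = baseline.fit(X, y, epochs=50, batch_size=32, verbose=0)
```

If even a baseline like this plateaus at the same loss on the real data, the problem is more likely in the target or features than in the architecture.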

I’m using Dropout for regularization, and I did try without it. I also tried simpler approaches, like just 2 or 3 layers with a small number of units, but the outcome is the same every time.

Link to the Kaggle page, please.

Links are not allowed here. Go to the Kaggle website and search for this series: playground-series-s4e12

Why is sharing the link disallowed?

The target variable is a continuous value with quite a large range. Here’s the sample data from the website:

id,Premium Amount
1200000,1102.545
1200001,1102.545
1200002,1102.545
etc.

A NN is going to have quite a challenge fitting target values of that scale.
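One common remedy (a suggestion, not something tried in the thread) is to train on a log-transformed target so the network only has to fit a narrow range, then invert the transform on the predictions. A minimal numpy sketch with made-up premium values:

```python
import numpy as np

y_train = np.array([1102.545, 52.0, 9800.0, 250.0])  # hypothetical premiums

y_log = np.log1p(y_train)     # compressed range, roughly 4 to 9
preds_log = y_log             # pretend these are the network's outputs
preds = np.expm1(preds_log)   # invert log1p to recover the premium scale

assert np.allclose(preds, y_train)
```

With targets on a log scale, an MSE loss in the hundreds of thousands becomes a loss of a few units, which also makes learning-rate tuning far less touchy.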

  1. What specializations have you taken so far? (How about the TensorFlow Developer Specialization?)
  2. What are your input features?