Help for implementing my own deeplearning model

pog13 · March 16, 2024, 10:59am

Hi.
I am trying to implement my own neural network model using numpy and a csv dataset and pandas
Now im running to these problems
When i train my model without changing anything somtimes it works well and accuracy is ok but sometimes its not it changes and switches between these two sates and sumtimes prediction gets nan; and i tried various hyperparameters but the results still the same
Sometimes cost during trainig decreases ok and sometimes with those same hyperparameters statys the same for each batch and its almost constant
:

i would appreciate it if you’d take look

Nevermnd · March 16, 2024, 11:05am

I haven’t had a chance yet to look at your model, but up front, what do you mean by ‘sometimes’ ? Are you changing your training set/hyperparameters each time (which if you seek consistency at least, obviously you shouldn’t) ?

pog13 · March 16, 2024, 12:04pm

For each set of hyperparameters that i chosed for that specific set i got diffrend results without changing hyperparameters

TMosh · March 16, 2024, 5:42pm

General note:
Neural networks do not have convex cost functions - so there can be local minima.

But more likely, you could be making better choices about the weight initialization, learning rate, or feature normalization.

Amjad_Bakri · March 16, 2024, 7:10pm

i think the problem come from the sigmoid function as it output NAN sometimes you can add epsilon to it to avoid alot of problems like that

def sigmoid(z):
epsilon = np.finfo(z.dtype).eps
return 1 / (1 + np.exp(-z + epsilon))

i will let me own implementation to help u understand what I mean.

github.com

AmjadBakri/neural-network-implementation-from-scratch/blob/main/Untitled.ipynb

{
 "cells": [
  {
   "cell_type": "markdown",
   "id": "89c5b225",
   "metadata": {},
   "source": [
    "# Import the needed libraries"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 5,
   "id": "517df16e",
   "metadata": {},
   "outputs": [],
   "source": [
    "import numpy as np\n",
    "from sklearn.metrics import accuracy_score\n",
    "from sklearn.utils import shuffle\n",

This file has been truncated. show original

pog13 · March 18, 2024, 3:53pm

Thank you

pog13 · March 18, 2024, 3:55pm

Thank you
I tried your aprpach and chose better hyperparameters and know its a bit more stable but now it generaly overfits so i need to implement regularization to see if it gets better

Sahil2 · April 1, 2024, 6:29pm

Look for the numerical stability of all the formulas in your implementation, specially sigmoid and log related formulas. I had this problem of getting nan outputs due to this.

TMosh · April 1, 2024, 6:33pm

If you normalize the features of the data set, you generally will not have numerical issues with sigmoid or log().

Topic		Replies	Views
NAN as results for the cost computations Neural Networks and Deep Learning coursera-platform	27	608	December 27, 2021
General implementation of deep neural network for multi class classification problem, using course 1 and course 2 Improving Deep Neural Networks: Hyperparameter tun week-1 , coursera-platform	13	295	January 5, 2024
Dropout cost get nan AI Discussions	4	161	June 11, 2022
Need help on my own deep learning implementation from scratch AI Discussions ai-discussions	1	79	June 3, 2023
Deep Neural Network Improving Deep Neural Networks: Hyperparameter tun week-1 , module-2 , coursera-platform	15	100	May 30, 2025

Help for implementing my own deeplearning model

Related topics