The cost will become NaN if your \hat{y} value rounds to exactly 0 or 1. The cross-entropy cost contains \log(\hat{y}) and \log(1 - \hat{y}), so a saturated prediction produces \log(0) = -\infty, and when that gets multiplied by a zero label you end up with 0 \cdot (-\infty), which is NaN. Of course \hat{y} is the output of the sigmoid
function, so in pure math over \mathbb{R} it can never be exactly 0 or 1. But in floating point everything is an approximation, and we can end up with exactly 0 or 1. You have several approaches to deal with that:
- The first approach is to understand in more detail what is happening, e.g. instrument your code to track how close the values are getting to 0 or 1 (see the first sketch after this list). In 64-bit floating point, I believe z greater than about 36 is enough to give you sigmoid(z) = 1 exactly. If you are saturating, maybe you need a smaller learning rate or a smaller iteration count. Of course it also matters how accurate your predictions are: if the saturated values are confident and correct, the problem is mainly in the cost calculation rather than the model itself.
- You can also put a defense mechanism into your cost logic to protect against \hat{y} rounding to 0 or 1, e.g. by clipping the values away from the endpoints before taking the log (a sketch of that follows below). Here’s a thread which discusses that in more detail.
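To make the first point concrete, here is a minimal sketch in plain NumPy. It shows where float64 sigmoid saturates and the kind of instrumentation you could drop into a training loop; the names Z and A are just placeholders I'm assuming for your pre-activation and activation arrays, not anything from the course code.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Where does float64 saturate?  The output rounds to exactly 1.0 once z
# gets into the mid-30s (around z > 36 or so).
for z in [30.0, 35.0, 36.0, 37.0, 40.0]:
    a = sigmoid(z)
    print(f"z = {z:5.1f}  sigmoid(z) = {a:.17g}  exactly 1.0? {a == 1.0}")

# Instrumentation you could add to a training loop.  Z and A are
# hypothetical names for the pre-activation and activation arrays:
#   print("max |Z| =", np.max(np.abs(Z)),
#         " min A =", np.min(A), " max A =", np.max(A))
```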
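And here is a minimal sketch of the second idea: clip \hat{y} away from exactly 0 and 1 before taking the log. The function name safe_cost and the eps value are my own choices for illustration, not something from the course or the linked thread.

```python
import numpy as np

def safe_cost(Y, A, eps=1e-12):
    """Binary cross-entropy, with A clipped away from exactly 0 and 1."""
    A = np.clip(A, eps, 1.0 - eps)   # guard against log(0)
    m = Y.shape[-1]
    return -np.sum(Y * np.log(A) + (1 - Y) * np.log(1 - A)) / m

# A fully saturated but correct prediction: without the clip, the term
# (1 - Y) * log(1 - A) would evaluate 0 * log(0) = 0 * (-inf) = nan.
Y = np.array([[1.0, 0.0]])
A = np.array([[1.0, 0.0]])
print(safe_cost(Y, A))   # small finite number instead of nan
```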