A doubt on gradient checking

JJaassoonn · August 20, 2023, 2:24pm

Dear Mentor,

Could you please guide me on this issue?

Regarding to this formula, there is a statement in the lecture mentioning that

“the row for the denominator is just in case any of these vectors are really small or really large, the denominator turns this formula into a ratio”

May i have any mathematical example to understand this statement?

Thank you.

paulinpaloalto · August 20, 2023, 4:17pm

You need to “scale” the results of the check by the sizes of the actual vectors you are approximating. Suppose the difference value comes out to be 0.5. How do you know if that’s a big error or not? If the norm of the actual vectors are say 10^6, then 0.5 is a pretty small error. But how about if the norms are 1? Then it’s a pretty big error, right?

Or think of it this way: you’re converting the error into a “percentage error” without the factor of 100. If I’m measuring the distance from here to the moon, then 1 meter is a pretty small error. One meter divided by the distance from here to the moon is a small number. If I’m measuring the length of my left arm, then 1 meter is a pretty big error.

When dealing with approximation error, the scale matters.

rmwkwok · August 21, 2023, 12:41am

Hello @JJaassoonn ,

You may also do this kind of checking yourself to get some hands-on.

Raymond

paulinpaloalto · August 21, 2023, 3:48am

Thanks, Raymond! That’s a really cool example that concretely demonstrates the point I was just talking about in general terms. I’ve bookmarked that thread and will use it if this question comes up again!

JJaassoonn · August 21, 2023, 3:48am

Dear Mr Paul Mielke,

Thank you so much for your guidance.

JJaassoonn · August 21, 2023, 3:55am

Dear Mr Raymond,

I have studied the example from the discussion thread. Thank you so much for sharing it.

rmwkwok · August 21, 2023, 8:41am

That was an example of how we can understand something by just working it out. Hope that will help your future study.

Topic		Replies	Views
Gradient Checking ____ Euclidean distance Improving Deep Neural Networks: Hyperparameter tun coursera-platform	4	451	July 17, 2023
C2W1 - Theory behind Gradient Checking formula? Improving Deep Neural Networks: Hyperparameter tun coursera-platform	11	557	August 7, 2023
Week 1 Gradient Checking Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	676	June 18, 2021
Gradient Checking - Course 2 Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	553	December 1, 2021
Normalize question from gradient checking slide Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	487	October 14, 2022

A doubt on gradient checking

Related topics