Week 1 - Gradient Function explanation


Hello Mentors,

I am working on optional Lab 04 of Week 1 and going through the following notebook:
C1_W1_Lab04_Gradient_Descent_Soln

I am trying to understand gradient_function(x, y, w, b). Can someone share the location of the file where gradient_function() is defined?

“gradient_function” is the name of one of the function arguments.

It’s a reference to whatever function is passed in when the code in your image is called.

Could you elaborate a little more?

I am unable to understand how you called gradient_function without defining it.

See this bit of code from the lab:

When gradient_descent() is called, it passes “compute_gradient” as that function argument.

The definition of the gradient_descent() function uses the 8th argument as the “gradient_function”.
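In plain Python, the pattern looks roughly like this. This is a simplified sketch, not the lab’s exact code; the signature and argument names here just mirror the idea of passing a function object as the gradient_function parameter:

```python
import numpy as np

def compute_gradient(x, y, w, b):
    """Gradient of the squared-error cost for f(x) = w*x + b (one feature)."""
    m = x.shape[0]
    dj_dw = 0.0
    dj_db = 0.0
    for i in range(m):
        f_wb = w * x[i] + b          # model prediction for example i
        dj_dw += (f_wb - y[i]) * x[i]
        dj_db += (f_wb - y[i])
    return dj_dw / m, dj_db / m

def gradient_descent(x, y, w_in, b_in, alpha, num_iters, gradient_function):
    """gradient_function is just a parameter: any callable with
    signature (x, y, w, b) that returns (dj_dw, dj_db) will work."""
    w, b = w_in, b_in
    for _ in range(num_iters):
        dj_dw, dj_db = gradient_function(x, y, w, b)  # call whatever was passed in
        w -= alpha * dj_dw
        b -= alpha * dj_db
    return w, b

# The function object itself (no parentheses!) is passed as an argument:
x = np.array([1.0, 2.0])
y = np.array([300.0, 500.0])
w_final, b_final = gradient_descent(x, y, 0.0, 0.0, 1e-2, 10000, compute_gradient)
```

Note that compute_gradient is passed without parentheses: writing compute_gradient(...) would call it immediately, while the bare name hands the function object itself to gradient_descent.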

1 Like

I missed that part. I should have observed it better. Thank you so much for explaining with screenshots. It was very helpful.

1 Like

Hi, I had the same question as tamalmallick. Based on your response, I am starting to understand that the final call of gradient_descent() does use the functions compute_cost and compute_gradient defined earlier with def.

Why use the parameters cost_function and gradient_function when defining gradient_descent? Couldn’t the lab have passed in compute_cost and compute_gradient directly?

Yes, they could have just called those functions directly, but they are showing you a way to write more flexible, general code in Python. In Python you can pass references to functions as parameters to other functions. So they have written a general gradient_descent function that can be used with different cost functions.

For example, suppose that you were considering two different cost functions for a given problem. With the code they wrote, you would only have to write the gradient descent logic once and then you could try both cost functions and compare the results.

Of course, note that the compute_gradient function is paired with the compute_cost function. If you use a different cost function, then the gradient values must be derived from that cost function instead.
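To illustrate that point with a sketch (this is not lab code; gradient_mse and gradient_mae are made-up names here), the same gradient_descent loop can be reused with the gradients of two different cost functions:

```python
import numpy as np

def gradient_descent(x, y, w, b, alpha, num_iters, gradient_function):
    """Generic loop: works with ANY gradient_function of signature (x, y, w, b)."""
    for _ in range(num_iters):
        dj_dw, dj_db = gradient_function(x, y, w, b)
        w -= alpha * dj_dw
        b -= alpha * dj_db
    return w, b

def gradient_mse(x, y, w, b):
    """Gradient of the mean squared error cost."""
    err = (w * x + b) - y
    return np.mean(err * x), np.mean(err)

def gradient_mae(x, y, w, b):
    """(Sub)gradient of the mean absolute error cost -- a different
    cost function, so the gradient formula is different too."""
    sign = np.sign((w * x + b) - y)
    return np.mean(sign * x), np.mean(sign)

x = np.array([1.0, 2.0, 3.0])
y = np.array([2.0, 4.0, 6.0])

# The gradient-descent logic is written once; only the gradient changes:
w_mse, b_mse = gradient_descent(x, y, 0.0, 0.0, 0.05, 5000, gradient_mse)
w_mae, b_mae = gradient_descent(x, y, 0.0, 0.0, 0.05, 5000, gradient_mae)
```

Each gradient function must match its own cost function; mixing the MSE cost with the MAE gradient (or vice versa) would optimize the wrong objective.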

That context is really helpful, thank you for clarifying!

For me it was also rather confusing at the beginning. I know the lab’s version is more general, but back then I needed something more understandable. Here is some code I simplified for myself for better understanding (I also removed J; it’s nice to see, but I am a beginner here, so the smaller the code that does something, the more understandable it is).

I appreciate your help, but I must delete the code from your reply because sharing your code for a graded assignment is not allowed.