Changing loss function - what changes in gradient descent?

Rodrigo_Pedro · September 20, 2022, 12:14am

I know there are loss functions other than cross entropy. My (potentially stupid) question is: if I wanted to try a different loss function, what would I need to change in gradient descent in terms of calculations? Only dA^{[L]}? Do all the formulas shown in the embedded image still apply?

[image]

Rashmi · September 21, 2022, 4:59pm

Hello, Rodrigo Pedro.

To answer your query mathematically, look at the equation again. The terms are directly proportional: if one part of it increases, that has the same effect on the other terms on the right-hand side of the equation.

So every change has a similar effect on each of the terms involved on both sides of the equation.
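
To make the question concrete, here is a minimal sketch for a single sigmoid output unit. It assumes the image shows the usual backpropagation formulas, where dA^{[L]} is the derivative of the loss with respect to the output activation and dZ^{[L]} = dA^{[L]} * g'(Z^{[L]}). The squared-error loss and all function names below are illustrative choices, not something taken from the course. Under those assumptions, swapping the loss only swaps the function that computes dA^{[L]}; the chain-rule step stays the same. The shortcut dZ^{[L]} = A^{[L]} - Y, however, is specific to sigmoid combined with cross entropy and should not be reused with another loss.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def dA_cross_entropy(a, y):
    # dL/dA for L = -(y*log(a) + (1 - y)*log(1 - a))
    return -(y / a) + (1.0 - y) / (1.0 - a)


def dA_squared_error(a, y):
    # dL/dA for L = 0.5 * (a - y)**2 (a hypothetical alternative loss)
    return a - y


def output_layer_dZ(z, y, dA_fn):
    # Generic chain-rule step: dZ = dA * g'(Z), with g = sigmoid,
    # whose derivative is a * (1 - a). Only dA_fn depends on the loss.
    a = sigmoid(z)
    return dA_fn(a, y) * a * (1.0 - a)


# One training example with illustrative values.
z, y = 0.3, 1.0
a = sigmoid(z)

dz_ce = output_layer_dZ(z, y, dA_cross_entropy)  # simplifies to a - y
dz_se = output_layer_dZ(z, y, dA_squared_error)  # (a - y) * a * (1 - a)

print("cross entropy dZ:", dz_ce, " shortcut a - y:", a - y)
print("squared error dZ:", dz_se)
```

Everything computed from dZ^{[L]} onward (dW, db, and the dA passed to the previous layer) uses the same expressions for either loss, so in this sketch the loss-specific part is confined to the dA function.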
