In the first course we derived the explicit formulas for gradient descent in the case of regression. I am wondering if anyone has a reference that carries out all of those computations, but for a simple two-layer network with, for example, ReLU activations and mean squared error loss.

Hello @fragodec,

Welcome to our community! Course 1 of the Deep Learning Specialization will teach you about gradient descent for a multi-layer neural network. Although it focuses on classification, if you understand the material in that course together with what you have learnt in this specialization, you should be able to change the cost function to MSE and make it work for a regression problem.
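For reference, the only step that changes at the output layer is the derivative of the cost. A sketch, using the course-style notation and assuming a linear output $\hat{y} = Z^{[2]}$ with MSE cost:

$$J = \frac{1}{2m}\sum_{i=1}^{m}\left(\hat{y}^{(i)} - y^{(i)}\right)^2, \qquad dZ^{[2]} = \frac{\partial J}{\partial Z^{[2]}} = \frac{1}{m}\left(\hat{y} - y\right)$$

The remaining backprop steps ($dW^{[2]} = dZ^{[2]} A^{[1]T}$, $dZ^{[1]} = W^{[2]T} dZ^{[2]} \ast g'(Z^{[1]})$, and so on) are exactly the same as in the classification case.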

Raymond

The computations for backpropagation were covered in Andrew’s original Machine Learning course, but they’re not covered here.

This article covers the basics, though it’s not a simple read.

Note that the examples you find online usually use an NN with a linear output. This is because the backpropagation calculations for a linear output are a lot simpler than for a logistic output (which you would use for classification). The cost function for a linear output is the simple sum-of-squared-errors variety.
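To make that concrete, here is a minimal NumPy sketch (not from the course; all sizes, the learning rate, and the random data are made-up for illustration) of gradient descent on a two-layer network with a ReLU hidden layer, a linear output, and MSE cost:

```python
import numpy as np

# One hidden ReLU layer, linear output, MSE cost J = (1/2m) * sum((yhat - y)^2).
# Shapes follow the course convention: X is (n_x, m), Y is (1, m).
rng = np.random.default_rng(0)
n_x, n_h, m = 3, 4, 8
X = rng.normal(size=(n_x, m))
Y = rng.normal(size=(1, m))

# Hypothetical small random initialization
W1 = rng.normal(size=(n_h, n_x)) * 0.1
b1 = np.zeros((n_h, 1))
W2 = rng.normal(size=(1, n_h)) * 0.1
b2 = np.zeros((1, 1))

lr = 0.1
costs = []
for _ in range(200):
    # Forward pass
    Z1 = W1 @ X + b1
    A1 = np.maximum(0, Z1)                 # ReLU activation
    Z2 = W2 @ A1 + b2                      # linear output (yhat)
    costs.append(np.mean((Z2 - Y) ** 2) / 2)

    # Backward pass (chain rule)
    dZ2 = (Z2 - Y) / m                     # dJ/dZ2 for MSE + linear output
    dW2 = dZ2 @ A1.T
    db2 = dZ2.sum(axis=1, keepdims=True)
    dA1 = W2.T @ dZ2
    dZ1 = dA1 * (Z1 > 0)                   # ReLU derivative is 1 where Z1 > 0
    dW1 = dZ1 @ X.T
    db1 = dZ1.sum(axis=1, keepdims=True)

    # Gradient descent update
    W1 -= lr * dW1; b1 -= lr * db1
    W2 -= lr * dW2; b2 -= lr * db2
```

Running this, the cost in `costs` decreases steadily, which is a handy sanity check when you derive the gradients by hand.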