MATLAB code for hidden layer assignment DLS1 week3

Hi guys,

In order to understand this assignment better, I decided to recreate the same process in MATLAB. However, I am getting quite different results, and it is getting difficult to follow.

One of the salient differences between MATLAB and Python is that indexing in MATLAB is 1-based while in Python it's 0-based, right? Of course, in MATLAB you have the beautiful property that vectors and matrices are a natural part of the language, so you don't have to deal with NumPy as a separate library.
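For anyone porting the notebook code, here is a minimal sketch of the indexing difference (NumPy side, with the MATLAB equivalents in the comments; the array values are just for illustration):

```python
import numpy as np

X = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 6.0]])

# Python/NumPy is 0-based: the first column is X[:, 0]
first_col = X[:, 0]    # MATLAB (1-based): X(:, 1)

# Negative indices count from the end, which MATLAB writes with the `end` keyword
last_col = X[:, -1]    # MATLAB: X(:, end)
```

Off-by-one bugs in a port between the two usually come from exactly this difference.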


I took a quick look at your code. There must be a better way to handle the parameters than passing them around as lists. I used to know MATLAB pretty well, but haven't written any in a few years. Doesn't it have a keyed data structure like Python dictionaries, e.g. structs or containers.Map? You also shouldn't need to pass m around, since you can deduce it from the shapes of the X, A, or Z values, right?
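For comparison, this is roughly how the course notebook organizes things on the Python side (the layer sizes here are just illustrative, not from the original post): the parameters live in one dictionary keyed by name, and m falls straight out of the shape of X:

```python
import numpy as np

n_x, n_h, n_y = 2, 4, 1   # illustrative layer sizes

# One dictionary holds all parameters, keyed by name
parameters = {
    "W1": np.zeros((n_h, n_x)),
    "b1": np.zeros((n_h, 1)),
    "W2": np.zeros((n_y, n_h)),
    "b2": np.zeros((n_y, 1)),
}

X = np.zeros((n_x, 5))    # 5 training examples, stacked as columns
m = X.shape[1]            # number of examples: no need to pass m separately
```

A MATLAB struct (`parameters.W1 = ...`) plays the same role as the dictionary, and `size(X, 2)` plays the same role as `X.shape[1]`.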

The one thing I can see that is clearly different is your initialization function:

You multiply by 0.001 instead of 0.01. But the other issue is that you are using rand, which samples the uniform distribution on [0, 1], right? In the course notebook, we use the normal distribution with μ = 0 and σ = 1 (randn in MATLAB). Big difference! Initialization matters …
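To illustrate the difference (NumPy here; MATLAB's rand/randn behave analogously, and the 4×2 shape is just an example):

```python
import numpy as np

rng = np.random.default_rng(0)

# Uniform on [0, 1): every entry is non-negative, centered around 0.5
W_uniform = rng.random((4, 2)) * 0.01

# Standard normal (mu = 0, sigma = 1), then scaled: what the notebook does
W_normal = rng.standard_normal((4, 2)) * 0.01
```

The uniform version can never produce a negative weight, so the hidden units all start biased in the same direction, which is why the choice of distribution matters here.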

Also you have a “copy/paste” error in the comments in your tanh function. :nerd_face: :laughing:


Thanks for that! I guess you are right about the initialization of the parameters. Let me make some adjustments. I was playing with the 0.01 value because, although the first value of the cost function matches the exercise, everything turns bad later on.

Yes, the rand function samples a uniform distribution. But I am getting this crazy iteration curve.
I have edited this reply with a more suitable learning rate value of 0.05. I will try to run the code with other data; perhaps the way I import it is incorrect.
[image: Learning rate 0.05]

Hmmm, there can be some oscillation with that high a learning rate, but that does look kind of crazy. The first question is whether your back prop logic is correct. My take is that you have not implemented the derivative of tanh correctly. Have a look and compare that code to what you wrote in the notebook.
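For reference, the derivative back prop needs is g'(z) = 1 − tanh(z)², which is easy to sanity-check numerically; a minimal sketch:

```python
import numpy as np

def tanh_derivative(z):
    # d/dz tanh(z) = 1 - tanh(z)**2
    return 1.0 - np.tanh(z) ** 2

# Sanity-check against a central finite-difference approximation
z = 0.5
eps = 1e-6
numeric = (np.tanh(z + eps) - np.tanh(z - eps)) / (2 * eps)
assert abs(tanh_derivative(z) - numeric) < 1e-8
```

A quick check like this against your own MATLAB implementation would catch the kind of mistake suspected here.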


Yeah, that could be the reason…

Yep sir! A little mistake in the derivative of tanh!
[image: Learning rate 0.05_V2]

Very cool! That’s a nice looking convergence curve!

Just out of curiosity, is that curve with the learning rate of 1.2 or with a lower rate?

That one is with the lower rate of 0.05. With 1.2, the convergence is quite steep. I am happy with the results. Thank you so much for taking the time to check my code!