Week 3, Programming assignment: how did sigmoid or ReLU perform for you?

paulinpaloalto · December 6, 2021, 4:12pm

It is great that you are trying this type of experiment! You always learn something interesting when you extend the material like this. It is possible to get almost as good performance using ReLU, but it requires a lot more than 4 neurons in the hidden layer. It was also a good idea to experiment with the other hyperparameters like learning rate and number of iterations, because there is no guarantee that the combination that worked well with tanh will work with the other activations. Mathematically, tanh and sigmoid are closely related (tanh is a scaled and shifted sigmoid), so I would expect you could get essentially the same results with sigmoid after a little tweaking of the learning rate and number of iterations. Here’s a thread about the relationship between tanh and sigmoid.
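In case it helps to see that relationship concretely: the standard identity is tanh(z) = 2 * sigmoid(2z) - 1. Here is a minimal sketch that checks it numerically. The function names and sample points are just ones I picked for illustration; they are not from the assignment notebook.

```python
import math


def sigmoid(z: float) -> float:
    """Logistic sigmoid: 1 / (1 + e^(-z))."""
    return 1.0 / (1.0 + math.exp(-z))


def tanh_via_sigmoid(z: float) -> float:
    """tanh written in terms of sigmoid: tanh(z) = 2 * sigmoid(2z) - 1."""
    return 2.0 * sigmoid(2.0 * z) - 1.0


# Compare against the library tanh at a few arbitrary sample points.
samples = [-3.0, -1.0, -0.1, 0.0, 0.5, 2.0]
max_gap = max(abs(math.tanh(z) - tanh_via_sigmoid(z)) for z in samples)
print(f"largest difference: {max_gap:.2e}")
```

The difference should be at floating point rounding level. Since tanh is just sigmoid with the input scaled by 2 and the output stretched to (-1, 1), it is plausible that a different learning rate is all it takes to make up for the change.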

But ReLU is a different matter. Here’s an earlier thread that gives some results other students have gotten applying ReLU to this problem.

Thanks for sharing the results of your experiments! This is all an experimental science!
