I tried ReLU as well, and my results look exactly like yours. The one observation is that in the standard n = 4 case, learning is really slow, so it might be worth fiddling with more iterations or a higher learning rate, although the learning rate already defaults to 1.2 here, which is pretty high. But then I just ran hidden layer sizes from 1 to 50 with no hyperparameter changes, and the n = 50 case actually works pretty well at about 85% accuracy:
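For anyone who wants to reproduce the sweep, here's a rough, self-contained sketch of what I mean. To be clear, this isn't the notebook's code: it uses sklearn's `make_moons` as a stand-in dataset and a bare-bones 1-hidden-layer ReLU net, so the exact accuracies won't match the exercise, but the loop over hidden layer sizes is the idea.

```python
# Rough sketch of the hidden-unit sweep (not the notebook's code).
# make_moons is a stand-in dataset; lr=1.2 mirrors the default mentioned above.
import numpy as np
from sklearn.datasets import make_moons

def train_relu_net(X, Y, n_h, lr=1.2, iters=10000, seed=3):
    """One hidden layer (ReLU) + sigmoid output, plain batch gradient descent."""
    rng = np.random.default_rng(seed)
    n_x, m = X.shape
    W1 = rng.standard_normal((n_h, n_x)) * 0.01
    b1 = np.zeros((n_h, 1))
    W2 = rng.standard_normal((1, n_h)) * 0.01
    b2 = np.zeros((1, 1))
    for _ in range(iters):
        # forward pass
        Z1 = W1 @ X + b1
        A1 = np.maximum(0, Z1)              # ReLU
        Z2 = W2 @ A1 + b2
        A2 = 1 / (1 + np.exp(-Z2))          # sigmoid output
        # backward pass (binary cross-entropy)
        dZ2 = A2 - Y
        dW2 = dZ2 @ A1.T / m
        db2 = dZ2.mean(axis=1, keepdims=True)
        dZ1 = (W2.T @ dZ2) * (Z1 > 0)       # ReLU derivative
        dW1 = dZ1 @ X.T / m
        db1 = dZ1.mean(axis=1, keepdims=True)
        W1 -= lr * dW1; b1 -= lr * db1
        W2 -= lr * dW2; b2 -= lr * db2
    return (W1, b1, W2, b2)

def accuracy(params, X, Y):
    W1, b1, W2, b2 = params
    A1 = np.maximum(0, W1 @ X + b1)
    A2 = 1 / (1 + np.exp(-(W2 @ A1 + b2)))
    return float(((A2 > 0.5) == Y).mean())

X, y = make_moons(n_samples=400, noise=0.2, random_state=1)
X, Y = X.T, y.reshape(1, -1)                # shape to (features, examples)
for n_h in [1, 4, 10, 20, 50]:
    params = train_relu_net(X, Y, n_h)
    print(f"n_h = {n_h:2d}: accuracy = {accuracy(params, X, Y):.2f}")
```

Swapping the list for something like [20, 30, 40, 50] would be the quick way to hunt for the sweet spot I mention below.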
But the training takes forever. It might be worth a bit more fiddling to see if there's a sweet spot between n = 20 and n = 50 where we could get good accuracy at reasonable compute cost.

The other thing to notice is that the n = 50 ReLU case takes a fundamentally different approach from either sigmoid or tanh to discriminating that cluster of red dots right at the origin, so it looks like ReLU really does give a qualitatively different solution. Maybe we could get it to do something similar with the blue dots in the upper center of the picture if we gave it even more neurons to work with. Or, as @kenb said, maybe the smarter thing would be to try 2 hidden layers; we could probably get much greater complexity with fewer than 50 total neurons. Worth another try after we finish Week 4 and learn how to build the fully general case!
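Just to make the 2-hidden-layer idea concrete before we have the Week 4 tooling, here's roughly what the forward pass would look like. The layer sizes (10 and 10, i.e. 20 total hidden neurons vs. 50) are made up, not tuned, and backprop for the general L-layer case is exactly what Week 4 covers, so this is only the shape of it.

```python
# Sketch of a 2-hidden-layer forward pass: ReLU -> ReLU -> sigmoid.
# Layer sizes n1 = n2 = 10 are assumptions for illustration only.
import numpy as np

def forward_two_hidden(X, params):
    """Forward pass for a net with two ReLU hidden layers and a sigmoid output."""
    W1, b1, W2, b2, W3, b3 = params
    A1 = np.maximum(0, W1 @ X + b1)             # first hidden layer (ReLU)
    A2 = np.maximum(0, W2 @ A1 + b2)            # second hidden layer (ReLU)
    A3 = 1 / (1 + np.exp(-(W3 @ A2 + b3)))      # sigmoid output probabilities
    return A3

n_x, n1, n2 = 2, 10, 10
rng = np.random.default_rng(0)
params = (rng.standard_normal((n1, n_x)) * 0.01, np.zeros((n1, 1)),
          rng.standard_normal((n2, n1)) * 0.01, np.zeros((n2, 1)),
          rng.standard_normal((1, n2)) * 0.01, np.zeros((1, 1)))
X_demo = rng.standard_normal((n_x, 5))          # 5 dummy 2-D points
print(forward_two_hidden(X_demo, params).shape) # (1, 5) predicted probabilities
```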
