Week 3, Programming assignment: how were your performances of sigmoid or ReLu?

Alberto_Ingenito · December 6, 2021, 2:42pm

I tried to change tanh activation function with sigmoid and ReLu on hidden layers. With sigmoid and same hyperparameters the model achieved slightly lower accuracy (from 90% to 89%). With ReLu accuracy was much lower (~60%).

I even tried to reduce learning rate, extend the number of iterations and doubling the number of hidden nodes.

Had anyone similar results?

I expected better performances using ReLu.

paulinpaloalto · December 6, 2021, 4:12pm

It is great that you are trying this type of experiment! You always learn something interesting when you extend the material and try things like this. It is possible to get almost as good performance using ReLU, but it requires a lot more than 4 neurons in the hidden layer. It is great that you experimented with the other hyperparameters like learning rate and number of iterations. There is no guarantee that the same combination that worked well with tanh will work with the others. There is actually quite a close relationship mathematically between tanh and sigmoid, so I would expect you could also get essentially the same results with a little tweaking of the learning rate and number of iterations. Here’s a thread about the relationship between tanh and sigmoid.

But ReLU is a different matter. Here’s an earlier thread that gives some results other students have gotten applying ReLU to this problem.

Thanks for sharing the results of your experiments! This is all an experimental science!

Topic		Replies	Views
W 3_A1_ReLU vs tanh accuracy Neural Networks and Deep Learning	8	645	November 13, 2022
Course1 - Week3 Assignment - ReLU gave worse performance than tanh Neural Networks and Deep Learning	3	548	September 9, 2021
How to apply relu function in Exercise of week 3(optional).) Neural Networks and Deep Learning	5	540	July 12, 2023
ReLU and sigmoid alternatives in Week 3 assignment Neural Networks and Deep Learning	11	884	July 20, 2022
DL and NN course1 Week#3: Understanding Activation functions Neural Networks and Deep Learning week-3	2	30	March 4, 2025

Week 3, Programming assignment: how were your performances of sigmoid or ReLu?

Related topics