W3_A1: ReLU as activation function

Has anyone tried using the ReLU activation function in the Week 3 programming assignment of Course 1?
I tried it and I am getting an accuracy of only 59%, and the decision boundary looks similar to that of logistic regression. Can anyone explain why that is?

Yes, you can get this to work, but it requires a couple of things:

First check that you did the complete implementation of ReLU: it’s not just forward propagation that is affected, right? You need to modify your back prop logic as well. The derivative of ReLU is different from the derivative of tanh.
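Here is a minimal sketch of what that change looks like, assuming the usual variable names from the assignment (`Z1`, `A1`, `W2`, `dZ2`); your own notebook may use different names, so treat this as an illustration rather than the exact code:

```python
import numpy as np

def relu(Z):
    # Forward pass: element-wise max(0, Z)
    return np.maximum(0, Z)

def relu_derivative(Z):
    # Derivative of ReLU: 1 where Z > 0, 0 elsewhere
    return (Z > 0).astype(float)

# In forward propagation, replace the tanh hidden activation:
#   A1 = np.tanh(Z1)           # original tanh version
#   A1 = relu(Z1)              # ReLU version

# In back propagation, the hidden-layer gradient must change to match:
#   dZ1 = np.dot(W2.T, dZ2) * (1 - np.power(A1, 2))   # tanh derivative
#   dZ1 = np.dot(W2.T, dZ2) * relu_derivative(Z1)     # ReLU derivative
```

If you only change the forward pass and keep the tanh term `(1 - A1**2)` in back prop, the gradients no longer match the function you are actually computing, which is one common way to end up with logistic-regression-like performance.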

Then it just turns out that you need quite a few more neurons in the hidden layer and more iterations in order to get reasonable performance from ReLU on this particular task.
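As a rough experiment, you could sweep a few hidden-layer sizes with a larger iteration budget and compare accuracies. This sketch assumes the notebook's `nn_model` and `predict` helpers and their signatures; adjust the names and arguments to match your own implementation:

```python
# Hypothetical sweep over hidden-layer sizes with more training iterations.
for n_h in (4, 10, 20, 50):
    parameters = nn_model(X, Y, n_h=n_h, num_iterations=20000)
    predictions = predict(parameters, X)
    accuracy = float(np.mean(predictions == Y)) * 100
    print(f"n_h = {n_h}: accuracy = {accuracy:.1f}%")
```

With only 4 hidden units, ReLU tends to underperform tanh on this dataset; increasing the hidden-layer size and the number of iterations usually closes the gap.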

Here’s a thread from a while back that is on this same topic and goes into quite a bit of detail. And here’s one about the derivative of ReLU.


Here’s another good thread I found by searching for “planar data relu”.


Got it, thanks a lot!