It’s a great idea to experiment with using ReLU as the hidden layer activation in this exercise. You always learn something interesting when you try to extend the ideas in the course. Of course, you will need to change more than just the forward prop logic: the derivative of the hidden layer activation appears in back prop as well, so the tanh derivative term has to be replaced by the ReLU derivative (see the sketch below).
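Here is a minimal sketch of the two changes for a single hidden layer, using the course's notation (W1, b1, A1, dZ2, etc.) but with made-up shapes and random data, so the exact names and dimensions in your notebook may differ:

```python
import numpy as np

def relu(Z):
    """ReLU activation, applied elementwise."""
    return np.maximum(0, Z)

def relu_derivative(Z):
    """Derivative of ReLU: 1 where Z > 0, else 0 (the value at 0 is a convention)."""
    return (Z > 0).astype(float)

# Tiny demo with made-up shapes: n_x inputs, n_h hidden units, m examples
np.random.seed(1)
n_x, n_h, n_y, m = 2, 10, 1, 5
X = np.random.randn(n_x, m)
Y = (np.random.rand(n_y, m) > 0.5).astype(float)
W1, b1 = np.random.randn(n_h, n_x) * 0.01, np.zeros((n_h, 1))
W2, b2 = np.random.randn(n_y, n_h) * 0.01, np.zeros((n_y, 1))

# Forward prop change: hidden layer uses ReLU instead of tanh
Z1 = np.dot(W1, X) + b1
A1 = relu(Z1)                      # was: A1 = np.tanh(Z1)
Z2 = np.dot(W2, A1) + b2
A2 = 1 / (1 + np.exp(-Z2))         # output layer stays sigmoid

# Back prop change: the tanh term (1 - A1**2) is replaced by the ReLU derivative
dZ2 = A2 - Y
dW2 = np.dot(dZ2, A1.T) / m
db2 = np.sum(dZ2, axis=1, keepdims=True) / m
dZ1 = np.dot(W2.T, dZ2) * relu_derivative(Z1)   # was: * (1 - np.power(A1, 2))
dW1 = np.dot(dZ1, X.T) / m
db1 = np.sum(dZ1, axis=1, keepdims=True) / m
```

Note that everything downstream of dZ1 (dW1, db1 and the gradient descent updates) is unchanged; only the activation and its derivative are swapped.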
There have been a couple of other threads about this in the past, e.g. this one. I was able to get pretty good accuracy using ReLU, but it takes quite a few more hidden units to get results equivalent to what tanh gives with only 4 hidden units.
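If you want to try that comparison yourself, a rough sweep over hidden layer sizes is one way to do it. The sketch below is meant to run inside the assignment notebook and assumes you have already swapped ReLU into your nn_model(), and that the assignment's predict() helper and the X, Y data are in scope, so treat it as a rough guide rather than drop-in code:

```python
import numpy as np

# Hypothetical sweep over hidden layer sizes, run in the notebook after
# modifying nn_model() to use ReLU in the hidden layer.
for n_h in [4, 10, 20, 50]:
    parameters = nn_model(X, Y, n_h, num_iterations=10000)
    predictions = predict(parameters, X)
    accuracy = float(np.mean(predictions == Y)) * 100
    print(f"{n_h} hidden units: {accuracy:.1f}% training accuracy")
```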