Derivative of Relu in output layer

paulinpaloalto · November 23, 2022, 3:33pm

This is probably covered on Rashmi’s link, but it looks like there is some definite pattern in the wrong answers: they are all just 0 and they seem to alternate with the correct answers. Seems like it’s worth some analysis to see if you can see any patterns in the inputs that give bad results versus the good ones. The other thing to consider is that maybe it’s not such a great idea to use ReLU as the output layer activation. The reason you are getting zero answers must be that the predictions were negative at the linear activation level, right? Try using Leaky ReLU and see if that gives negative predictions for some values. I assume negative values would not make sense in your application. Other possibilities would be swish. Or if any output value between -\infty and \infty makes sense, just eliminate the output activation altogether.

Topic		Replies	Views
Spikes in cost function plot for deep "relu" nn Neural Networks and Deep Learning coursera-platform	24	843	November 4, 2021
Feedforward Neural Networks in Depth Deep Learning Resources coursera-platform	68	112907	September 20, 2025
Having some trouble on week 2 lab Neural Networks and Deep Learning week-module-2 , coursera-platform	21	87	February 8, 2025
What is the Cost Function for Softmax? Advanced Learning Algorithms week-module-2	121	650	May 18, 2025
Week 2 Logistic Regression with a Neural Network Mindset Neural Networks and Deep Learning coursera-platform	24	877	September 5, 2021

Derivative of Relu in output layer

Related topics