Questions about the number of neurons in the output layer

paulinpaloalto · January 23, 2022, 6:10pm

The output of a multiclass network is input to the softmax activation function and then the “cross entropy” loss function is applied to compute the cost. If you have a dataset with 10 classes and you use a softmax output layer with, say, 13 classes, it doesn’t really do that much harm at least in terms of the prediction accuracy of your model. You’ll have 3 labels that never occur: there are literally no samples that have those values as labels. That means if the network predicts one of those values for a particular sample, the cost function will punish that heavily, because it’s obviously a wrong answer. So assuming that you’ve made good choices for all your other hyperparameters, the trained model you get should never predict those three “extra” classes.

So it should do no harm to the accuracy of your model, but it also does you no good and just wastes memory space and compute cycles. Your training will run slower and it has no other benefit, so it is recommended that you define your output layer correctly.

Topic		Replies	Views
Exercise 4 in lab 1 Introduction to TF for Artificial Intelligence ... week-module-2	1	522	January 14, 2023
Number of neurons in prediction layer Introduction to TF for Artificial Intelligence ... week-module-2	2	532	November 15, 2022
About how to determine the number of layers of the neural network and the number of neurons in each layer Advanced Learning Algorithms week-module-2	1	590	February 6, 2023
Multiclass - class values Advanced Learning Algorithms week-module-2	17	529	December 25, 2022
C2W4 Assignment has wrong expected outputs and confusing output layer Convolutional Neural Networks in TensorFlow week-module-4	13	92	August 30, 2024

Questions about the number of neurons in the output layer

Related topics