Why use a Dense 512 node layer before the final binary classification node?

Brendon_Wolff-Piggot · September 26, 2022, 9:27am

The course code places two Dense layers after the final max-pooling and flatten layers, e.g.:
tf.keras.layers.MaxPooling2D(2,2),
tf.keras.layers.Flatten(),
tf.keras.layers.Dense(512, activation='relu'),
tf.keras.layers.Dense(1, activation='sigmoid')

What is the purpose of the Dense layer with 512 nodes? I tried doubling the number of nodes (to 1024) and, separately, removing the layer altogether; neither made much difference to the accuracy of the network’s predictions. This was in the assignment at the end of Week 1.

Thanks!

alvaroramajo · September 26, 2022, 12:15pm

Hi, @Brendon_Wolff-Piggot!

The final dense layers take the features extracted by the convolutional layers and combine them to produce the final output. The number of neurons is not strictly prescribed, so you can try different configurations to further optimize the network’s performance.
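
To get a feel for what each configuration changes, here is a rough sketch in plain Python that counts the trainable parameters in the classifier head for the three variants mentioned above (no hidden layer, 512 units, 1024 units). The flattened feature size is an assumption made up for illustration (a 7x7 feature map with 64 channels), not the assignment's actual shape, and dense_params / head_params are hypothetical helper names.

```python
def dense_params(n_inputs, n_units):
    """Trainable parameters in a fully connected layer: weights plus biases."""
    return n_inputs * n_units + n_units


def head_params(n_features, hidden_units=None):
    """Parameters in the classifier head that follows Flatten.

    hidden_units=None means the single sigmoid unit is attached
    directly to the flattened features (no hidden Dense layer).
    """
    if hidden_units is None:
        return dense_params(n_features, 1)
    return dense_params(n_features, hidden_units) + dense_params(hidden_units, 1)


# Assumed flattened size (hypothetical): a 7x7 feature map with 64 channels.
N_FEATURES = 7 * 7 * 64

for units in (None, 512, 1024):
    label = "no hidden layer" if units is None else f"Dense({units})"
    print(f"{label:>16}: {head_params(N_FEATURES, units):,} parameters")
```

Without the hidden layer, the head is just a logistic regression on the flattened features; with it, the network can learn nonlinear combinations of those features at the cost of many more parameters. If accuracy barely moves across the three variants, one plausible reading is that the convolutional features are already informative enough for this task, though that would need checking on validation data.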
