In the function modelf(), after the Conv1D layer the BatchNormalization layer comes before the Dropout layer, while after the first GRU layer Dropout comes before BatchNormalization. Does the order matter, and how should the order be chosen?
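For reference, here is a minimal sketch of the two orderings I am asking about. This is not the assignment's actual modelf(); the input shape, layer sizes, and dropout rates below are placeholder assumptions, only the relative order of BatchNormalization and Dropout is the point.

```python
from tensorflow.keras.layers import (Input, Conv1D, BatchNormalization,
                                     Activation, Dropout, GRU)
from tensorflow.keras.models import Model

def sketch_model(input_shape=(1000, 64)):  # illustrative shape, not the assignment's
    X_input = Input(shape=input_shape)

    # Ordering 1: Conv1D -> BatchNormalization -> Dropout
    X = Conv1D(filters=196, kernel_size=15, strides=4)(X_input)
    X = BatchNormalization()(X)   # normalize the conv activations first
    X = Activation("relu")(X)
    X = Dropout(0.2)(X)           # then randomly zero some activations

    # Ordering 2: GRU -> Dropout -> BatchNormalization
    X = GRU(units=128, return_sequences=True)(X)
    X = Dropout(0.2)(X)           # drop first
    X = BatchNormalization()(X)   # then normalize what remains

    return Model(inputs=X_input, outputs=X)
```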
Hi there, I think this is a good question.
From my point of view, the order matters because the two arrangements give you two different network structures, although in practice the difference can sometimes be negligible. I found a post that might be helpful, and I also look forward to other replies with more insight.
I would think it should matter. Even if the outputs are not noticeably different in some empirical experiment, that does not mean differences would not show up at larger scale or with different input data sets.
First, the last layer would usually have one unit for two-class classification. Having L1 with 1 unit and the last layer with 4 units would just be a wrong design.
Second, it would seem that the input data has the most complexity (least ordered, high entropy), and as we go from left to right we reduce that complexity down to the much lower complexity of a two-class classification.
So maybe there is a natural ordering of layer sizes from input to output where the size of layer k >= the size of layer k + 1. I am just guessing.
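As a rough sketch of that funnel idea (the widths below are arbitrary assumptions, not a recommendation, and this is plain Keras rather than the course's modelf()):

```python
from tensorflow.keras import Sequential
from tensorflow.keras.layers import Input, Dense

# Layer widths shrink from input toward the output, ending in a single
# sigmoid unit for two-class classification. Widths are illustrative only.
funnel = Sequential([
    Input(shape=(128,)),             # high-dimensional input features
    Dense(64, activation="relu"),    # each layer no wider than the one before
    Dense(32, activation="relu"),
    Dense(1, activation="sigmoid"),  # one unit for binary classification
])
funnel.summary()
```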