I find it interesting that the NN architecture has two GRU layers.
Question 1: Why do we need two GRU layers? Why is a single GRU layer not sufficient?
Question 2: Why are the two GRU blocks implemented slightly differently? That is, why is an extra dropout added after batch normalization in the second GRU block but not the first?
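For context, the pattern I'm asking about looks roughly like this (a minimal Keras sketch from memory; the input shape, layer sizes, and dropout rates are my placeholders, not the exact model):

```python
from tensorflow.keras.layers import (Input, GRU, Dropout,
                                     BatchNormalization, TimeDistributed, Dense)
from tensorflow.keras.models import Model

# Placeholder input: 100 timesteps, 64 features per timestep.
X_input = Input(shape=(100, 64))

# First GRU block: GRU -> dropout -> batch norm.
X = GRU(units=128, return_sequences=True)(X_input)
X = Dropout(0.5)(X)
X = BatchNormalization()(X)

# Second GRU block: same as above, plus an extra dropout
# *after* batch normalization -- the difference Question 2 asks about.
X = GRU(units=128, return_sequences=True)(X)
X = Dropout(0.5)(X)
X = BatchNormalization()(X)
X = Dropout(0.5)(X)  # the extra dropout in question

# Per-timestep sigmoid output (placeholder head).
outputs = TimeDistributed(Dense(1, activation="sigmoid"))(X)
model = Model(inputs=X_input, outputs=outputs)
model.summary()
```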
Thank you!