[Week 3: Trigger Word Detection] `modelf` architecture

Hey guys!

I’m unsure whether I understand the architecture of `modelf` in the Trigger Word Detection assignment correctly: after the convolutional layer, each sample consists of 1375 time steps, with each time step having 196 features.
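If it helps, the 1375 follows from the standard "valid" 1-D convolution length formula. This is just a sketch assuming the assignment's hyperparameters (5511 input spectrogram frames, kernel size 15, stride 4, 196 filters):

```python
# Output length of a 1-D convolution with no padding ("valid"):
# floor((n_steps - kernel_size) / stride) + 1
def conv1d_output_length(n_steps, kernel_size, stride):
    return (n_steps - kernel_size) // stride + 1

# Assumed assignment values: 5511 frames in, kernel 15, stride 4.
n_out = conv1d_output_length(5511, kernel_size=15, stride=4)
print(n_out)  # 1375 time steps; each has 196 features (one per filter)
```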
After that, batch normalization is applied along the last axis (i.e. the feature axis). So normalization is done per feature, using the same parameters for every time step, right?
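Here is a minimal numpy sketch of what I think `BatchNormalization(axis=-1)` does in this setting: statistics and the learned scale/offset are per feature (196 of each), shared across the batch and all 1375 time steps (the shapes are from the assignment; `1e-3` is Keras's default epsilon):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=(2, 1375, 196))        # (batch, time steps, features)

# One mean/variance per feature, pooled over batch and time axes.
mean = x.mean(axis=(0, 1), keepdims=True)  # shape (1, 1, 196)
var = x.var(axis=(0, 1), keepdims=True)

gamma = np.ones(196)                       # 196 learned scales, one per feature
beta = np.zeros(196)                       # 196 learned offsets

x_norm = gamma * (x - mean) / np.sqrt(var + 1e-3) + beta
print(x_norm.shape)                        # (2, 1375, 196): shape is unchanged
```

So there are only 196 parameters of each kind, not 1375 × 196, which matches "same parameters for each time step".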
And regarding dropout: I understand dropout within a normal dense layer (some units of the layer are ‘switched off’), but I don’t understand it in this context. I think for each sample, 20% of the 1375 × 196 activations are simply not passed to the next layer. But it’s just 20% of the activations in total, with no attention paid to spreading them evenly, i.e. it does not drop exactly 20% of the 1375 time steps or 20% of the 196 features, correct?
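To make my question concrete, here is a numpy sketch of what I believe `Dropout(0.2)` does with its default settings: each of the 1375 × 196 activations is kept or dropped independently with probability 0.2, so roughly 20% are dropped overall, with no structure across time steps or features (survivors are scaled by 1/0.8, as in inverted dropout):

```python
import numpy as np

rng = np.random.default_rng(0)
rate = 0.2
x = np.ones((1375, 196))                 # activations after the conv layer

# Independent Bernoulli keep/drop decision for every single element.
mask = rng.random(x.shape) >= rate       # True = keep
y = np.where(mask, x / (1 - rate), 0.0)  # inverted dropout: scale survivors

dropped_fraction = 1 - mask.mean()
print(round(dropped_fraction, 3))        # close to 0.2, but not exactly 0.2
```

So the drops are only ~20% in expectation per element, not an exact, evenly distributed 20% per time step or per feature. Is that the right picture?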

Best, Elke