I’m slightly confused by the trigger word detection model in week 3. Based on the instructions, it seems the model should be able to detect the trigger word immediately after it is said, which is the main reason it uses a unidirectional rather than a bidirectional RNN – “If we used a bidirectional RNN, we would have to wait for the whole 10sec of audio to be recorded before we could tell if “activate” was said in the first second of the audio clip”. However, isn’t it the case that we have to wait for the whole 10 seconds of audio to finish before we can pass it into the 1D CONV layer? If so, how can the trigger word be detected immediately after it is said?
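To make my confusion concrete, here is a tiny NumPy sketch (my own simplification, not the actual course code – the kernel size and input length are made up) of how I understand the CONV layer. It seems to show that each output of a 1-D convolution depends only on a local window of past frames, so in principle it could be computed in a streaming fashion – which is exactly why I don’t see where the need to wait 10 seconds comes from:

```python
import numpy as np

def conv1d_step(frames, kernel):
    # One output timestep of a "valid" 1-D convolution:
    # it depends only on the most recent len(kernel) input frames.
    return float(np.dot(frames, kernel))

rng = np.random.default_rng(0)
kernel = rng.normal(size=5)    # illustrative kernel_size = 5
audio = rng.normal(size=100)   # stand-in for spectrogram frames

# Batch version: convolve the whole clip at once
# (np.convolve with a flipped kernel == cross-correlation).
batch_out = np.convolve(audio, kernel[::-1], mode="valid")

# Streaming version: emit each output as soon as 5 frames have arrived.
stream_out = []
for t in range(len(kernel) - 1, len(audio)):
    window = audio[t - len(kernel) + 1 : t + 1]
    stream_out.append(conv1d_step(window, kernel))

# The two computations agree, so the conv layer does not
# inherently need the full clip before producing early outputs.
assert np.allclose(batch_out, stream_out)
```

So my question is really whether the 10-second input is just a training-time convenience, with deployment working on this kind of sliding window instead.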
Thank you in advance!