Real-time Trigger Word Detection

clovis · April 28, 2022, 10:19am

I don’t understand how can a Trigger Word Detection algorithm detects at real time. The algorithm we’ve implemented in the programming assignment needs to input an audio clip of 10 seconds. I know with the unidirectional architechture, the front part of the algorithm can make the prediction independently. But how should we adjust the real time input to make real-time predictions?

TMosh · April 28, 2022, 2:42pm

I think this would work:
Buffer the most recent 10 seconds of audio.
As each new sample arrives…

Discard the oldest sample and add the new one.
Compute the spectrogram.
Pass the spectrogram through the model.

Topic		Replies	Views
C5W3: Is real-time detection really possible? Sequence Models coursera-platform	3	568	September 12, 2021
Question on trigger word detection Sequence Models coursera-platform	1	540	April 21, 2022
DLS - Course 5 - W3 - Trigger Word Detection Sequence Models coursera-platform	6	544	April 26, 2023
W3_Trigger Word Detection with distributed time wrapper Sequence Models coursera-platform	5	605	April 16, 2023
Week3 Programming Assignment: Trigger Word Detection Sequence Models coursera-platform	2	508	September 6, 2022

Real-time Trigger Word Detection

Related topics