C5W3 : Trigger word detection learning question

Fredo · July 21, 2021, 9:39am

Hello,
In the trigger word detection problem, i noticed that for the defined architecture there are 522425 trainable parameters. And that the load pre-trained model has been trained with only 4000 examples. I wonder why that training fits so well considering that 522425 >> 4000 (the number of unknown is about 2 order greater than the number of examples). Moreover in the first courses of deep learning specialization we were instructed that the number of examples should be very large for deep learning. Is it a special case for trigger word detection ? or is it usual for sequence model ? or is it not a deep learning example ?
Thanks in advance for your answer,
Frédéric

TMosh · July 21, 2021, 4:26pm

If you have more features than examples, it’s a ready-made situation for an over-fit solution. It will give a low training error, but won’t generalize to new data very well.

One reason the training set is small in the exercise: Using a bigger training set (which you would in a real application) would take an extremely long time.

Topic		Replies	Views
Trigger Word Detection PreTrained Model Sequence Models	1	534	July 31, 2021
Course 5 - Week 3 - Trigger Word Detection : Training from Scratch Sequence Models	3	717	December 24, 2021
DLS - Course 5 - W3 - Trigger Word Detection Sequence Models	6	542	April 26, 2023
C5W3 trigger word detection assigment function5 Sequence Models	1	494	December 3, 2021
C5W3A2 Trigger word Detection - Why more positives? Sequence Models	1	420	July 14, 2023

C5W3 : Trigger word detection learning question

Related topics