Trigger word detection from Zero

Attila_Ambrus · November 6, 2021, 4:37pm

Dear Community,

I tried to build a set of 4,000 examples from the sounds given in the course. And from 10-seconds Youtube audio snippets that I manually tagged, a development set. I did not get the result that was expected. I think the model cannot generalize based on the training set.

What can I do wrong?
Is there perhaps no detail that has been omitted from the curriculum?

Here you can see the history of the training:
Loss and F1 Score

AUC and Accuracy

Here you can see the output of the prediction and my result:
Prediction

My result

Thank You for your help!

Attila_Ambrus · November 7, 2021, 7:28pm

@staff Could you help me, please?

TMosh · November 8, 2021, 7:18am

I do not understand the data you presented.
You did not label the horizontal axis.
Why do some of the plots range from 0 to 100, and others range from 0 to 1400?

Attila_Ambrus · November 8, 2021, 1:00pm

Because the 100 is epoch, the 1371 (1400) is datapoint from the audio. These are autorange ticks.

TMosh · November 8, 2021, 4:07pm

Where did your “output of the prediction” plot come from?

Attila_Ambrus · November 8, 2021, 4:31pm

From the original pretrained model that is the sequence model part of deep learning specialisation.

Below that is the result of my model that I taught from zero.

TMosh · November 8, 2021, 6:05pm

How large are your training and test sets?
I’m not sure what you mean by “a development set”.

Attila_Ambrus · November 8, 2021, 7:44pm

In the course the traning set is 4000 and the development/validation set is 26 large. In my case the training set size is 4000 and the validation set is 50.

By the way the the training set with which you teach the model. The development set with which you test the goodness of different models or hyperparameters, etc. The test set is used for the last test before the model is moved to the final live environment. This was very well explained in the specialization.

Attila_Ambrus · December 24, 2021, 7:18am

I used Youtube data for training. But I used all the parameters as they were in the course. I tried to change the parameters but it didn’t get any better. I used data audmentation, a larger set of training, but nothing got better.

Can my data or spectrogtam be the cause of the error?

Attila_Ambrus · February 3, 2022, 5:40pm

Months have passed. I visited DeepLearning.AI at a number of contacts and they did not respond anywhere about their course. This is a bit of a characteristic of the company’s current mentality. They don’t pay attention to their students.

Topic		Replies	Views
Course 5 - Week 3 - Trigger Word Detection : Training from Scratch Sequence Models coursera-platform	3	717	December 24, 2021
C5W3 : Trigger word detection learning question Sequence Models coursera-platform	1	590	July 21, 2021
Trigger Word Detection PreTrained Model Sequence Models coursera-platform	1	535	July 31, 2021
My model predictions Neural Networks and Deep Learning coursera-platform	4	649	September 7, 2021
DLS - Course 5 - W3 - Trigger Word Detection Sequence Models coursera-platform	6	545	April 26, 2023

Trigger word detection from Zero

Related topics