Any reasonable hints regarding the assignment?

Taranovski_Alex · July 18, 2023, 4:41am

Hi Everyone,

Well… here we go again…

I am doing the course #2 week #3 assignment, I genuinely tried multiple different model architectures, using sequence of convolutions, lstm.
Every training takes at least half an hour, the best slope I could achieve is 0.002, which is 4 time larger than required 0.0005.
Every time the loss starts lower and then inevitable rises.

The latest architecture I have takes 1 hour to train.

Could anyone provide some reasonable hints about how to beat this one?

gent.spah · July 18, 2023, 8:26am

Hi @Taranovski_Alex i am not a mentor for this but as far as I remember the assignment is similar to the lab, maybe the model is a bit larger -extended, but not much different.

bruno_ramos_martins · July 18, 2023, 11:33am

Hello @Taranovski_Alex ! Following the same idea as the example of the week, I used the model architecture and made some changes to try to understand what improves or worsens performance. During my tests, I didn’t wait for the model to complete its run because it’s possible to get a preview of whether it’s improving or not in the output of each epoch. My suggestion is to revisit the example of the week and make small changes to identify what can contribute.

Taranovski_Alex · July 23, 2023, 4:04am

Thanks Everyone for the hints!

Topic		Replies	Views
Model is not learning Natural Language Processing in TensorFlow week-2 , week-3 , week-4	3	539	June 10, 2022
T1_C3_Week 4 Assignment Natural Language Processing in TensorFlow week-4	3	331	January 9, 2023
NLP in TensorFlow Week 4 Assignment Generative AI with Large Language Models week-4	3	25	August 3, 2024
Issues with training Generative Deep Learning with TensorFlow week-3 , assignment	6	50	January 30, 2025
Week 3 Assignment - help with interpreting results Natural Language Processing in TensorFlow	2	338	December 22, 2022

Any reasonable hints regarding the assignment?

Related topics