Course 3, Week 1, Assignment 1: Training the Model

I get the following error while running train_model()

Step 1: Total number of trainable weights: 2327042
Step 1: Ran 1 train steps in 1.52 secs
Step 1: train CrossEntropyLoss | 0.70368373
Step 1: eval CrossEntropyLoss | 0.69825441
Step 1: eval Accuracy | 0.56250000

Step 10: Ran 9 train steps in 6.08 secs
Step 10: train CrossEntropyLoss | 0.65071505
Step 10: eval CrossEntropyLoss | 0.68158495
Step 10: eval Accuracy | 0.50000000
Wrong loss function. WeightedCategoryCrossEntropy_in3 was expected. Got CrossEntropyLoss_in3.
Wrong metrics in evaluations task. Expected ['WeightedCategoryCrossEntropy', 'WeightedCategoryAccuracy']. Got ['CrossEntropyLoss', 'Accuracy'].
4 Tests passed
2 Tests failed

All previous tests pass. What should I be looking at?

Thanks,
Drew

Seems like the loss and accuracy are incorrect.

  1. Use tl.WeightedCategoryCrossEntropy instead of tl.CrossEntropyLoss
  2. Use tl.WeightedCategoryAccuracy instead of tl.Accuracy
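As a side note on what the weighted loss computes: with all-ones example weights, a weighted categorical cross-entropy reduces to the plain mean cross-entropy. A minimal NumPy sketch (the helper name `weighted_cce` and the toy arrays are my own, not Trax's API):

```python
import numpy as np

def weighted_cce(probs, targets, weights):
    # Negative log-likelihood of each example's target class,
    # scaled by per-example weights and normalized by the weight sum.
    nll = -np.log(probs[np.arange(len(targets)), targets])
    return np.sum(weights * nll) / np.sum(weights)

probs = np.array([[0.9, 0.1],
                  [0.2, 0.8]])   # toy model probabilities per class
targets = np.array([0, 1])       # integer class labels

uniform = weighted_cce(probs, targets, np.ones_like(targets))
plain = np.mean(-np.log(probs[np.arange(2), targets]))
# With all-ones weights, the weighted loss equals the plain mean loss.
```

The weights only change the result when they differ across examples, which is why the assignment can swap the loss layer without changing the numbers when the weights are uniform.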

Hi @balaji.ambresh:

Seems like the loss and accuracy are incorrect.

(I am assuming this also makes my Exercise 8 test fail.)

Thanks for responding. I am trying to figure out where I’m making a mistake. Perhaps one culprit is how I calculate example_weights in exercise 2. My lab id is tqxhipjp

Thanks,
Drew

Only course staff and site admins can access your lab by id.

Can you check the place where you create training.TrainTask and training.EvalTask?

Hi @balaji.ambresh:

I don’t touch the code that defines training.TrainTask and EvalTask.

I was told my example_weights was wrong, so after re-reading I assigned it to
example_weights = np.ones_like(targets). If this is wrong, what should I be referring to?

Thanks,
Drew

Sorry for the confusion. What I meant was, check the place where you create instances of training.TrainTask and training.EvalTask.

Hello

I get the same error, but the TrainTask and EvalTask are created in a cell I'm not supposed to modify (see below).
Does that mean I should just ignore the failed test?

PLEASE, DO NOT MODIFY OR DELETE THIS CELL

from trax.supervised import training

def get_train_eval_tasks(train_pos, train_neg, val_pos, val_neg, vocab_dict, loop, batch_size=16):
    […]
    metrics=[tl.CrossEntropyLoss(), tl.Accuracy()]

@Florent_Tho
Please click my name and message your notebook as an attachment.

My bad: I was using np.ones to generate the weights, which produces floats.
Once corrected to integers (as the data_generator test suggested), the error disappeared.
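The dtype difference the poster hit is easy to reproduce. A minimal check, assuming `targets` holds integer class labels as in the assignment:

```python
import numpy as np

targets = np.array([0, 1, 1, 0])   # integer class labels

w_float = np.ones(targets.shape)   # np.ones defaults to float64
w_like = np.ones_like(targets)     # inherits targets' integer dtype

print(w_float.dtype, w_like.dtype)
```

This is why `np.ones_like(targets)` passes the test while `np.ones(targets.shape)` may not: the former matches the dtype of `targets` exactly.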

I thus guess that tl.CrossEntropyLoss auto-upgrades to WeightedCategoryCrossEntropy when weights are available.

I'm surprised; in my practice (e.g., optimizing Monte Carlo simulations), weights were floats for fine-grained control.

sorry for the trouble!

Hello @balaji.ambresh

I am getting a similar error:

Wrong loss function. WeightedCategoryCrossEntropy_in3 was expected. Got CrossEntropyLoss_in3.
Wrong metrics in evaluations task. Expected ['WeightedCategoryCrossEntropy', 'WeightedCategoryAccuracy']. Got ['CrossEntropyLoss', 'Accuracy'].

but the TrainTask and EvalTask have been pre-defined with 'CrossEntropyLoss' and 'Accuracy'.

@shanup
Please fix the task definitions inside get_train_eval_tasks based on this feedback:

  1. The 1st line is the expectation for the training task: the test says you should use WeightedCategoryCrossEntropy instead of CrossEntropyLoss.
  2. The 2nd line is the expectation for the evaluation task: replace the metrics you have with the expected ones.
Wrong loss function. WeightedCategoryCrossEntropy_in3 was expected. Got CrossEntropyLoss_in3.
Wrong metrics in evaluations task. Expected ['WeightedCategoryCrossEntropy', 'WeightedCategoryAccuracy']. Got ['CrossEntropyLoss', 'Accuracy'].

@balaji.ambresh

I already tried that, but then there is a mismatch with the expected values printed in the Jupyter notebook for the output of train_model (although this does not throw an error):

Step      1: Total number of trainable weights: 2327042
Step      1: Ran 1 train steps in 1.97 secs
Step      1: train WeightedCategoryCrossEntropy |  0.69622028
Step      1: eval  WeightedCategoryCrossEntropy |  0.69192076
Step      1: eval      WeightedCategoryAccuracy |  0.50000000
Expected output (Approximately)
Step      1: Total number of trainable weights: 2327042
Step      1: Ran 1 train steps in 1.54 secs
Step      1: train CrossEntropyLoss |  0.70209986
Step      1: eval  CrossEntropyLoss |  0.69093817
Step      1: eval          Accuracy |  0.50000000

I also checked the unit testcases and found this in the exception block

try:
    assert test_func(output_loop) == [
        "WeightedCategoryCrossEntropy",
        "WeightedCategoryAccuracy",
    ]  # ["CrossEntropyLoss", "Accuracy"]
    successful_cases += 1

It looks like the old expectation ["CrossEntropyLoss", "Accuracy"] has been commented out in the unit test. So we can pass the unit tests by changing train_task and eval_task to the expected ['WeightedCategoryCrossEntropy', 'WeightedCategoryAccuracy']. However, the expected output shown in the notebook after running train_model still lists ['CrossEntropyLoss', 'Accuracy'].

Seems like a mismatch between the expected values in the unit tests and the expected train_model output printed in the Jupyter notebook.

You should talk to an NLP mentor. I don’t have access to the current code. Adding
@arvyzukai

Hello @arvyzukai

Could you please take a look at this issue? It looks like a typo that needs to be fixed in the assignment notebook.

Hi, @shanup

Thank you for reporting the issue. There were a lot of code changes in August and some bugs came out as a result. The staff will try to fix it as soon as possible. :+1:

Please send me your Jupyter Notebook Assignment code so I could check how you got the mismatched output.

@arvyzukai
I have the same issue.

Hello!

I am facing the same error.
My lab ID is dfoqyqjk.

Thanks in advance for your help.

Hello, same here; my lab ID is skljxmaf.

More: if this is wrong, how can we trust the rest of the notebook?

Hi,

I wanted to report that I got a notification saying the notebook was changed. After using the new version and double-checking all my implementation code, I'm seeing a suspicious dimension mismatch. This is the error I get when trying to train the model:

TypeError: mul got incompatible shapes for broadcasting: (32, 2), (16, 2).

There are a bunch of cells that we are not supposed to touch, and after double-checking all possible values I can't seem to find the issue. Could this be a problem with the latest notebook update?
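The JAX error above follows the same broadcasting rules as NumPy: two batched arrays can only be multiplied elementwise if their leading dimensions match. A minimal NumPy reproduction with the shapes from the error message (NumPy raises ValueError where JAX raises TypeError), which suggests one data stream is producing batches of 32 while another uses 16:

```python
import numpy as np

a = np.ones((32, 2))   # e.g. a batch of model outputs
b = np.ones((16, 2))   # e.g. a batch of weights of a different size

try:
    a * b              # leading dims 32 vs 16 cannot broadcast
    err = None
except ValueError as e:
    err = str(e)

print(err)
```

A mismatch like this typically means the batch_size passed to one generator differs from the one used elsewhere, so that is worth checking first.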

Hi all, I can confirm that everything works now. Thank you!