Week 4 Assignment 2 Exercise 6 Issue

In the first expression, a * b is an int32 multiplication. In the second expression, a * b is a float32 multiplication. I'm guessing that the type differences are messing up the gradients. From my understanding, the gradient tape watches operations as they execute, so the order of operations and where you cast matter for backprop.
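For concreteness, here's a minimal sketch of that interaction (the names a and b are just placeholders, not the assignment's actual variables): the tape can only differentiate through floating-point ops, so where the cast happens determines what the tape can track.

```python
import tensorflow as tf

a = tf.constant(2)              # int32 by default
b = tf.constant(3)

af = tf.cast(a, tf.float32)     # cast *before* the multiply
bf = tf.cast(b, tf.float32)

with tf.GradientTape() as tape:
    tape.watch(af)              # constants must be watched explicitly
    y = af * bf                 # float32 multiply: differentiable
print(tape.gradient(y, af))     # tf.Tensor(3.0, shape=(), dtype=float32)

# By contrast, a * b is an int32 multiply; integer ops carry no
# gradient, so there is nothing for the tape to differentiate.
```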

I didn't do a deep dive, but the TensorFlow documentation on type promotion mentions dunder (double underscore) operations where the math can go wrong due to bit-widening.
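As a quick illustration of the dunder point (a sketch, assuming TF's default dtype behavior rather than the newer opt-in NumPy-style promotion): NumPy's __mul__ silently widens mixed types, while TensorFlow's refuses to promote across dtypes, which is exactly where these exercises trip people up.

```python
import numpy as np
import tensorflow as tf

# NumPy's __mul__ silently widens: int32 * Python float -> float64
print((np.int32(7) * 0.5).dtype)        # float64

# TensorFlow's __mul__ does not promote across dtypes
try:
    tf.constant(7) * 0.5                # int32 tensor * Python float
except (TypeError, tf.errors.InvalidArgumentError) as err:
    print(type(err).__name__)           # fails instead of widening

# An explicit cast states the intended dtype up front
print(tf.cast(tf.constant(7), tf.float32) * 0.5)
```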

Did you read my example thread (linked several times earlier in this thread) where I showed some of the possible ways to get errors here?

Yes! That thread helped me get to a successful submission for my assignment.

Thanks for adding the point about the potential effect on the gradients. I had not thought of that. Note that none of the variables in question here are mutable, so they are not directly affected by backprop, but they would be factors. You can see in my example thread that I used numpy or straight Python for the integer arithmetic pieces in some of the formulations and it all still works fine.

Normally, if you insert a numpy operation anywhere in the compute graph that matters, it "throws" in an obvious way at GradientTape time. E.g. even if you do something as simple as use np.transpose where you should use tf.transpose, it will fail. Here's an example of that from DLS C5, which also points to another case in DLS C2 W3.
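Here is a small sketch of that np.transpose vs. tf.transpose failure mode (toy shapes of my own, not the assignment's): the numpy op converts the variable to a plain ndarray, so the tape loses the connection to it, the gradient comes back as None, and things then blow up as soon as you try to apply it.

```python
import numpy as np
import tensorflow as tf

W = tf.Variable(tf.ones((3, 2)))
x = tf.constant([[1.0], [2.0], [3.0]])

# tf.transpose stays on the tape, so the gradient is computed
with tf.GradientTape() as tape:
    loss = tf.reduce_sum(tf.matmul(tf.transpose(W), x))
print(tape.gradient(loss, W))   # a real 3x2 gradient tensor

# np.transpose pulls W out of the graph as a plain ndarray, so the
# tape loses track of it and the gradient comes back as None
with tf.GradientTape() as tape:
    loss = tf.reduce_sum(tf.matmul(np.transpose(W), x))
print(tape.gradient(loss, W))   # None -> apply_gradients then throws
```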