Course 4, week 4 Neural Style Transfer: train_step "Unexpected cost for epoch 0"

tyler_norway · December 23, 2021, 5:33pm

I came across this on Stackoverflow. python - Jupyter uses an outdated version of a function when asked to import from a file - Stack Overflow.

I know they have issues with imported functions from a Python file. However, my theory is, IPython might keep a cache for functions and variables. In this case, that cached version seems not to be updated even though I had complied the cells.

Another “conspiracy theory”, I blame on Python’s lazy evaluation. Seems like both **2 and tf.square, theoretically, has the same presentation in assembly. Therefore, Python just ignores the new version and doesn’t compile the new code. Hence, clear all seems to empty that cache and forces Python to start from scratch.

paulinpaloalto · December 23, 2021, 7:12pm

Oh, yes, I didn’t think about that aspect. It is definitely true that if you change one of the python files parallel to the notebook that is imported, it does not take effect until you restart the kernel and rerun the actual “import” command. But why would you be changing those? That’s a pretty dangerous thing to do. The grader is a black box and I literally have no idea whether it depends on the specific contents of public_tests.py, e.g., but I’ve got a bad feeling it does.

As to lazy evaluation, I don’t buy that theory. There’s no way **2 and tf.square would be seen as equivalent by the interpreter. Maybe at the very very leafest (if that’s a word) level, but nothing in the call graph above that is the same. Note that the operands to **2 are integer constants, so the interpreter will use python integer operations to implement that, not TF operations. At least I would bet you all the beer you can drink in one sitting that is true. Prost!

BTW thanks for reminding me about the issue with changing python utility files. I think we need a topic on the FAQ Thread about that.

paulinpaloalto · December 30, 2021, 4:05pm

I just read over this whole thread and noticed that we didn’t really conclusively explain why it doesn’t work with tf.square but it does work with **2. There is one post on this thread that actually does describe the error here, but it’s worth also pointing to this other thread which gives a demonstration of the difference.

Peter_Darrell · March 23, 2022, 8:06pm

This one frustrated me for a week. I was pretty sure I had the maths laid out correctly but tf.square and **2 both threw errors (and gave me the same *e^-10 value for J_style_layer.
Eventually I stopped using the “1/4…etc” format and just started with the second term (tf.reduce_sum) and divided that by the 4 *… etc.
Suddenly I have the right output.

Gary_Hess · July 29, 2022, 9:02am

Same error for me, but as described above, I ran “Restart & Clear Output” and suddenly “UNQ_C5” passed!

I noticed before passing that each time I reran “UNQ_C5”, the output values diminished. Is this possibly due to overwriting the global variables somewhere?

anon57530071 · July 29, 2022, 9:08am

Welcome to the community.

I think this thread covers. Please refer it.

Topic		Replies	Views
Week 4 Neural Style Transfer: train_step "Unexpected cost for epoch 0: 2555.3056640625 != 10221.168" Convolutional Neural Networks	3	600	July 28, 2021
Art generation neural style assignment: train_step Convolutional Neural Networks	2	650	September 18, 2022
Exercise 6 - train_step Convolutional Neural Networks	4	656	August 7, 2021
Neural Style Transfer - Course 4 Week 4 Assignment 2 Convolutional Neural Networks week-4	4	25	January 16, 2025
Week 4 error in last graded part Convolutional Neural Networks	1	523	September 21, 2021

Course 4, week 4 Neural Style Transfer: train_step "Unexpected cost for epoch 0"

Related topics