Course 4: Week 4 - Assignment 2 - Exercise 6

Khiem_Viet_Ngo · February 24, 2022, 2:19am

Hello, I’m on Course 4, Week 4, Assignment 2, Exercise 6.

What’s wrong with the following code ? Print statements showed that a_C, a_S, and a_G are of the same shape (1, 400, 400, 64).

Any idea ? Thanks!

TMosh · February 24, 2022, 2:42am

Please remove your code from your message - sharing your code isn’t allowed.

What makes you believe there is an error in your code?

Khiem_Viet_Ngo · February 24, 2022, 2:44am

Thanks for your reply. I got this assertion error:

AssertionError                            Traceback (most recent call last)
<ipython-input-27-c4cbeb243f35> in <module>
      6 print(J1)
      7 assert type(J1) == EagerTensor, f"Wrong type {type(J1)} != {EagerTensor}"
----> 8 assert np.isclose(J1, 25629.055, rtol=0.05), f"Unexpected cost for epoch 0: {J1} != {25629.055}"
      9 
     10 J2 = train_step(generated_image)

AssertionError: Unexpected cost for epoch 0: -317876320.0 != 25629.055

TMosh · February 24, 2022, 2:53am

The problem is that your cost is wrong - there’s no reason to suspect that the sizes are incorrect.

Most likely the cost returned by compute_style_cost() or compute_content_cost() is incorrect.

The first indication that your cost is incorrect is that it’s a negative number. Cost must always be >= zero.

Khiem_Viet_Ngo · February 24, 2022, 3:06am

I wouldn’t work on Ex 6 if one of my previous exercises was wrong. I got “All tests passed” from Exercises 1 - 5. The compute_style_cost() function (Ex 4) has already been implemented by given. The compute_content_cost() function (Ex 1) got “All tests passed” and I saw that the expected output matched mine.

As you can see, both cost functions (content & style) depend on the three parameters a_S, a_C, and a_G. If so, if either cost is wrong, then one of (a_S, a_C, or a_G) is wrong then. However, for Ex 6, I was asked to use the global variables a_S and a_C while a_G is the only computed parameter: a_G = vgg_model_outputs(generated_image) . Can’t tell which is wrong here!

TMosh · February 24, 2022, 3:42am

“All tests pass” does not mean your code is perfect.

The unit tests don’t catch every possible error - they actually don’t catch very many errors at all.

TMosh · February 24, 2022, 3:47am

compute_style_cost() is provided code, but it calls compute_layer_style_cost(). That’s one place the problem could be.

Khiem_Viet_Ngo · February 24, 2022, 3:51am

I appreciate your assistance so far! Regarding the compute_layer_style_cost(a_S, a_G) function, here’s what I have (you can remove my code after seeing it but this is the only way that I can tell you my problem):

# Retrieve dimensions from a_G (≈1 line)
    _, n_H, n_W, n_C = a_G.get_shape().as_list()
    
    # Reshape the images from (n_H * n_W, n_C) to have them of shape (n_C, n_H * n_W) (≈2 lines)
    a_S = tf.reshape(tf.transpose(a_S), shape=[n_C, n_H * n_W])   
    a_G = tf.reshape(tf.transpose(a_G), shape=[n_C, n_H * n_W]) 

    # Computing gram_matrices for both images S and G (≈2 lines)
    GS = tf.matmul(a_S, tf.transpose(a_S))
    GG = tf.matmul(a_G, tf.transpose(a_G))

    # Computing the loss (≈1 line)
    numerator   = tf.reduce_sum(tf.square(tf.subtract(GS, GG)))
    denominator = tf.cast(tf.square(2 * n_H * n_W * n_C), tf.float32)
    
    J_style_layer = numerator / denominator

And I got the same result as the expected output for J_style_layer (= 14.017805)

Please let me know what’s wrong with the above code. Many thanks!

paulinpaloalto · February 24, 2022, 4:24am

Why do you write out the computation of the gram matrices? You built a function to do that for you, right?

I have seen problems in the past (including negative cost values) on this function if you use TF integer arithmetic with n_H and so forth. The code you have written looks like it should work, but another thing to try is just to use normal python arithmetic or numpy operations for the scalar quantities and see if that helps.

Khiem_Viet_Ngo · February 24, 2022, 4:32am

Good to see you Paul!

You are right, I was blinded, I should have called gram_matrix(a_S) and gram_matrix(a_G) instead.

I will try your recommendation. Thanks!

Topic		Replies	Views
Exercise 6 - train_step Convolutional Neural Networks	4	656	August 7, 2021
CNN Week 4 Assigment 2, Excercise 6, train_step Convolutional Neural Networks	5	772	September 23, 2021
Week 4 error in last graded part Convolutional Neural Networks	1	523	September 21, 2021
Art generation neural style assignment: train_step Convolutional Neural Networks	2	650	September 18, 2022
Week 4 Assignment 2 Exercise 6 error Convolutional Neural Networks	2	542	January 5, 2022

Course 4: Week 4 - Assignment 2 - Exercise 6

Related topics