W4 A1 | Transformers: scaled dot-product attention assessment

Hi everybody.

I feel very stupid (and I am sure I have a bug in the notebook).

My printed "output" after the scaled dot-product attention assessment cell:

tf.Tensor(
[[0.2589478 0.42693272 0.15705977 0.15705977]
[0.2772748 0.2772748 0.2772748 0.16817567]
[0.33620113 0.33620113 0.12368149 0.2039163 ]], shape=(3, 4), dtype=float32)

assessment comments:

AssertionError                            Traceback (most recent call last)
<ipython-input-...> in <module>
      1 # UNIT TEST
----> 2 scaled_dot_product_attention_test(scaled_dot_product_attention)

~/work/W4A1/public_tests.py in scaled_dot_product_attention_test(target)
     60     assert np.allclose(weights, [[0.2589478, 0.42693272, 0.15705977, 0.15705977],
     61                                  [0.2772748, 0.2772748, 0.2772748, 0.16817567],
---> 62                                  [0.33620113, 0.33620113, 0.12368149, 0.2039163]])
     63
     64     assert tf.is_tensor(attention), "Output must be a tensor"

AssertionError:


Any hints on how to debug from here would be appreciated.

Edited: I noticed the missing "," in my output. Will try to debug.
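For anyone comparing the same way: the missing commas are just how tf.Tensor prints, so an eyeball comparison can mislead. Here is a sketch of a numerical check, assuming attention_weights is the second value your function returns (the expected values are copied from the traceback above):

import numpy as np

expected = np.array([[0.2589478, 0.42693272, 0.15705977, 0.15705977],
                     [0.2772748, 0.2772748, 0.2772748, 0.16817567],
                     [0.33620113, 0.33620113, 0.12368149, 0.2039163]])

# attention_weights: the tensor your scaled_dot_product_attention returned
print(np.allclose(attention_weights.numpy(), expected))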

Hi everyone,
I have exactly the same error: assert tf.is_tensor(attention), "Output must be a tensor"
I double-checked my output with
print(tf.is_tensor(output), tf.is_tensor(attention_weights))
and it shows that both the output and the attention weights are tensors. However, I'm still getting the error message.
Because of this error, when I submit my code I get 0 points, with another error:
"Cell #13. Can't compile the student's code. Error: AssertionError()"

Can anyone help with that?

Hi, Jan Kieres.

If that's the case, try restarting the kernel and rerunning all cells, then check the exact error and share it with us. Thanks!

Hi, everyone.
I got the same error. I tried restarting the kernel, but it didn't work. Has anyone found a solution?
Can anyone help us with this? :pleading_face:

I miscalculated: I had used dk = k.ndim, but it should be seq_len_k.
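For reference, a minimal sketch of the standard computation from "Attention Is All You Need" (names are illustrative, not the assignment's exact template; the mask convention assumes mask == 1 means "attend", as in the course notebook). Note that in the paper dk is the key depth, i.e. the last axis of k; if the test's k happens to be square, the key depth and seq_len_k give the same scale, so both appear to work here:

import tensorflow as tf

def scaled_dot_product_attention_sketch(q, k, v, mask=None):
    # q: (..., seq_len_q, depth), k: (..., seq_len_k, depth), v: (..., seq_len_k, depth_v)
    matmul_qk = tf.matmul(q, k, transpose_b=True)        # (..., seq_len_q, seq_len_k)
    dk = tf.cast(tf.shape(k)[-1], tf.float32)            # depth of the keys, not k.ndim
    scaled_attention_logits = matmul_qk / tf.math.sqrt(dk)
    if mask is not None:                                 # mask: 1 = attend, 0 = mask out
        scaled_attention_logits += (1. - mask) * -1e9
    attention_weights = tf.nn.softmax(scaled_attention_logits, axis=-1)  # sums to 1 over keys
    output = tf.matmul(attention_weights, v)             # (..., seq_len_q, depth_v)
    return output, attention_weights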

I am getting the same error: assert tf.is_tensor(attention), "Output must be a tensor"

I’m using seq_len_k as you suggested above:
scaled_attention_logits = tf.math.divide(matmul_qk, tf.math.sqrt(seq_len_k*1.0))

I validated that I am outputting tensors:

print(f" output={type(output)}  {output.shape}")
print(f" attention_weights={type(attention_weights)}  {attention_weights.shape}")

# END CODE HERE

return output, attention_weights

output=<class 'tensorflow.python.framework.ops.EagerTensor'> (3, 2)
attention_weights=<class 'tensorflow.python.framework.ops.EagerTensor'> (3, 4)

please help
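One way to narrow this down is to rerun the unit test's own two checks directly on your return values. A sketch: q, k, v stand for whatever test inputs you already have in scope, and it assumes your function returns (output, attention_weights) in that order:

import numpy as np
import tensorflow as tf

output, attention_weights = scaled_dot_product_attention(q, k, v, None)

print(tf.is_tensor(output), tf.is_tensor(attention_weights))  # both should print True
print(np.round(attention_weights.numpy(), 8))
# Compare the printed rows one by one against the expected matrix in the traceback;
# a mismatch in any row points at the scaling, masking, or softmax axis.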

Hello all!

Welcome to the community.

Please go through this link on how to resolve issues related to scaled dot-product attention.