I suspect the issue is with Step 1: Define your pre-attention Bi-LSTM. Double-check how you are coding that. I think you need to check what you pass to Bidirectional/LSTM there.
I tried removing the "inputs=" keyword, but I get the same error.
In Step 1 of modelf, I use Bidirectional(LSTM(units=Tx, return_sequences=True))(X)
I passed all the tests for one_step_attention; there I used s_prev = repeator(s_prev)
Here you go. Tx is the length of the input sequence; it is not the same as the number of units. Think about it. Hint: what is the difference between the length of the input sequence and the hidden state size?
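To make the distinction concrete, here is a minimal sketch of a pre-attention Bi-LSTM where units is the hidden-state size rather than Tx. The names n_a and vocab_size and the specific numbers are placeholders for illustration, not the assignment's values:

```python
from tensorflow.keras.layers import Input, LSTM, Bidirectional

Tx = 30          # length of the input sequence (number of time steps)
n_a = 32         # hidden state size of each pre-attention LSTM direction (placeholder)
vocab_size = 37  # one-hot input dimension at each time step (placeholder)

X = Input(shape=(Tx, vocab_size))

# units controls the hidden state size; Tx only appears in the input shape.
# With return_sequences=True, `a` has shape (batch, Tx, 2 * n_a):
# one concatenated forward/backward hidden state per time step.
a = Bidirectional(LSTM(units=n_a, return_sequences=True))(X)
print(a.shape)   # (None, 30, 64)
```

If you set units=Tx instead, the model still builds, but the hidden state size is then tied to the sequence length, which is not what the attention step downstream expects.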