C4W1_Assignment - Translate Function

Manjunath_RN · February 26, 2024, 11:09pm

Hi - In the Translate function… dont we need to pass a different next_token each time ?

Iterate for max_length iterations

for _ in range(max_length):
    # Generate the next token
    try:
        next_token, logit, state, done = generate_next_token(
            decoder=model.decoder,
            context=context,
            next_token=next_token,
            done=done,
            state=state,
            temperature=temperature
        )
    except:
         raise Exception("Problem generating the next token")

Unit test results
Failed test case: translate didn’t return the same logit when using temperature of 0.0.
Expected: -0.6533634066581726
Got: -0.5493094921112

arvyzukai · February 27, 2024, 5:50am

Hi @Manjunath_RN

We do pass next_token each time. The problem is somewhere else. Maybe you do not break out of the loop when “done”?

Manjunath_RN · February 27, 2024, 7:37pm

Thanks and Yes, I do break out

if done:
break

arvyzukai · February 28, 2024, 5:43am

Then maybe you do not initialize the initial state to zeros?

The example you’re given above the exercise, initializes the initial state with random uniform, so maybe you copied that line instead of modifying it to return all zeros?

Manjunath_RN · February 28, 2024, 8:03pm

That was it. Thank you for the clarification!

Finished the nlp module!

Dennis_Sinitsky · March 14, 2024, 7:25pm

I wondered, why did TA recommend tf.zeros in the beginning of the exercise… Now I know

Topic		Replies	Views
C4W1_Assignment Translator function take in decoder or model.decoder NLP with Attention Models week-1	3	49	October 29, 2024
C4W1_Assignment - Exercise 5 NLP with Sequence Models week-1	41	1366	May 28, 2024
All previously generated tokens as decoder input or only the latest generated token as decoder input NLP with Attention Models week-1	2	32	July 14, 2024
C4_W1_Q5: Test Inconsistency NLP with Attention Models week-1	5	308	January 3, 2024
C4W1 NMT with Attention(tensorflow) Assignment, Exercise 5 - translate - generate "eu eu eu " NLP with Attention Models week-1	17	464	August 7, 2024

C4W1_Assignment - Translate Function

Iterate for max_length iterations

Related topics