I am stuck: a single mbr_decode call takes more than 5 or even 10 minutes to run. Is there something wrong with it? Even if I just run generate_samples(), it succeeds, but sometimes it returns within a minute and sometimes it takes an extremely long time.
I have tried running C10 overnight; it didn't return any errors (until I interrupted it), it just kept running.
One common mistake that can make the code run extremely long is in the # UNQ_C6 next_symbol function definition: when calculating log_probs, do not use -1 in the second dimension:
# get log probabilities from the last token output
log_probs = output[0, -1, :]  # wrong: with padding, -1 is not the last real token
The hint in the notebook:
The log probabilities output will have the shape: (batch size, decoder length, vocab size). It will contain log probabilities for each token in the cur_output_tokens plus 1 for the start symbol introduced by the ShiftRight in the preattention decoder. For example, if cur_output_tokens is [1, 2, 5], the model will output an array of log probabilities each for tokens 0 (start symbol), 1, 2, and 5. To generate the next symbol, you just want to get the log probabilities associated with the last token (i.e. token 5 at index 3). You can slice the model output at [0, 3, :] to get this. It will be up to you to generalize this for any length of cur_output_tokens.
In other words, do not use -1 to select the decoder length dimension; use the length of the current output tokens, which you stored in an appropriate variable. Because the sequence fed to the model is padded, -1 lands on a padding position instead of the last real token.
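To make this concrete, here is a minimal sketch of the slicing logic. The power-of-two padding, the (input, target) call signature, the (output, _) unpacking, and names like next_symbol_sketch, token_length, and padded_with_batch are my assumptions based on the notebook's setup, not necessarily its exact code:

import numpy as np

def next_symbol_sketch(model, input_tokens, cur_output_tokens):
    # Sketch only: `model` is assumed to take (input, target) arrays with a
    # batch dimension, to return a (log probs, target) pair, and the log
    # probs are assumed to have shape (batch size, decoder length,
    # vocab size), as the hint describes.
    token_length = len(cur_output_tokens)
    # assumed: pad the current output to the next power of two
    padded_length = 2 ** int(np.ceil(np.log2(token_length + 1)))
    padded = cur_output_tokens + [0] * (padded_length - token_length)
    padded_with_batch = np.array(padded)[None, :]  # shape (1, padded_length)
    output, _ = model((input_tokens, padded_with_batch))
    # ShiftRight prepends a start symbol, so the last real token's log
    # probabilities sit at index token_length, not at -1 (with padding,
    # -1 would point at a pad position).
    log_probs = output[0, token_length, :]
    return int(np.argmax(log_probs))

Slicing at the wrong position tends to produce symbols that never reach the end-of-sentence token, which would explain why decoding keeps running for so long.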