C5w1 - Assignment 2 - Exercise 2. The sampling

Hello everyone,

Could you please help me avoid this error (see the attachment). How should I do to make sure that the probability vector y sums to 1?

This is how I computed the probabilities: y = softmax(z), and
how I sampled the indexes idx = np.random.choice(range(len(z.ravel())), p = y.ravel()).
I raveled both y and z to make sure they are of the same shape here (otherwise I was getting an error for the difference in their shapes)

Many thanks

1 Like

Hi @Bertrand_T_Tameza ,

The instruction in line 56 specified the sampling should be done within the vocabulary from the probability distribution y, so the first parameter for the np.random.choice() is range(vocab-size) and not range(len(z.ravel()). z is the activation output.

1 Like

Thank you @Kic, I have corrected the error, but now receiving another error. Don’t know what I did wrong because my implementation of a is correct np.tanh(np.dot(Wax, x) + np.dot(Waa, a_prev) + b)

Thank you @Kic, the problem is solved!

Hi Everyone, I’m running into this same problem and stuck. Please help.
And why would “a” be the same size as “p” anyway?

1 Like

Please share your full error.

Here it is. thanks

Hi @Kin_Yue,

Check this page out, and look for the explanation there for why their shapes have to be the same.

For your error, read this thread in whole for some hints.

Raymond

i also get this. When i add up all the probabilites, i get…
y values sum up to :0.9999999999999998 !

i understand why they must be same size. I used this, but also got the same error

Where i am lost on this exercise, is what are the correct shapes for all the tensors. For example, are x, a_prev both 2 d tensors like this?

my y shape looks like this Can someone tell me if this looks wrong?

hees is the output…


…etc…

i also have an unrelated question about this line. I am guessing it is one hot encoded, so only 0 or 1 values allowed. But can someone confirm?

Many thanks in advance!!

Also, this error in these jupyter notebooks is misleading

it always says len(y.ravel()…etc even though that is no longer the code that i have. Almost like some broken cache somewhere. And ive tried restarting kernel etc

ok im back to the original error, when i use this code:
image

…and here is the error…

Case closed. problem was nothing to do with np.random.choice,
rather a subsequent line. all good now! Sorry for all the CRUD. :slightly_smiling_face: