Course 5 Week 1 Assignment 2 Exercise 2

Hi, I’m hitting an error with the numpy.random.choice function that seems to come from probabilities slightly above the tolerance of this function. I get the following error:

---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-78-a13d660e654a> in <module>
     19     print("\033[92mAll tests passed!")
     20 
---> 21 sample_test(sample)

<ipython-input-78-a13d660e654a> in sample_test(target)
      7 
      8 
----> 9     indices = target(parameters, char_to_ix, 0)
     10     print("Sampling:")
     11     print("list of sampled indices:\n", indices)

<ipython-input-77-ff3cc52aa630> in sample(parameters, char_to_ix, seed)
     57         # (see additional hints above)
     58         probs = y.ravel()
---> 59         idx = np.random.choice(range(len(probs)), p = probs)
     60 
     61         # Append the index to "indices"

mtrand.pyx in numpy.random.mtrand.RandomState.choice()

ValueError: probabilities do not sum to 1

I’ve tried normalizing the distribution first but it still seems to run into the same error. What am I missing?

See the hints provided above the function’s cell. The y array is two-dimensional. You need to flatten it using the ravel() function.

I’ve already flattened it with y.ravel() — see line 58 in the code above.

When I check the shape of y it is (27, 100), and when I check the sum of y.ravel() I get ~100.0000000024.

Since the random.choice function requires that the probabilities sum to 1, I’m trying to normalize the values. When I do that, the index selected is greater than the size of x.
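(For later readers: a minimal sketch of why normalizing makes things worse. The shapes here are assumed from the thread — vocab_size = 27, n_a = 100 — and the softmax input is random placeholder data.)

```python
import numpy as np

# If y mistakenly has shape (27, 100), each of the 100 columns sums
# to 1, so the raveled vector has 2700 entries summing to ~100.
rng = np.random.default_rng(0)
z = rng.standard_normal((27, 100))
y = np.exp(z) / np.exp(z).sum(axis=0, keepdims=True)  # column-wise softmax

probs = y.ravel()
print(len(probs), probs.sum())  # 2700 entries summing to ~100 -> ValueError

# Normalizing "fixes" the sum, but the sampled index now ranges over
# 0..2699, far beyond the 27-character vocabulary.
idx = np.random.choice(range(len(probs)), p=probs / probs.sum())
print(idx)
```

So normalizing only hides the real bug: the shape of y.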

range(len(probs))

You’re using the wrong variable for the length.

The Step 3 instructions say “…from the probability distribution y”

That might not be the problem; I tried your method and it seems to be OK.
But I recommend you check whether you used softmax() correctly to compute y.

The error message says your probabilities don’t sum to 1, but summing to 1 is exactly what softmax() should guarantee.

I think it comes down to a numpy summation error: because of rounding, the values do not sum appropriately, so I had to scale them manually. The softmax() works fine, but the sum of 100 columns that each total 1 comes out to roughly 100, which exceeds the tolerance of the random.choice function.

Do not scale the results manually. That is not necessary if you wrote the correct code.

OP did you find a solution? I am having the same problem.

I figured it out. You have to give x and a_prev proper 2-D shapes when initializing them so that y comes out with shape (vocab_size, 1). Currently you are getting y of shape (vocab_size, n_a). In that case, while each column sums to 1, the sum of all elements of y is approximately 1 * number of columns = 1 * n_a = 100.
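To illustrate, here is a hedged sketch of the forward step with random placeholder weights (vocab_size = 27, n_a = 100; the parameter names mirror the assignment, but the values are illustrative only):

```python
import numpy as np

vocab_size, n_a = 27, 100
rng = np.random.default_rng(1)
Wax = rng.standard_normal((n_a, vocab_size))
Waa = rng.standard_normal((n_a, n_a))
Wya = rng.standard_normal((vocab_size, n_a))
b = np.zeros((n_a, 1))
by = np.zeros((vocab_size, 1))

def step(x, a_prev):
    a = np.tanh(Waa @ a_prev + Wax @ x + b)
    z = Wya @ a + by
    return np.exp(z) / np.exp(z).sum(axis=0, keepdims=True)  # softmax

a_prev = np.zeros((n_a, 1))

# 1-D x: Wax @ x is (100,), which broadcasts everything to (100, 100).
y_bad = step(np.zeros(vocab_size), a_prev)
print(y_bad.shape, y_bad.ravel().sum())           # (27, 100) ~100.0

# Column-vector x: every shape stays (n, 1) as intended.
y_good = step(np.zeros((vocab_size, 1)), a_prev)
print(y_good.shape, y_good.ravel().sum())         # (27, 1) 1.0
```

Only the second version produces a valid probability vector for np.random.choice.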


Then what dimensions of x, a_prev, and y should we keep in order to remove the ValueError?

Hey All,

Just adding my input in case other people encounter the same issue.
I had a problem with the probabilities summing to 1.
As mentioned by @Suhail_Amiri, this was resolved by explicitly assigning strict vector shapes when initializing x and a_prev. Basically I changed the code from x = np.zeros(27) → x = np.zeros((27, 1)).


Dear All,

I think the problem is not related to the dimensions of x and a_prev. It should be related to the softmax function defined in the course.

Below is the proof. You can see that after using scipy’s softmax, even with 1-D x and a_prev, we can still pass the result into the random.choice function without error.

Notice it’s not related to numerical round-off error. The proof below shows that random.choice does accept a sum very close to 1, not exactly 1.

Then, what is the conclusion?
Is it an issue with the softmax implemented in the course?

Hello @bblanc! This thread is too old, so please create a new post and share your full error.

If x is initialized with shape (vocab_size,) then y will have shape (vocab_size, n_a), but if you make it a 2-D matrix of shape (vocab_size, 1), it will give y.shape = (vocab_size, 1).

Details:
x:(27,), Wax@x:(100,), a_prev:(100, 1), Waa@a_prev:(100, 1) => a:(100, 100), Wya@a:(27, 100), z:(27, 100), y:(27, 100)
but
x:(27, 1), Wax@x:(100, 1), a_prev:(100, 1), Waa@a_prev:(100, 1) => a:(100, 1), Wya@a:(27, 1), z:(27, 1), y:(27, 1)

In the first case Wax@x gets broadcast up in Waa@a_prev + Wax@x + b to (100,100).
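The broadcast above can be sketched in isolation (assumed size n_a = 100; `col` and `vec` are stand-ins for the products named above):

```python
import numpy as np

col = np.zeros((100, 1))   # like Waa @ a_prev, a column vector
vec = np.zeros(100)        # like Wax @ x when x is 1-D

# A (100, 1) column plus a (100,) row-like vector broadcasts to a
# (100, 100) matrix, which is how a 1-D x inflates every later shape.
print((col + vec).shape)                  # (100, 100)

# Reshaping the vector to a column keeps the intended shape.
print((col + vec.reshape(-1, 1)).shape)   # (100, 1)
```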

Note that there are two places where x is initialized: before the while loop and inside it. Make sure both are 2-D!