Positional_encoding (initialization)

Abhishekhp · March 11, 2022, 2:28pm

How do I initialize this? Zero didn't work, and
no value for k is given.

{moderator edit - solution code removed}

paulinpaloalto · March 11, 2022, 4:28pm

I freely admit this is all a little confusing. But it would be a good idea to study the “docstring” for the get_angles function more carefully. They tell you everything you need to know there to be able to call it correctly based on the information you are given in positional_encoding. The value of the k argument is determined by the value of d, right? But it’s not just a scalar. You may also find np.arange useful. In fact if you look at the test cell for get_angles, they gave you an example of how to use it for this purpose.
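As a rough illustration of the hint about np.arange (a sketch of the array shapes only, not the assignment solution), the dimension indices 0 through d-1 can be generated as a 1D array and then given a leading axis to form the 2D row vector the docstring describes. The value d = 8 is an arbitrary assumption for the example.

```python
import numpy as np

d = 8  # hypothetical encoding size, chosen only for illustration

k_1d = np.arange(d)          # 1D array of indices 0..d-1, shape (d,)
k_row = k_1d[np.newaxis, :]  # add a leading axis: 2D row vector, shape (1, d)

print(k_1d.shape)   # (8,)
print(k_row.shape)  # (1, 8)
```

The point is that k is built from d: its length follows the encoding size, and it carries two dimensions rather than one.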

Abhishekhp · March 11, 2022, 4:51pm

Thanks for the reply. Actually, I'm still struggling to debug this.
get_angles has 3 parameters.
But in positional_encoding… I tried this,
but it led to another error.

{moderator edit - solution code removed}

def get_angles(pos, k, d):
    """
    Get the angles for the positional encoding

    Arguments:
        pos -- Column vector containing the positions [[0], [1], ...,[N-1]]
        k --   Row vector containing the dimension span [[0, 1, 2, ..., d-1]]
        d(integer) -- Encoding size

Abhishekhp · March 11, 2022, 4:58pm

Also tried

{moderator edit - solution code removed}

paulinpaloalto · March 11, 2022, 5:04pm

That’s one step closer, but you’re still not really understanding the arguments to get_angles. At least you’ve now made the first argument a list, but that means it is a 1D object. It needs to be a 2D column vector, right? The docstring literally says that. So you need a reshape there. Then you’re still passing a scalar for the k argument, but that is supposed to be a 2D row vector. It literally says that also in the docstring for get_angles.
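To make the 1D versus 2D distinction concrete, here is a minimal sketch, assuming a hypothetical count of 5 positions. It compares a plain list of positions with the same values reshaped into a column vector.

```python
import numpy as np

n_positions = 5  # hypothetical number of positions, for illustration only

pos_list = list(range(n_positions))              # plain Python list: one dimension
pos_col = np.arange(n_positions).reshape(-1, 1)  # 2D column vector: one value per row

print(np.shape(pos_list))  # (5,)
print(pos_col.shape)       # (5, 1)
```

Passing -1 to reshape lets NumPy infer the number of rows, so the same line works for any number of positions.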

Abhishekhp · March 11, 2022, 5:11pm

Instead of reshaping, I just wrapped it in [ ],
but that didn't help.

{moderator edit - solution code removed}

paulinpaloalto · March 11, 2022, 5:13pm

Well, does that actually work? Apparently not. Try printing the shape of [np.arange(42)]. Is it a 2D column vector?

This is how debugging works. You can’t just wonder what something does: you have to actually verify the behavior at a more granular level rather than treating the whole thing as a black box.

paulinpaloalto · March 11, 2022, 5:19pm

out = np.arange(10)
print(out)
print(type(out))
print(f"out.shape {out.shape}")
out = [np.arange(10)]
print(out)
print(type(out))
print(f"out.shape {out.shape}")

Running the above gives this:

[0 1 2 3 4 5 6 7 8 9]
<class 'numpy.ndarray'>
out.shape (10,)
[array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])]
<class 'list'>
---------------------------------------------------------------------------
AttributeError                            Traceback (most recent call last)
<ipython-input-5-71d5a5451bab> in <module>
      6 print(out)
      7 print(type(out))
----> 8 print(f"out.shape {out.shape}")

AttributeError: 'list' object has no attribute 'shape'

Hmmmmm. Perhaps a bit more thought is required.
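The AttributeError above comes from asking a Python list for .shape, which only ndarrays have. One way to inspect the wrapped value anyway is np.shape, which accepts any array-like. The sketch below uses the [np.arange(42)] expression from the earlier question and shows that it behaves like a single row, not a column.

```python
import numpy as np

wrapped = [np.arange(42)]       # a Python list holding one 1D array
print(type(wrapped))            # <class 'list'>
print(np.shape(wrapped))        # (1, 42): one row with 42 columns

as_array = np.asarray(wrapped)  # convert to an ndarray to get a .shape attribute
print(as_array.shape)           # (1, 42)
print(as_array.T.shape)         # (42, 1): transposing gives the column layout
```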

Abhishekhp · March 11, 2022, 5:26pm

Thanks for your advice and the effort to explain in detail… But

I checked how it works:

{moderator edit - solution code removed}

Since position is just a scalar:

{moderator edit - solution code removed}

paulinpaloalto · March 11, 2022, 5:30pm

One step at a time, I guess. Well, what kind of vector is the k argument supposed to be? We’ve been over this, right?

Abhishekhp · March 15, 2022, 3:24pm

Solution found: the encoding size is d, not the number of positions!

paulinpaloalto · March 15, 2022, 4:51pm

Glad to hear you got to the solution. Notice that I mentioned the relationship between k and d in my very first reply on this thread. It’s also clearly specified in the docstring for the get_angles function. The information is there for you to see, but it requires that you read it carefully. Being in a hurry when you read the instructions generally ends up wasting more time than it saves. Just sayin’ …
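Why the length of k matters can be seen from NumPy broadcasting alone. The sketch below is not the assignment's get_angles: it uses a stand-in arithmetic expression, and the sizes (4 positions, encoding size 6) are assumptions chosen for illustration. A column of positions combined with a row of length d yields one value per (position, dimension) pair, whereas a row sized by the number of positions yields the wrong output shape.

```python
import numpy as np

n_positions, d = 4, 6  # hypothetical sizes, for illustration only

pos = np.arange(n_positions).reshape(-1, 1)  # column vector, shape (4, 1)
k = np.arange(d).reshape(1, -1)              # row vector, shape (1, 6)

# Stand-in for the real angle formula: any elementwise operation broadcasts the same way.
combined = pos * 10 + k
print(combined.shape)  # (4, 6): one row per position, one column per dimension

# The mistake from this thread: sizing the row by positions instead of d.
wrong_k = np.arange(n_positions).reshape(1, -1)  # shape (1, 4)
print((pos * 10 + wrong_k).shape)                # (4, 4): encoding size is lost
```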

Redouane16 · March 22, 2022, 11:21am
