Language Model and Sequence

Moutasem_Akkad · August 7, 2022, 11:00am

Hi,

I am confused about the notation x = y. x is a Tth example which is a word and y is a probability , am I correct there?

So for x<2>, it is “cat” and y is a probability of a word being something like a float, say 0.7.

Can I get more clarification here as to what we are doing?

balaji.ambresh · August 7, 2022, 11:20am

x<i> refers to the input to an RNN at time, i and y<i> is the output based on a<i-1> and x<i>.
Input to an RNN is a representation say, one-hot encoding of the word and the output corresponds to the softmax output. So, in this case, when the one-hot representation of Cats is input to the network, the RNN outputs probability of each word, given the input. From there, you can select the most likey word using argmax on the output.

Let’s say you found average to have the highest probability. You can now use the representation of average as input at next timestep and so on.

Topic		Replies	Views
Language modelling with an RNN Sequence Models	1	489	February 18, 2023
RNN Model - y Label Meaning Sequence Models week-1	4	94	June 3, 2024
RNN models: Notations Sequence Models week-1	3	16	August 21, 2024
Why y for the probability distribution Sequence Models	3	601	May 17, 2021
C5_W1 Language Model and sequence generation Sequence Models week-1	2	13	December 30, 2024

Language Model and Sequence

Related topics