In the lecture video of Picking the Most Likely Sentence, at 6:33 minute, this statement not understandable , can please help to explain ?
So, it’s quite possible that if you just pick the third word based on whatever maximizes the probability of just the first three words, you end up choosing option number two. But, this ultimately ends up resulting in a less optimal sentence, in a less good sentence as measured by this model for p of y given
Also at 7:49, can the below statement explained a bit more, looks not understandable ?
So, this is just a huge space of possible sentences, and it’s impossible to enumerate them all, here what does it mean enumerate them all ?