What is exactly a language model? What are we trying to find when we are trying to predict a target word in a sentence? As in, are we predicting the target word, given the E matrix (or the embedding matrix) or are we trying to find methods to build the E matrix?
A language model is an algorithm that predicts sequences of words. This contrasts with other ML methods that predict real values or classifications.