we have learned in lectures that in PCA algorithm, the first step is to reduce X by the mean and then divide it by standard variation as seen in lectures:
But did it answer your question “why”? (since the thread kind of stopped). Did you try, for example, to add ‘Vilnius’ to the words list? What happens in all of the cases (X not changed, X demeaned, or X standardized)?