When extracting features from the positive and negative frequencies of words in tweets, we only use the unique words in the sentence. For instance, given a sentence like “This is really fine. It’s fine because …”, we add the positive and negative scores of the word “fine” only once. Why do we take this approach? Why don’t we add the scores for every occurrence of a word? What kind of bias would we introduce by taking all words into account rather than just the unique ones?
From my point of view, having “fine” twice in the sentence should increase the probability of positive sentiment. However, when only unique words are counted, the second “fine” doesn’t increase that probability.
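To make the question concrete, here is a minimal sketch of the two variants as I understand them. The `freqs` dictionary, its `(word, label)` keys, the counts in it, and the `extract_features` helper are all made up for illustration, so this may not match the actual assignment code exactly.

```python
def extract_features(tokens, freqs, unique_only=True):
    """Return [bias, positive_sum, negative_sum] for one tokenized tweet.

    freqs maps (word, label) -> count, with label 1 = positive, 0 = negative.
    """
    # Deduplicate the tokens when only unique words should contribute.
    words = set(tokens) if unique_only else tokens
    pos = sum(freqs.get((w, 1), 0) for w in words)
    neg = sum(freqs.get((w, 0), 0) for w in words)
    return [1, pos, neg]


# Hypothetical corpus counts, chosen only to show the difference.
freqs = {
    ("fine", 1): 10, ("fine", 0): 2,
    ("really", 1): 4, ("really", 0): 3,
}
tokens = ["really", "fine", "fine"]

unique_features = extract_features(tokens, freqs, unique_only=True)
all_features = extract_features(tokens, freqs, unique_only=False)
print(unique_features)  # [1, 14, 5]
print(all_features)     # [1, 24, 7]
```

With repeated words included, “fine” contributes its corpus counts twice, so both the positive and the negative sums grow. That second variant is the behavior I am asking about.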