C1W1 - frequency extraction discrepancy between explanation and implementation

Tahir4 · January 2, 2025, 1:31pm

In C1W1 , it is shown that for a tweet the positive and negative frequencies are calculated based on the positive and negative frequencies of the correspnding unique words in the sentence.
Following tweet is given:
‘I am sad, I am not learning NLP’
positive frequency is calculated as 8, but ‘I’ and ‘am’ are actually repeated 2 times, so the frequency should be more than 8.
Uniqueness is only considered in this part. In coding session, we notice that uniqueness is not considered at all. It is also noticeable from the graded programming assignment, where we get different prediction results for the tweets ‘Great!’ and ’ Great, great’

conscell · January 5, 2025, 8:46pm

Hi @Tahir4,

Could you please provide a link to the lecture video you are referring to?

Adazhu · February 18, 2025, 8:03pm

Hi NLP Mentor,

I have a similar question about the word count for positive and negative. I don’t understand why “happy” and “because” are not counted for the positive tweets.

Course link:

Another, in the formular, Xm = [ 1, sum(freqs(w,1), sum(freqs(w,0)], could you explain the explain the parameters for 1, w? ( is it 1 presents one input str?)

A similar question also relevants to another video in C1. How to choose the words and count them?

Course link

image:

Thank you so much,
Ada

paulinpaloalto · February 18, 2025, 11:00pm

It’s been a long time since I watched that lecture, so I may be missing the context here, but I think the point is that they are in the vocabulary but are not in the tweet that is being processed there. That is “I am sad, I am not learning NLP”.

Topic		Replies	Views
Assignment inconsistent with course video: Frequencies for unique words or not? NLP with Classification and Vector Spaces week-module-1	4	493	April 7, 2023
Count words for positive and negative frequencies NLP with Classification and Vector Spaces week-module-1	3	545	May 26, 2023
Confusion in Logistic Regression Overview NLP with Classification and Vector Spaces week-module-1	5	365	October 30, 2023
Why do we take into account only unique words while adding Positive and Negative frequencies in the sentence? NLP with Classification and Vector Spaces week-module-1	1	531	February 7, 2022
Summing the frequencies of unique words, not DUPLICATES NLP with Classification and Vector Spaces week-module-1	4	531	June 15, 2022

C1W1 - frequency extraction discrepancy between explanation and implementation

Related topics