Query regarding TF-IDF calculation

Rhythm_Dutta · June 13, 2026, 3:39pm

A doubt on how exactly do we calculate the tf-idf score for a document. As per the the course slide we calculate the tf for each doc and normalize them. We calculate the idf of each word as mentioned. All good. But my question is whether we multiply the idf of each word to the tf of each (word, doc) pair. But that doesn’t factor in the normalization, right? Do we normalize after multiplying the idf scores? Please help.

Deepti_Prasad · June 13, 2026, 4:21pm

@Rhythm_Dutta

TF is normalized by dividing the word count by the total number of words in that document, and then idf checks the rarity in the complete corpus.

then tf-idf score is calculated by tf x idf (please note there is no normalisation at this step.)

balaji.ambresh · June 13, 2026, 6:21pm

Topic		Replies	Views
Error in C1M2 of RAG for computed TF-IDF scores? Retrieval Augmented Generation week-module-2 , dl-ai-learning-platform	1	31	February 7, 2026
I don't understand TF scoring Retrieval Augmented Generation week-module-2 , dl-ai-learning-platform	3	55	November 2, 2025
Module 2: Keyword Search Retrieval Augmented Generation week-module-2 , coursera-platform	1	42	September 17, 2025
Sentiment frequency model vs. TF/IDF NLP with Classification and Vector Spaces week-module-1	1	520	August 22, 2022
C3_W2_Lab2_Ex1_indices for users? Unsupervised Learning, Recommenders, Reinforcement week-module-2	5	501	November 5, 2022

Query regarding TF-IDF calculation

Related topics