Hi,
I’m wondering if there’s not an error in the video of https://www.coursera.org/learn/retrieval-augmented-generation-rag/lecture/MYS0a/keyword-search-tf-idf at 6:05 with the IDF scores computed for each words based on the given word occurence in documents (see screenshot below).
For instance, for the word:
making, DF = 3/5 so IDF should belog(5/3) = 0.22,pizza, DF = 2/5 so IDF should belog(5/2) = 0.4,without, DF = 4/5 so IDF should belog(5/4) = 0.1,a, DF = 4/5 so IDF should be the same as withoutlog(5/4) = 0.1oven, DF = 4/5 so IDF should belog(5/4) = 0.1
Could someone explain me what are the scores computed for each word if not IDF or log of IDF?
![]()
