Week 2 - cosine similarity

Pavel_Grobov · May 17, 2023, 7:03pm

Hi,
In the lecture, Andrew says that the cosine similarity of two vectors u, v is u.T * v / (|u| * |v|),

but in the linear algebra course I learned that cosine similarity is u * v / (|u| * |v|).

Which of these formulas is right?

paulinpaloalto · May 17, 2023, 8:22pm

They are the same, although I don’t think you rendered the first one correctly. Prof Ng uses * to mean the “elementwise” product, but the operation in the numerator of that formula is a dot product. It’s just a notation issue. In the C5 W2 A1 assignment, they show the formula in mathematical form as:

\text{CosineSimilarity}(u, v) = \frac{u \cdot v}{\|u\|_2 \, \|v\|_2}

That is the same as the formulation you show from the Linear Algebra course. But if you then express that in numpy code and the vectors u and v are column vectors, then the numerator would be u^T v, i.e. np.dot(u.T, v).

TMosh · May 17, 2023, 8:51pm

Whether you need to transpose depends on the shape of the vectors. The math is a dot product. The transpose is really an implementation detail.
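To illustrate the point about shapes, here is a minimal NumPy sketch. The example vectors and the variable names (`sim_1d`, `sim_col`, and so on) are assumptions chosen for illustration and do not come from the lecture or the assignment.

```python
import numpy as np

# Example vectors (illustrative values, not from the course material).
u = np.array([1.0, 2.0, 3.0])   # shape (3,): a plain 1-D array
v = np.array([4.0, 5.0, 6.0])   # shape (3,)

# With 1-D arrays, np.dot is already the vector dot product: no transpose needed.
sim_1d = np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))

# With column vectors of shape (3, 1), the first operand must be transposed
# so that the matrix product (1, 3) x (3, 1) is defined.
u_col = u.reshape(-1, 1)
v_col = v.reshape(-1, 1)
sim_col = np.dot(u_col.T, v_col) / (np.linalg.norm(u_col) * np.linalg.norm(v_col))

print(sim_1d)          # a scalar, approximately 0.9746
print(sim_col.shape)   # (1, 1): same value, wrapped in a 1x1 matrix
```

Both versions compute the same number; the only difference is that the column-vector version returns it as a 1x1 array.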

rmwkwok · May 17, 2023, 10:21pm

Hello @Pavel_Grobov

I would put it this way:

To begin with, .T means the Transpose operation.

Then, when I learned linear algebra, I learned about vectors and matrices. For vectors, we have the dot product. For matrices, we have the matrix multiplication and the transpose. The transpose is for matrices.

When we took the dot product of two vectors, we wrote {\bf a} \cdot {\bf b} and did not care about their orientations, because there is only one way to dot two vectors.

When we learned about matrices, we brought our understanding of a vector to a higher level where we knew there were two possible variants: a row vector (a 1-row matrix) and a column vector (a 1-column matrix).

Now the orientations matter when we multiply two matrices, and the Transpose operation (.T) is introduced for matrices.

If we have two row vectors, in our basic understanding of vector algebra, we can write their product as {\bf a} \cdot {\bf b}. However, if we have two 1-row matrices, we write it as {\bf a}{\bf b}^T. In other words, {\bf a} \cdot {\bf b} = {\bf a}{\bf b}^T, or that the dot product of two vectors is equivalent to the matrix multiplication of a 1-row matrix and the Transpose of another 1-row matrix.

Similarly, if we have two column vectors, we have {\bf a} \cdot {\bf b} = {\bf a}^T {\bf b}. (Note that the \cdot symbol is used exclusively for vector-vector dot product but not for matrix-matrix multiplication)

Therefore, I think whether or not you have the Transpose operation there depends on the context. In textbooks or on slides, we can freely switch between contexts to support our use of symbols. However, when coding, we almost always represent a vector as either a 1-row or a 1-column matrix, and such a representation fixes our context to the matrix context, where we cannot leave out the Transpose operator.
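As a rough sketch of the matrix context in NumPy, assuming the vectors are stored as 2-D arrays, the following shows both transpose placements and what happens when the transpose is left out. The values and variable names are illustrative assumptions.

```python
import numpy as np

# Two row vectors stored as 1-row matrices, shape (1, 3).
a_row = np.array([[1.0, 2.0, 3.0]])
b_row = np.array([[4.0, 5.0, 6.0]])

# The same vectors stored as 1-column matrices, shape (3, 1).
a_col = a_row.T
b_col = b_row.T

# Row vectors: a . b = a b^T, i.e. (1, 3) x (3, 1) -> (1, 1)
row_result = a_row @ b_row.T

# Column vectors: a . b = a^T b, i.e. (1, 3) x (3, 1) -> (1, 1)
col_result = a_col.T @ b_col

# Leaving out the transpose makes the matrix product undefined:
# (1, 3) x (1, 3) has mismatched inner dimensions.
try:
    a_row @ b_row
    transpose_needed = False
except ValueError:
    transpose_needed = True

# Putting the transpose on the wrong operand gives an outer product instead:
# (3, 1) x (1, 3) -> (3, 3), which is not the dot product.
outer = a_col @ b_col.T

print(row_result, col_result)   # both are 1x1 matrices holding 32.0
print(transpose_needed)         # True
print(outer.shape)              # (3, 3)
```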

Cheers,
Raymond

Pavel_Grobov · May 18, 2023, 5:40am

Now I get it!
Thank you

Pavel_Grobov · May 18, 2023, 5:41am

Thanks for the answer!

Pavel_Grobov · May 18, 2023, 5:54am

Hi @rmwkwok,
Thanks for the detailed answer, as always.
