Hello,
When using the cosine similarity the vectors are normalized, cos(a, b) = dot(a, b) / (norm(a) * norm(b))
1/ It means that basically all embeddings are on an arc with radius 1, which means that the only thing that matters is the proximity on the circle arc. So this kind of proximity, unless materialized through a projection on the arc, is not captured:
2/ reg. separation planes, from what I see in the course all planes pass through (0,0), which means that you can’t have a “triangle” region as highlighted in the course.
I am aware that I am probably making some confusion, but it is not clear at all for me where.
3/ If cosine similarity is used for measuring embeddings, then why the embeddings are not normalized from the get-go? How / when to use the fact that embeddings are not normalized?
Thank you for clarifying!