C1_W3_lecture_nb_02_manipulating_word_embeddings

In this Jupyter notebook, there is a function `find_closest_word` to find the 'country'.
In the function there is a commented line

 # Get the norm of each difference vector. 
 # It means the squared euclidean distance from each word to the input vector
 delta = np.sum(diff * diff, axis=1)

But by the definition given in the slides, the Euclidean distance is calculated after taking the square root of the dot product.
This is creating confusion; what do you think?
I have tried various other methods (following the definition); please find them below.

# Get the norm of each difference vector. 
# It means the squared euclidean distance from each word to the input vector
delta = np.sum(diff * diff, axis=1)
  
  
# It means the Euclidean distance from each word to the input vector
# here np.sqrt has been added
delta = np.sqrt(np.sum(diff * diff, axis=1))
  
# by definition, below is the norm of each vector in one line
delta = np.linalg.norm(diff, axis=1)

And all of them give the same answer (the same closest word).
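The equivalence above is easy to check numerically. Below is a minimal sketch with made-up toy vectors (the data and array names are illustrative, not from the notebook): the three deltas differ in value, but the index of the minimum, and hence the closest word, is identical, because the square root is a monotonically increasing function.

```python
import numpy as np

# Hypothetical toy data: 4 candidate word vectors and one input vector.
rng = np.random.default_rng(0)
candidates = rng.normal(size=(4, 3))
vec = rng.normal(size=3)

# Difference between each candidate and the input vector.
diff = candidates - vec

# Squared Euclidean distance (no square root).
delta_sq = np.sum(diff * diff, axis=1)
# Euclidean distance (square root added).
delta_sqrt = np.sqrt(np.sum(diff * diff, axis=1))
# Same distance via NumPy's built-in norm.
delta_norm = np.linalg.norm(diff, axis=1)

# The distance values differ, but the argmin (closest word) is the same.
closest = np.argmin(delta_sq)
```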

If you just want to compare the differences in the word vectors, it doesn’t matter if you take the square root. That won’t change the relative values.

Since square root is computationally expensive, it’s a good idea to leave it out.

I am more concerned about the definition and what is written in the comment.
Yes, I know the result will be the same without using the square root, as I already showed in the code samples I posted.

For an example like this, with the 300-dimensional vectors used in the course, I don't think there will be a huge saving in computational cost. Even if there were, it should be mentioned in the comments.

Hey @kamlesh_karki,
Welcome, and we are glad that you could be a part of our community :partying_face: As Tom already explained, the results won't differ in either case; and as far as the comment goes, let me raise an issue with the team.

To keep the code as is, the only thing we would need to change is the following comment from:

Get the norm of each difference vector.

to:

Get the squared L2 norm of each difference vector.

I hope this resolves your issue.
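Putting the corrected comment in context, here is a minimal sketch of what such a function could look like. The signature and variable names are assumptions for illustration, not the notebook's exact code:

```python
import numpy as np

def find_closest_word(vec, embeddings, words):
    """Return the word whose embedding is closest to `vec`.

    `embeddings` is an (n_words, dim) array of word vectors and
    `words` lists the word for each row. Illustrative sketch only.
    """
    # Difference between each word vector and the input vector.
    diff = embeddings - vec
    # Get the squared L2 norm of each difference vector:
    # the squared Euclidean distance from each word to the input vector.
    delta = np.sum(diff * diff, axis=1)
    # The square root is skipped because it does not change the argmin.
    return words[np.argmin(delta)]
```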

Cheers,
Elemento
