Find_closes_word function in Lab: C1_W3_lecture_nb_02_manipulating_word_embeddings

IH_Banana · April 4, 2022, 9:50pm

Hi guys, I’m not sure how the find_closest_word function is working.
In particular the effect of

delta = np.sum(diff * diff, axis=1)

I thought that we would need to use cosine similarity or euclidean distance to find which is the closest word… but the calculation of delta threw me off. Any advice?

def find_closest_word(v, k = 1):
# Calculate the vector difference from each word to the input vector
diff = embedding.values - v
#print(diff.shape)
# Get the norm of each difference vector.
# It means the squared euclidean distance from each word to the input vector
delta = np.sum(diff * diff, axis=1)
#print(delta.shape, delta[0].shape)
#print(delta[0])
# Find the index of the minimun distance in the array
i = np.argmin(delta)
# Return the row name for this item
return embedding.iloc[i].name

balaji.ambresh · April 5, 2022, 5:30am

Euclidean distance is \sqrt{\sum_i {(x_i - y_i)}^2}
diff represents x_i - y_i
diff * diff is element wise multiplication of the differences i.e. {(x_i - y_i)}^2
np.sum function results in \sum_i {(x_i - y_i)}^2
Instead of using the square root to find the closest vector, you can use the square since you are comparing distances in the same scale using np.argmin.

Topic		Replies	Views
C1_W3_lecture_nb_02_manipulating_word_embeddings NLP with Classification and Vector Spaces week-3	3	290	April 15, 2023
Issue in Programming Assignment: Deep Learning for Content-Based Filtering Unsupervised Learning, Recommenders, Reinforcement week-2	9	831	January 13, 2023
Can I have some help in Practice lab 2 - Exercise 1 Unsupervised Learning, Recommenders, Reinforcement week-2	11	587	August 25, 2022
Deep Learning for Content-Based Filtering problem in assignment Unsupervised Learning, Recommenders, Reinforcement week-2	8	725	January 9, 2023
Nearest neighbor vs cosine similarity NLP with Classification and Vector Spaces week-4	1	537	March 25, 2023

Find_closes_word function in Lab: C1_W3_lecture_nb_02_manipulating_word_embeddings

Related topics