Description:
I have a few questions regarding the graded assignment. Why do we need to sort eigen_vals, since the sorted values aren’t used? The indices from np.argsort(eigen_vals) are applied directly to the eigen_vecs matrix, which is then used to transform the input matrix X. Why do we need to sort the eigenvalues as well?
The point of the PCA algorithm is that you want to reduce the dimensions by removing the dimensions that are the least meaningful. Think about what the eigenvalues and eigenvectors mean: the eigenvectors form a basis for the transformation, and each eigenvector e_i is a direction along which the transformation acts as pure scaling: A \cdot e_i = \lambda_i \cdot e_i. That gives you the information you need: the larger the magnitude of the eigenvalue, the more meaningful that dimension is in the transformation. So you want to remove the dimensions starting with the smallest eigenvalues. I’ve never watched the lectures in NLP C3, but what I’m saying here is what I learned from Prof Ng when he discussed PCA in the original Stanford Machine Learning course. I would hope they would mention that in the lectures here.
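Here is a minimal sketch of that idea in NumPy (the data and variable names are illustrative, not the assignment’s): the covariance matrix of data stretched along one axis has one eigenvalue much larger than the other, and that eigenvalue marks the direction worth keeping.

```python
import numpy as np

# Toy data: 2D points stretched strongly along one axis, so one
# principal direction carries most of the variance.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2)) @ np.array([[3.0, 0.0], [0.0, 0.5]])

X_demeaned = X - X.mean(axis=0)
cov = np.cov(X_demeaned, rowvar=False)

# eigh is the right tool here because the covariance matrix is symmetric.
eigen_vals, eigen_vecs = np.linalg.eigh(cov)

# Larger eigenvalue => more variance along that eigenvector, so the
# least meaningful dimensions are the ones with the smallest eigenvalues.
assert eigen_vals[-1] > eigen_vals[0]
```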
Update: sorry, I probably missed the point of your question on the first pass. Yes, the argsort will accomplish what you really need, so it’s not clear why they would also care about having the eigenvalues themselves sorted as a separate thing.
Moreover, according to the docs, eigh returns the “eigenvalues in ascending order, each repeated according to its multiplicity.” So there’s no need for sorting at all; we can just reverse their order (and the corresponding eigenvectors’).
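To illustrate (the matrix here is just a small symmetric example, not the assignment’s data): eigh already hands back ascending eigenvalues, so descending order is a plain reversal, with the eigenvector columns reversed to match.

```python
import numpy as np

A = np.array([[2.0, 1.0],
              [1.0, 2.0]])  # symmetric, so eigh applies

eigen_vals, eigen_vecs = np.linalg.eigh(A)

# eigh documents that eigenvalues come back in ascending order...
assert np.all(np.diff(eigen_vals) >= 0)

# ...so descending order is just a reversal; the eigenvectors are the
# COLUMNS of eigen_vecs, so reverse the column order to keep them paired.
eigen_vals_desc = eigen_vals[::-1]
eigen_vecs_desc = eigen_vecs[:, ::-1]
```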
Also, the last instruction to multiply the transposed EVs by the zero-mean data and then transpose again seems a bit clumsy: (A^T \cdot B^T)^T = (B^T)^T \cdot (A^T)^T = B \cdot A
That’s a good point: since eigh already returns everything in ascending order, just reversing the result gets us what we need in a more efficient way.
It’s also a well known theorem that:
(A \cdot B)^T = B^T \cdot A^T
so you’re right that they could have expressed that more simply. I’ll take a look at the git issues on this course and file another one with your suggestions.
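A quick numeric check of that identity (shapes and names here are illustrative): the “transpose, multiply, transpose back” form and the direct product give the same matrix.

```python
import numpy as np

rng = np.random.default_rng(1)
X_demeaned = rng.normal(size=(5, 3))   # n samples x d features
E = rng.normal(size=(3, 2))            # d x k subset of eigenvector columns

# The clumsy form: transpose both factors, multiply, transpose the result.
clumsy = np.dot(E.T, X_demeaned.T).T

# By (A.B)^T = B^T.A^T this collapses to the direct product.
direct = X_demeaned @ E

assert np.allclose(clumsy, direct)
```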
Well, it’s slightly more subtle than that: what they really need are the indices of the elements in descending order. What they get from np.argsort is the list of indices, which they can then reverse and use to index the eigenvector matrix and select the corresponding vectors. They could have just used range(len(eigen_vals)) as an array and reversed that, but maybe that would have required more explanation and isn’t worth the saved sort. Note that the sort operation will be very cheap, because (as you pointed out) the values are already in the desired order.
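The two routes can be compared directly (toy values; the identity matrix stands in for the eigenvector columns): when the eigenvalues are already ascending, argsort-then-reverse and a plain column reversal select the same vectors.

```python
import numpy as np

eigen_vals = np.array([0.5, 1.2, 3.7])  # already ascending, as eigh returns them
eigen_vecs = np.eye(3)                  # stand-in for the eigenvector columns

# The assignment's route: argsort, reverse the index list, fancy-index.
idx_desc = np.argsort(eigen_vals)[::-1]
vecs_a = eigen_vecs[:, idx_desc]

# Equivalent when the values are already sorted: just reverse the columns.
vecs_b = eigen_vecs[:, ::-1]

assert np.array_equal(vecs_a, vecs_b)
```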
But I will file an enhancement request making the point about the needless transposes and the point that the OP of this thread makes about the sorted eigenvalues array being unnecessary.