How to evaluate-visualize clusters derived through PCA

Christian_Simonis · April 16, 2023, 11:45am

since you have conducted a compression (and probably not all variance were explained by that 2 features) you lost some information and cannot exactly reconstruct the original features but of course you can transform your data back to the original space: Feel free to take a look at inverse_transform() at sklearn.decomposition.PCA — scikit-learn 1.3.2 documentation

Also in this thread a PCA reconstruction was discussed, see also this repo w/ mnist dataset.

You could also check the cumulative variance which is explained by:

PC 1
PC 2

see also this repo. I would expect that PC1 has better clustering capabilities than PC2 and the residual information gain per PC would decrease if you would use more features.

Hint: did you already conduct an elbow analysis or did you calculate a silhouette score of your clustering problem, see also this blog post?

Feel free to add a plot and also some more context regarding the problem you are solving.

Best regards
Christian

Topic		Replies	Views
Clustering and PCA Unsupervised Learning, Recommenders, Reinforcement week-3	2	509	July 14, 2023
What info are you getting from these clusters, please? Unsupervised Learning, Recommenders, Reinforcement week-2	3	348	August 31, 2023
Significance of PCA and Explained Variance Ratio (explained_variance_ratio_) Unsupervised Learning, Recommenders, Reinforcement week-2	4	48	October 21, 2024
Need help with kaggle dataset. Please help! AI Discussions ai-discussions	2	103	January 15, 2023
C3-week2-PCA, Principal component analysis vs regularization Unsupervised Learning, Recommenders, Reinforcement week-2	1	21	January 7, 2025

How to evaluate-visualize clusters derived through PCA

Related topics