How exactly does k-means know that a cluster centroid is closer to a given set of data points?

In k-means we don't know in advance whether a point will lie close to the $k^{\text{th}}$ cluster or not. That is, it has to check the distance to every centroid.

In the example shared in the following pic, how is it determined which cluster centroid is closest to a data point so that its index can be assigned?

Is it that each point's distance to every cluster centroid is calculated, and then argmin is used to determine which centroid is nearest to the data point?

Hello @tbhaxor, if you have 5 centroids and 10 datapoints, then 5*10 = 50 distances need to be computed to determine, for each datapoint, which centroid is the nearest. Yes, argmin would be used. You will practice this process in C3 W1 Assignment 1 for K-means.
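As an illustration (not the assignment code), here is a minimal NumPy sketch of that distance-then-argmin step, using made-up random data and centroids:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.random((10, 2))         # 10 datapoints in 2-D (made-up data)
centroids = rng.random((5, 2))  # 5 centroids (made-up positions)

# distances[i, j] = Euclidean distance from datapoint i to centroid j,
# giving a 10 x 5 matrix, i.e. the 5*10 = 50 distances mentioned above
distances = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)

# argmin over the centroid axis picks the nearest centroid for each datapoint
assignments = np.argmin(distances, axis=1)
print(assignments)  # one centroid index (0..4) per datapoint
```

The broadcasting trick computes all 50 distances at once; a double `for` loop over points and centroids does the same thing, just more slowly.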

Cheers,
Raymond


Thank you @rmwkwok, that cleared up my doubt.

Also, I see we can use clustering as a preprocessing step for supervised learning. I don't know whether this is done in practice or found to be effective.

You are welcome, @tbhaxor!

There are discussions and papers on the internet about clustering as a preprocessing tool! We can read them and see whether they make sense.
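One common variant of this idea is to use the distances to the fitted centroids as extra features for a supervised model. A hedged NumPy sketch, assuming the centroids have already been obtained from a k-means run on hypothetical data:

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.random((20, 3))         # hypothetical feature matrix: 20 samples, 3 features
centroids = rng.random((4, 3))  # assumed output of a k-means run with 4 clusters

# Distance from every sample to every centroid: a 20 x 4 matrix
dist_feats = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)

# Append the distances as new columns; X_aug could then be fed to any
# supervised model in place of the original X
X_aug = np.hstack([X, dist_feats])
print(X_aug.shape)  # (20, 7)
```

Whether these extra columns help depends on the dataset, so it is worth validating against a baseline model trained on `X` alone.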

Cheers,
Raymond
