Variance calculation for sample

TMosh · February 6, 2024, 4:32am

The key distinction is whether your set of data is the entire population, or a sample drawn from a larger population.

The formulas to compute the variance are slightly different for the two cases.
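For reference, here are the two formulas side by side. The notation is my own assumption and may not match the course slides: $N$ is the number of data points, $\mu$ is the population mean, and $\bar{x}$ is the sample mean.

```latex
% Population variance: the data set is the whole population
\sigma^2 = \frac{1}{N} \sum_{i=1}^{N} (x_i - \mu)^2

% Sample variance: the data set is a sample from a larger population
s^2 = \frac{1}{N-1} \sum_{i=1}^{N} (x_i - \bar{x})^2
```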

The key reason the values are different is that with a sample you don’t know the true mean of the population, so you have to estimate it using the mean of the sample itself. The data points in a sample are, on average, closer to their own sample mean than they are to the true population mean, so dividing the sum of squared deviations by (N) systematically underestimates the population variance. Dividing by (N-1) instead compensates for this. This adjustment is known as Bessel’s correction.

The proof is fairly long, but you can find it in the Wikipedia article on “Variance” if you want the details.
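Short of reading the full proof, a small simulation can show what the divisor does. This is only a sketch under my own assumptions: the population is a standard normal distribution (true variance 1), the sample size is 5, and the variable names are made up for illustration. It draws many small samples and averages the two estimates.

```python
import random
import statistics

random.seed(0)  # fixed seed so the run is repeatable

SAMPLE_SIZE = 5   # deliberately small, where the difference is most visible
TRIALS = 20_000   # number of independent samples to draw

divide_by_n = []          # estimates using the (N) divisor
divide_by_n_minus_1 = []  # estimates using the (N-1) divisor

for _ in range(TRIALS):
    # Draw one small sample from a population whose true variance is 1.0
    sample = [random.gauss(0.0, 1.0) for _ in range(SAMPLE_SIZE)]
    divide_by_n.append(statistics.pvariance(sample))         # divides by N
    divide_by_n_minus_1.append(statistics.variance(sample))  # divides by N-1

avg_n = statistics.fmean(divide_by_n)
avg_n_minus_1 = statistics.fmean(divide_by_n_minus_1)

print("average estimate, divide by N:  ", round(avg_n, 3))
print("average estimate, divide by N-1:", round(avg_n_minus_1, 3))
```

With these settings, the (N) version should average out to roughly (N-1)/N = 0.8 of the true variance, while the (N-1) version should land close to 1.0.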

The end result is that you divide by (N) if you are testing the whole population, but you divide by (N-1) if you only have a sample of the population.
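Here is a minimal sketch of that rule in Python. The `variance` helper and the example data are hypothetical, written only to show the two divisors; the standard library's `statistics.pvariance` and `statistics.variance` are used as a cross-check.

```python
import math
import statistics


def variance(values, is_sample):
    """Variance of `values`; divide by N-1 for a sample, by N for a population."""
    n = len(values)
    mean = sum(values) / n
    squared_deviations = sum((x - mean) ** 2 for x in values)
    divisor = n - 1 if is_sample else n
    return squared_deviations / divisor


data = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up data; mean is 5, squared deviations sum to 32

population_var = variance(data, is_sample=False)  # 32 / 8 = 4.0
sample_var = variance(data, is_sample=True)       # 32 / 7, about 4.57

# Cross-check against the standard library
assert math.isclose(population_var, statistics.pvariance(data))
assert math.isclose(sample_var, statistics.variance(data))

print(population_var, sample_var)
```

If you use NumPy, note that `np.var` divides by (N) by default; pass `ddof=1` to get the (N-1) version.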
