Confusing or incorrect explanation of CLT - Central Limit Theorem - Continuous Random Variable

pang.luo · June 18, 2023, 4:03am

Hi guys,

I reckon that the explanation of CLT in the video “Central Limit Theorem - Continuous Random Variable” is incorrect or confusing.

First of all, the explanation and code about the CLT in the lab after the video are correct. The rough idea is,

Toss k dice each trial and record the mean value of these k dice.
The distribution of this mean approaches a normal distribution as k → infinity; in practice, k > 30 is a common rule of thumb.
We can draw a plot with k=30 and a large number of trials to verify the theorem. We can increase k to see the effect.
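
The three steps above can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the lab's actual code; the function name `sample_means` and the parameters `k`, `trials` and `seed` are assumptions made for this example.

```python
import random
import statistics

def sample_means(k, trials, seed=0):
    """Mean of k fair six-sided dice, repeated for `trials` independent trials."""
    rng = random.Random(seed)  # hypothetical fixed seed, only for reproducibility
    faces = range(1, 7)
    return [statistics.fmean(rng.choices(faces, k=k)) for _ in range(trials)]

means = sample_means(k=30, trials=20_000)

# A single fair die has mean 3.5 and variance 35/12, so the mean of
# k dice has mean 3.5 and variance (35/12) / k.
print("empirical mean:      ", statistics.fmean(means))
print("empirical variance:  ", statistics.pvariance(means))
print("theoretical variance:", (35 / 12) / 30)
```

Here `k` is the quantity the CLT is about: raising it narrows the distribution of the mean and brings its shape closer to a bell curve, while `trials` only controls how finely a histogram of `means` resolves that shape.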

However, the idea conveyed by the video seems to be different. It seems to state,

Toss 3 dice each trial and record the mean value of these 3 dice.
Draw plots with different numbers of trials: 5, 25, 50 and 100.
The plot for 100 trials will look most like a normal distribution while the plot for 5 won’t.

If this is indeed what the video meant to convey, then it is an incorrect explanation of the theorem.

The code below reflects my understanding of the video. The video seems to say that the distribution is not quite normal when the argument count is 5, but approximates a normal distribution when count is 100. This is NOT the CLT: count only changes the number of trials, while the number of dice averaged in each trial stays at 3.
I wonder if the video meant something else.

code.py (934 Bytes)

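import numpy as np
import seaborn as sns

dice = np.arange(1, 7)  # assumed definition (not shown in the post): faces of a fair die

def plot_video(count):
    # count = number of trials; every trial always averages exactly 3 dice
    array1 = np.array([np.random.choice(dice) for _ in range(count)])
    array2 = np.array([np.random.choice(dice) for _ in range(count)])
    array3 = np.array([np.random.choice(dice) for _ in range(count)])
    array = (array1 + array2 + array3) / 3
    sns.histplot(array, stat='frequency')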
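
To see what changing only the number of trials does, here is a small sketch that keeps 3 dice per trial and varies the trial count, without plotting. It uses only the standard library; the name `three_dice_means` and the fixed seed are assumptions made for this illustration, not something from the video or the lab.

```python
import random
import statistics

def three_dice_means(count, seed=0):
    """Mean of 3 fair dice for each of `count` trials (3 dice stays fixed)."""
    rng = random.Random(seed)  # hypothetical fixed seed, only for reproducibility
    return [statistics.fmean(rng.choices(range(1, 7), k=3)) for _ in range(count)]

for count in (5, 100, 20_000):
    means = three_dice_means(count)
    spread = statistics.pstdev(means)
    distinct = len(set(means))
    print(f"count={count:6d}  std of means={spread:.3f}  distinct values={distinct}")

# Theoretical standard deviation of the mean of 3 dice, independent of count.
print("theoretical std:", ((35 / 12) / 3) ** 0.5)
```

With more trials the empirical spread settles near the theoretical value and the number of distinct values never exceeds 16 (sums 3 through 18 divided by 3). The histogram gets smoother, but it is converging to the fixed distribution of the mean of 3 dice rather than moving closer to a normal distribution.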

lucas.coutinho · June 30, 2023, 7:20pm

Hi @pang.luo

Sorry for the late response. I have read your post and forwarded it to our team. This is indeed a mistake on our end. You are correct that your Python code describes what you saw in the lecture, and that it does not illustrate the CLT. In fact, the resulting distribution is not even normal. It is symmetric around the mean, which is why it looks normal for large values of count. However, the distribution is bimodal, so it cannot be Normal.
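
One way to check the shape claimed here is to enumerate the exact distribution of the mean of three fair dice instead of simulating it. The sketch below is an illustration only, with variable names chosen for this example; it lists the most probable values of the mean, which may be what "bimodal" refers to.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# Enumerate all 6**3 = 216 equally likely outcomes of three fair dice
# and count how many outcomes produce each sum.
sum_counts = Counter(sum(faces) for faces in product(range(1, 7), repeat=3))

# Exact probability of each possible mean (sum / 3).
pmf = {Fraction(total, 3): Fraction(ways, 216)
       for total, ways in sorted(sum_counts.items())}

top = max(pmf.values())
modes = [mean for mean, p in pmf.items() if p == top]

print("possible means:", len(pmf))
print("most probable means:", [str(m) for m in modes], "each with probability", top)
```

The enumeration gives 16 possible values, symmetric about 3.5, with two equally likely peaks at 10/3 and 11/3 (probability 1/8 each). A distribution confined to 16 points between 1 and 6 cannot be exactly normal, however many trials are drawn.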

Thanks for pointing this out! We are recording a new version of the video and will update it as soon as possible.

Thanks,
Lucas
