Confusing or incorrect explanation of CLT - Central Limit Theorem - Continuous Random Variable

Hi guys,

I reckon that the explanation of CLT in the video “Central Limit Theorem - Continuous Random Variable” is incorrect or confusing.

First of all, the explanation and code about CLT in the lab after the video is correct. The rough idea is,

  1. Toss k dice each trial and record the mean value of these k dice.
  2. The mean approximates a normal distribution as k → infinity, but in practice k > 30 will suffice.
  3. We can draw a plot with k=30 and a large number of trials to verify the theorem. We can increase k to see the effect.

However, the idea conveyed by the video seems to be different. It seems to state,

  1. Toss 3 dice each trial and record the mean value of these 3 dice.
  2. Draw plots with different numbers of trials: 5, 25, 50 and 100.
  3. The plot for 100 trials will look most like a normal distribution while the plot for 5 won’t.

If this is indeed what the video meant to convey, then it is an incorrect explanation of the theorem.

The code below reflects my understanding of the video. The video seems to say that the distribution is not quite a normal distribution when the argument count is 5, but will approximate a normal distribution when count is 100. This is NOT CLT.
I wonder if the video meant something else.

code.py (934 Bytes)

def plot_video(count):
    array1 = np.array([np.random.choice(dice) for _ in range(count)])
    array2 = np.array([np.random.choice(dice) for _ in range(count)])
    array3 = np.array([np.random.choice(dice) for _ in range(count)])
    array = (array1 + array2 + array3) / 3
    sns.histplot(array, stat='frequency')

Hi @pang.luo

Sorry for the late response. I have read your post and forwarded this to our team. In fact this is a mistake from our end. You are correct to assume that the code in python describes what you saw in the lecture and this does not illustrate the CLT. In fact the resulting distribution is not even normal. It is symmetric around the mean, this is why it looks normal for large values. However the distribution is bimodal, therefore it cannot be Normal.

Thanks for pointing this out! We are recording new version of the video and we will update it as soon as possible.

Thanks,
Lucas

1 Like