I just went through the topic videos (exponentially weighted averages), and it seems to me that setting V0 to zero and then using bias correction (that requires some computation) to fix the results is weird.
Setting V0 to theta0 (the first reading itself) does a better job and doesn’t require any extra computation, just one simple initial value.
Since I know the people behind this course are very smart, I assume there is a good reason for that.
What am I missing?
