Covariance calculation is wrong?

Captain_Riggs · November 27, 2023, 11:48pm

Hi all.

In week one the ‘def multivariate_gaussian(X, mu, var)’ is used to calculate the probability values. One of the arguments is ‘var’ which in our case is in a 1-d vector format. So this code converts it to a shape of (n,n) so we can use it in the determinant calculation.

Changing to a (n,n) matrix

if var.ndim == 1:
var = np.diag(var)

Determination code which is taken from the longer line of code

‘p.linalg.det(var)**(-0.5)’

However ‘var’ in this (n,n) matrix is missing the cov(x,y) values which is the top right and bottom left of the var matrix.

So they have the default value of 0.

So the code ‘p.linalg.det(var)**(-0.5)’ will not calculate the correct determinant value and hence all the probability values will be wrong?

Should it not be convariance=np.cov(var, rowvar=False) then p.linalg.det(convariance)**(-0.5)’ ?

TMosh · November 28, 2023, 1:55am

Are you referring to the Course 3 Week 1 “Anomaly Detection” notebook, in the multivariate_gaussian() function in the utils.py file?

Captain_Riggs · November 28, 2023, 2:41am

Yep. That is the one.

Sorry should have been more clear about where that piece was code was from.

TMosh · November 28, 2023, 5:37am

I’ll pass it along to the course staff. Statistics is not my strong suit.

Captain_Riggs · November 28, 2023, 5:51pm

Thanks.

Do you know how long it takes for them to reply?

TMosh · November 28, 2023, 6:19pm

Sorry, I do not know.

Captain_Riggs · December 7, 2023, 12:23am

Any word back?

TMosh · December 7, 2023, 12:25am

Not yet.

Captain_Riggs · December 7, 2023, 12:28am

Hmm. Thanks for the reply. How does one find out how long this process takes?

TMosh · December 7, 2023, 12:30am

One cannot find that out. It is unpredictable.

I don’t work for DLAI (mentors are community volunteers), but I would guess that how quickly issues are addressed depends on their severity and the workload of the DLAI staff.

As your report is the only one in several years of this course being active, I’d guess it is not a high priority.

Captain_Riggs · December 7, 2023, 12:36am

Ok. Appreciate your side.

I want to speak to whoever governs you please.

Thanks

paulinpaloalto · December 7, 2023, 12:39am

Not sure what you mean by that. As Tom says, the mentors are just fellow students. We are volunteers, we do not work for DLAI and we don’t get paid to do this. We did not create any of the course materials and we cannot change them.

Do you mean that you want to talk to someone on the Course Staff or who actually works for DeepLearning.AI?

Captain_Riggs · December 7, 2023, 1:14am

Hmm. Confused myself. Let me work through the process as maybe I am barking up the wrong tree.

I am doing the Coursea Machine Learning Specialization course.

I have run into a problem with the course notes and I want to resolve that issue.

Part of the pre-course material pointed me here for any issues with the course material. Is this were I go for help with relation to the ‘Coursea Machine Learning Specialization course.’?

Thanks

paulinpaloalto · December 7, 2023, 1:15am

I am not a mentor for that course but Tom sent me the source code for that function. It’s a given function in the utility file for that assignment.

They explain the meaning of the arguments in the “docstring” of the function. The third argument var gives the covariance matrix, but it can be given in two different forms:

If it is given in vector form, then they create the covariance matrix by using var as the variance in each dimension of the distribution and creating a diagonal matrix with those values as the diagonal.

But the function will also accept var as an n x n matrix, in which case it simply uses it “as is”.

The determinant of the covariance matrix is used in the calculation which is explained in the course materials for Prof Ng’s CS 229 course at Stanford.

Notice that the covariance matrix also appears elsewhere in the computation, when they compute the Moore-Penrose pseudoinverse of it.

paulinpaloalto · December 7, 2023, 1:19am

Yes, it is. But we are only the first line of defense. There is no guarantee that we can answer every possible question that can be formulated. We will try to call for backups in cases in which we can’t answer, but there is no guarantee that will happen in as timely a fashion as you might wish.

Captain_Riggs · December 7, 2023, 1:41am

Thanks for the reply. I think communication is important in any course.

The people who run this course expect me to pay my money for the right to do this course. Note that this is a two way process as I then expect to be taught appropriately.

Any issues that arise I would expect, as I am paying, that such issues get resolved in a timely manner.

paulinpaloalto · December 7, 2023, 1:44am

Well as Tom and I have said, that is above our pay grade, which as we mentioned is zero.

But I think I answered your question above if you have had time to read it.

Captain_Riggs · December 7, 2023, 2:27am

Thanks. That was a big help in a weird way.

Those notes are way, way over my head BUT I managed to deduce that there are two covariance arguments. Diagonal and non-diagonal covariance. What you use depends on your data spread. The course notes used diagonal covariance whereas I used non-diagonal covariance. Something about whether is are relationship between the features or not. No relationships then you use diagonal covariance otherwise non-diagonal.

I will read more on this later tonight.

Thanks

Captain_Riggs · December 7, 2023, 2:30am

I understand.

Funny really, but to me it is you people, who are unpaid, that keep this course going.

So thanks for the hard work.

paulinpaloalto · December 7, 2023, 2:49am

You can see in the notebook that they use the estimate_gaussian function to compute the arguments that they later pass to multivariate_gaussian. And what they do is compute the “elementwise” variance for each feature across all the samples in the input dataset. So apparently that “diagonal” version of covariance, which doesn’t account for interactions between features, is good enough for the purposes here. So the whole distribution is really being computed independently “per feature”. At least that’s my interpretation of how they are doing things.

The simplistic point just looking at the implementation of that specific multivariate_gaussian function is that they are not computing the covariance but just taking it as an input.

Topic		Replies	Views
More detail about multivariat_gaussian numpy implementation Unsupervised Learning, Recommenders, Reinforcement week-1	17	387	November 14, 2023
Some additional explanation for multi feature Anomaly Detection would be great Unsupervised Learning, Recommenders, Reinforcement week-1	3	515	August 14, 2022
Discrepancy in the lecture video: Covariance of a Probability Distribution Probability & Statistics for Machine Learning &... week-2	2	412	July 7, 2023
Categorical variables in anomaly detection Unsupervised Learning, Recommenders, Reinforcement week-1	4	633	September 22, 2022
C3_W1_Anomaly_Detection_ the array p_val Unsupervised Learning, Recommenders, Reinforcement week-1	2	505	August 27, 2022

Covariance calculation is wrong?

Changing to a (n,n) matrix

Determination code which is taken from the longer line of code

Related topics