Help needed for get_top_covariances

sooolee · October 3, 2022, 6:27am

Hi,
I get this error for this function:

In terms of the shapes of cov_matrix, top_indices, relevant_indices, highest_covarainces, I think I’m getting it all correct… for the first test example, they are (4,4), (4,), (3,), (3,).
In my helper function get_top_mag_indices, I already addressed the descending order by sorting (-np.abs(values)).

Conceptually, my codes are:
cov_matrix = covariance_matrix_from_examples(the changes)
top_indices = get_top_magnitude_indices(cov_matrix for the target index row)
relevant_indices = (remove_from_list(top_indices, target_index)) - top 10
highest_covariances = relevant_indices for target index from cov_matrix

Where am I wrong? Thanks!

gautamaltman · October 4, 2022, 4:59am

Hi Sooolee,
Are you actually hard coding top 10 indices, or are you using the input top_n=3 for the failed example?

sooolee · October 4, 2022, 5:31am

Hi Gautamaltman,

I’m using [:tom_n]

Here’s my code. I will delete as soon as I hear from you. Thank you!

gautamaltman · October 4, 2022, 5:51pm

I deleted your code but will check against the solution I have.

gautamaltman · October 4, 2022, 6:26pm

Hello again Sooolee,
I copy and pasted your code to my notebook and it passes the unit tests. Would you kindly try restarting the notebook kernel and rerunning? Perhaps you have stale data?

sooolee · October 4, 2022, 9:00pm

Hi gautamaltman,

I tried restarting the kernel and reran, and get the same error:

But the other test block shows the orders correctly:

All previous cells have passed the tests. So I don’t know what is wrong. Can I send you a private message and share my codes including the helper functions?

gautamaltman · October 4, 2022, 10:10pm

Yes, i got your message, it is likely that one of the earlier functions is not quite correct even though it passed the earlier unit tests. I will check against the code you sent and get back to you.

gautamaltman · October 5, 2022, 1:17am

Hi Sooolee,
The issue is in the implementation of your remove_from_list() function. When I execute my code for this function for the following test case;
indices == [1 2 3 0 ], index_to_remove == 1 my code returns
[2 3 0] and passes the unit tests

when I execute your code on the identical data your code returns
[1 2 0] and fails

We can see that we want to remove the 1 from the input indices array, not 3. So maybe take another look at your implementation of this! You just got lucky ( or unlucky) passing the 1 unit test for remove_from_list()

sooolee · October 6, 2022, 9:56am

Hi gautamaltman,

Thanks a lot for looking into this! You are right, I was lucky but also unlucky to pass that code! hahaha. I see the point clearly. I am supposed to delete the element in indices based on the ‘value’ of index_to_remove, whose index is 0. What I was doing was removing index[2], where 2 was found by indices[index_to_remove].

So I tried to fix it. I tried to find the right index (in your example, it would be 0) using np.where(indices == index_to_remove). The code was going through without an error, but it wasn’t finding any value, returning an empty list.

I tried “indices.index(index_to_remove)”, but this one throws an error ‘numpy.ndarray’ object has no attribute ‘index’ in the function code cell, although it was outputting 0 correctly in a separate cell. Please see below.

I was going to use np.delete as before once I find the right index to delete. Can you further help me with this? Any insight here?

Thank you!

gautamaltman · October 7, 2022, 5:48am

Hi Sooolee!

I think the issue is that the Numpy where() method requires a Numpy array, not a Python list. Do the following:
indices = …
indices = np.array(indices)
print(np.where(indices == index_to_remove))

I think you will also need to index the returned Numpy array (even though it’s just a single value in the array)

Finally, if you insist on using a Python list instead of a Numpy array, you would want to use the .index() function like so:
indices.index(index_to_remove)

sooolee · October 7, 2022, 9:17am

Hi gautamaltman!

Thanks a lot!! Based on your comments I could make it work and I passed! However, I have some lingering clarification questions.

I thought indices were a numpy array because:

But I made sure it becomes an array. So this worked:
indices = np.array(indices)
ind = np.array(np.where(indices == index_to_remove))
Then I used np.delete.

My first question is, when I look up np.delete, the second part (deleting object) can be integer…

So having to turn the returned index to an array was a surprise. Are there any best practices or tips around this? Or it’s just way it is when you deal with arrays?

My second question: I tried Python list index() method - it worked too but I had to turn the indices to a list, which makes sense. Then I used np.delete, which worked. But it is confusing to me - how just using ‘list’ and ‘integer’ works when np.delete expects np arrays as inputs?

Again, thanks a lot for helping me understand and learn through this assignment. It’s been a tremendous help!!

Wendy · October 7, 2022, 11:12pm

@sooolee,
In case a similar situation comes up in the future: one Python tip that might not be obvious at first, but I think could have helped you with a simpler implementation for remove_from_list() where you wouldn’t have had to use np.array or np.delete:

With python, you can index directly with a where clause (i.e. it can go directly in the square brackets). As an example:

gautamaltman · October 20, 2022, 9:59pm

Hi Sooolee!

For your first question, I think you can use a list of indices for the second input to numpy.delete:
( it is labeled obj in the documentation)

Example:

ind = np.array([1,2,3,4])
idx = [1,2]
np.delete(ind,idx)
array([1, 4])
Not sure why you think it must be a Numpy array?

For your second question, I think Numpy does it’s best to convert the input list into a numpy array. It does make certain assumptions about the data type when it does this. Since a list can contain any data types mixed and matched and a numpy array must only contain one data type for the whole array. This is in order for Numpy arrays to benefit from optimization techniques. This example shows this property:

tst = [1,2,3,4,‘0’]
np.delete(tst, 3)
array([‘1’, ‘2’, ‘3’, ‘0’], dtype=‘<U21’)
We observe how Numpy has converted all the elements to string types and returned them in a Numpy array. However, if we are consistent with our data type it returns the values as expected:
tst = [1,2,3,4]
np.delete(tst, 3)
array([1, 2, 3])

Of course you are welcome to also provide Numpy an array like so:

np.delete(np.array(tst),3)
array([1, 2, 3])

It may be better practice to provide a Numpy array so that you are deliberate about what data types are in the array and don’t run into the issue where Numpy decides for you what the underlying data type is.

Joshua_Kessler · October 21, 2022, 7:21pm

having trouble with this too. How do I add a new thread on here?

Wendy · October 22, 2022, 12:34am

Welcome to the community, @Joshua_Kessler!

The way to start a new thread (aka a new topic), is to go to the appropriate category and click on the “+ New Topic” button.

If you’re having the same general problem described here, one thing I’d suggest is to start by looking at your remove_from_list() implementation. There’s a lot of discussion in this thread about using delete and converting between np arrays and python lists, but I don’t think it needs to be that complicated. Take a look at the Python tip I mentioned above and see if it helps you get unstuck. If not, go ahead and add a new topic to explain where you’re getting stuck and we’ll help you figure out next steps

sooolee · October 22, 2022, 3:49am

Good to know! Thanks, Wendy. I think it cannot be applied in this case because we are trying to delete a particular index, which is not orderly (such as < 2) , right?

sooolee · October 22, 2022, 3:55am

Thank you!! Very helpful!

Wendy · October 22, 2022, 4:48pm

@sooolee, < 2 was just an example, but the concept of using a where clause to choose what you want to keep should apply here. It doesn’t need to be a <. It could be >, ==, !=, … - whatever you need in your where clause to choose the items in the list you want to keep.

Go back and take a look at remove_from_list() and think about what you would need to put in the where clause to pick out the items you want to keep for new_indices. You should be able to implement your part of the function fairly simply in just one line using a where clause in the square brackets.

sooolee · October 23, 2022, 7:16am

Hi Wendy, thanks for insisting me to try this out. I was not reading / thinking clearly as I am trying to pick up from where I left after a long vacation!

You are right. I tried with np.where only as you suggested without using np.delete and it worked in just one line of code. I still had to use np.array(indices) to make sure Numpy gets an array (as advised by @gautamaltman) but it is very nice to keep it simple in one line! Thanks a lot, I really appreciate the advice!

Topic		Replies	Views
Help with get_top_covariances Course 2 Week 2? Build Better Generative Adversarial Networks week-2	6	679	July 10, 2022
Question of get_top_covariances function in C2W2 assignment Build Better Generative Adversarial Networks week-2	2	655	July 22, 2022
Last part of Programming assignment W2 Build Better Generative Adversarial Networks week-2	4	435	August 30, 2023
C2W2_Assignment UNQ_C1 get_top_variances -- meaning of assert error Build Better Generative Adversarial Networks week-2	8	684	July 20, 2023
C2W2 Assignment get_top_covariances unit test Build Better Generative Adversarial Networks week-2	5	562	July 9, 2023

Help needed for get_top_covariances

Related topics