W3E7 deep copy, and gradient equation

paulinpaloalto · October 23, 2023, 6:44pm

Yes, the comment is misleading: you should copy all the parameters, but the copy only matters if you do the updates with an “in-place” operation, as in:

W1 -= .... formula for updating W1 ....

If you use the code:

W1 = W1 - ... update term ...

then copying does not matter. This is all explained on this thread.
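To make the difference concrete, here is a minimal sketch, assuming NumPy arrays. The helper names `update_in_place` and `update_rebind`, the learning rate, and the gradient values are all made up for illustration and are not the assignment's code:

```python
import numpy as np

def update_in_place(parameters, grad, lr=0.1):
    # Hypothetical helper: "-=" writes into the caller's array.
    W1 = parameters["W1"]
    W1 -= lr * grad
    return {"W1": W1}

def update_rebind(parameters, grad, lr=0.1):
    # Hypothetical helper: "W1 - ..." allocates a new array, and the local
    # name W1 is rebound to it, so the caller's array is left untouched.
    W1 = parameters["W1"]
    W1 = W1 - lr * grad
    return {"W1": W1}

original = {"W1": np.ones((2, 2))}
grad = np.ones((2, 2))

# Rebinding version: the dictionary that was passed in is not modified.
new_params = update_rebind(original, grad)
rebind_left_original_intact = np.array_equal(original["W1"], np.ones((2, 2)))

# In-place version: the dictionary that was passed in is modified as a side effect.
new_params = update_in_place(original, grad)
in_place_changed_original = not np.array_equal(original["W1"], np.ones((2, 2)))

print(rebind_left_original_intact, in_place_changed_original)
```

In the in-place case the returned array and the caller's array are the same object, which is why a test that compares "before" and "after" values can be confused by it.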

But the way the comment is written is also a bit misleading. The point of “deepcopy” as opposed to just “plain copy” as in:

W1 = W1.copy()

is that you can do the deepcopy on the whole parameters dictionary in one shot like this:

parameters = copy.deepcopy(parameters)

and it solves the whole problem by duplicating all the memory objects, as explained on that thread I linked above.
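As a rough illustration of why the deep copy is the one that helps, the sketch below compares it with a shallow copy of the dictionary. It assumes a small made-up `parameters` dictionary with keys `W1` and `b1`; the shapes and values are placeholders:

```python
import copy
import numpy as np

# Placeholder parameters dictionary, not the assignment's real values.
parameters = {"W1": np.ones((2, 2)), "b1": np.zeros((2, 1))}

shallow = parameters.copy()        # new dict, but it still points at the same arrays
deep = copy.deepcopy(parameters)   # new dict and new arrays

shallow_shares = np.shares_memory(shallow["W1"], parameters["W1"])
deep_shares = np.shares_memory(deep["W1"], parameters["W1"])

# An in-place update on the deep copy leaves the original alone.
deep["W1"] -= 0.5
original_unchanged = np.array_equal(parameters["W1"], np.ones((2, 2)))

print(shallow_shares, deep_shares, original_unchanged)
```

A shallow `dict.copy()` would not be enough here, because the in-place update would still write into the arrays shared with the original dictionary.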

Note that the autograder here has since been fixed so that it no longer requires the copy, even with the “in-place” version of the code, but in Week 4 we do still need to do this.
