Week 1 Assignment 1 conv_forward (a small pitfall)

corey.s · July 8, 2021, 5:33am

Hello,

I was looking at this one for a while before I finally had the clarity to believe I was wrong. Funny how that works out, haha!

Some people may be familiar with the following…

# for ITER in range(start, stop, step):

I was able to resolve my errors fairly quickly when I realized that “stepping” as I’ve implemented it above and when used at a certain point ( or points) in the starter code would imply striding over the entries of the output.

What the function requires is striding the input indices to map to consecutive output indices.

So, if it helps anyone out there, don’t think…

[Input Side] → [Output Side]
0 → 0
1 → 1 * step
2 → 2 * step
3 → 3 * step
(etc.)

think…

0 → 0
1 * step → 1
2 * step → 2
3 * step → 3
(etc.)

I’ve tried keeping the idea somewhat abstract to not just give the whole thing away.
If this winds up saving anyone 45+ minutes of debugging, then I’ll know I’ve done my part!

TMosh · July 8, 2021, 5:55am

Thanks for your suggestion.

paulinpaloalto · July 9, 2021, 5:09am

Thanks for describing your thought process here. It makes sense. When I use words to describe what happens in conv_forward, the way I say it is: remember that you are looping over the output space. Then within the loops, the work is to calculate where you are in the input space that corresponds to the current position in the output space. The loops must cover every position of the output space. The striding happens in the input space.

Marcel_Noe · October 16, 2021, 6:19am

Thank you very much for pointing that out. Although I had a different error, your message actually got me to check the variables I used to calculate the indices… turns out, I was using i where i should have used h. That one kept me frustrated for several weeks. Thank you!

beckyjeff · October 30, 2021, 12:44pm

I was using h where I should be using w haha! Thanks guys I was convinced I was right but as usual it was a typo!

Lena_Shamseldin · November 13, 2021, 11:01pm

Hi, I have a small question…my code passed all tests however I dont seem to understand where the stride is accounted for since we are iterating the slices’ vertices one by one without skipping 2 when the stride=2 for example…I hope my question is clear enough:)

paulinpaloalto · November 13, 2021, 11:06pm

I think the key point is what I mentioned in this earlier reply on this thread. The loops in conv_forward and pool_forward are over the output space. Your point is exactly correct that we have to visit every point in the output space to provide a value. But the thing to realize is that the striding happens in the input space, right? There we don’t necessarily step one by one. That is the purpose of the stride.

Lena_Shamseldin · November 14, 2021, 7:47pm

Yes I understand, but I think my code does not take the stride into consideration when taking the input slice however it passed all tests which confuses me…in which step exactly should I have included it?

paulinpaloalto · November 14, 2021, 9:54pm

Update: as of February 2022, the tests have been enhanced to catch stride bugs in conv_forward.

The tests here do not catch this bug. That is a bug in the tests, which we have reported to the course staff, but they have not fixed it yet. The tests do not throw an error, but if you carefully examine your output values for the stride = 2 test case, you will see that not all of them agree.

The loops are over samples, h, w and c, right? So it is a 4 level nested loop. In the logic for handling each h and w value, you need to use the stride value to compute where that output location in the image maps to in the input image. Here’s a post from Tmosh that gives the formulas.

ibmzzjn · November 25, 2021, 3:29am

Thanks a lot! Very helpful tips although I’ve already spent more than 45 mins on this…

As an additional note, keep an eye on the range of the loop and make sure it is compatible with the output matrix shape.

Zeeshan_Rizvi · February 17, 2022, 2:52pm

THANK YOU FOR THIS! I had been stuck at this for an hour now, turns out I was making the same mistake. Your reply got me to check

Christopher_Wadswort · March 10, 2022, 6:33pm

This is a crucial point that I’ve been missing for an hour or three. Up until reading this, I was trying to stride on the output instead of on the input - this executed without errors but was not giving me the right Z.mean().

paulinpaloalto · March 10, 2022, 6:42pm

Glad to hear that you found the right post to get you past that issue. The search engine in Discourse actually works really well, unlike in the old Coursera Forums. The new Discourse forums have been in use for almost a year now, so there’s a lot of value in the saved history. You can pretty well assume that any problem you hit has been hit by others before you.

Nice work! Onward!

Bassem_Mansour · November 17, 2022, 7:22am

stride was accounted for in the formula for calculating the shape of the output. we are looping through the output shape, so stride was included.

nfattal · March 21, 2023, 1:15pm

Hi guys,

Referring to the convolution calculation:

Shouldn’t there be an if condition to break out of the for loop when the outer range of the slicing operation is > the padded image dimension?

Regards
Nazih

paulinpaloalto · March 21, 2023, 2:58pm

No, if you need that, it means your code is incorrect. Your shapes and the loop limits should match if you’ve written it correctly. There are a number of posts earlier on this thread which discuss the point. Here’s one that’s a good starting point. And here’s another thread that has a nice description of the way all the loops work in conv_forward.

One other thing to note is that it is a mistake to include the stride as a “step” in the loop limits. That is covered at least implicitly on both the threads that I linked above.

nfattal · March 22, 2023, 4:09am

Noted. Thanks @paulinpaloalto for your feedback.
I think I can make the range stop at the right point without an if statement.

I will go over what you shared.

Thanks
Nazih

dmugunddhan · June 18, 2023, 4:41am

Thanks a lot mate. I’ve spent hours debugging whats wrong and I wish I had seen your post earlier. The mistake I made was, I was adding the stride instead of multiplying. So if anyone worked the wrong logic as me please multiply the stride with h or w

Topic		Replies	Views
Exercise 3 - conv_forward - Got the code right but didn't understand it Convolutional Neural Networks	5	528	February 13, 2022
Exercise 3 - conv_forward - Assertion error Convolutional Neural Networks	6	576	May 29, 2021
Week 1 - Conv_forward (again :D) Convolutional Neural Networks	5	517	November 20, 2022
C4 W1 A1: Error in conv_forward and pool_forward Convolutional Neural Networks	4	387	November 25, 2023
Week 1 Assignment 1, conv_forward() Convolutional Neural Networks	4	696	October 7, 2021

Week 1 Assignment 1 conv_forward (a small pitfall)

Related topics