Things to remember -> dot product?

DTodd · August 31, 2022, 5:55am

Hi,
In the things to remember at the end of programming assignment 1 in week 1 of DLS course 4 on CNN it suggests that “A convolution extracts features from an input image by taking the dot product between the input data and a 3D array of weights (the filter).” However the math used in this section and the slides and videos never mentioned using a dot product, and in fact we only used simple multiplication.
I don’t see where the concept of a dot product was ever introduced in this section.
David

TMosh · August 31, 2022, 6:35am

I agree with you.

It seems to be a convention in the literature of CNN’s to refer to this element-wise product and sum process as a “dot product” when discussing convolution, even though it technically isn’t the same as the dot product you’re familiar with from 2D linear algebra.

paulinpaloalto · August 31, 2022, 6:07pm

This is a good point. Their wording is maybe a bit awkward. But note that the operation is more than just an elementwise multiply of the filter with the input: you then add up all those products to produce a scalar result for each position in the output. So it does sort of have the flavor of a dot product (elementwise multiply followed by addition), even if it isn’t exactly equivalent to the normal dot product. But now that I think \epsilon harder, what happens if you apply np.dot in the case where the inputs have more than two dimensions? Maybe we just need to understand what the definition of that would be in higher dimensions …

DTodd · August 31, 2022, 9:23pm

Thank you both, that help clarify things.
I was trying to reproduce the values we get using numpy element-wise multiplication followed by summing all the values (one window at a time), on a matrix foo and a kernel bar - both 2d matrices: It did not work when using np.dot(foo, bar) even if they were the same size.
I could do it as follows:

>>> import scipy.signal as sps
>>> z = sps.correlate2d(foo, bar, mode='same')  # to use padding and keep the original size
or
>>> z = sps.correlate2d(foo, bar, mode='valid')  # to allow the size to shrink

Best, David

TMosh · August 31, 2022, 10:17pm

That’s what Paul is referring to.

Topic		Replies	Views
Program assignment Week 1 question Convolutional Neural Networks coursera-platform	5	514	March 11, 2023
Course 4 Week 1, Step_by_Step Confusion over dot product not convolution Convolutional Neural Networks coursera-platform	3	550	March 19, 2023
Convolution_model_Step_by_Step_v1: How convolution extracts features from input image? Convolutional Neural Networks coursera-platform	3	467	June 10, 2023
W1 HW1 - Possible mistake Convolutional Neural Networks coursera-platform	4	655	June 30, 2021
Week 1 Assignment 1 Convolution Model Conclusion Convolutional Neural Networks coursera-platform	2	564	September 16, 2021

Things to remember -> dot product?

Related topics