I have a conceptual question: why do we add biases to the filters when performing a convolution?
I understand that the optimal filter values will be learned by the NN, but I'm having a hard time figuring out why we add the bias b. Thanks!
NNs (and other types of supervised learning models) always add a "bias" value, so the system can (if needed) learn a constant offset that is added to all examples.
Right! It's a lot more obvious why this is done if you start by considering the case of Logistic Regression or a feed-forward network. There you are doing a general "affine transformation". The simplest case of that is a line in the plane, as in:
y = mx + b
In the multidimensional case:
Z = W \cdot A + b
If you omit the bias term, then you can only represent lines (or hyperplanes in the multidimensional case) that contain the origin. That is what mathematicians call "a significant loss of generality".
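To see the "contains the origin" point concretely, here is a minimal NumPy sketch (the shapes and seed are just illustrative, not from the course). Without b, an input at the origin can only ever map to the origin; with b, the layer is free to shift its output anywhere:

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.standard_normal((3, 4))   # weight matrix: 4 inputs -> 3 outputs
b = rng.standard_normal((3, 1))   # one bias per output unit

A = np.zeros((4, 1))              # an input sitting at the origin

# Without the bias, the origin always maps to the origin,
# so the learned hyperplane is forced through zero:
print(W @ A)        # [[0.] [0.] [0.]]

# With the bias, the output can be shifted away from the origin:
print(W @ A + b)    # exactly the bias values
```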
So it's the same argument here with convolutions, even though the transformation is harder to visualize: if you omit the bias term, you are putting a significant constraint on the possible solutions you can find. Why limit your solutions in that way if you don't have to? Of course, it's always possible that the best solution will end up having a zero bias, but why force that a priori? Just let Gradient Descent and backpropagation learn the solution that works best for the particular problem.
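Here is a rough sketch of how that plays out in a convolution (a toy single-filter "valid" convolution I wrote for illustration, not the course's implementation): each filter gets one scalar bias b, added at every output position, which lets the whole feature map shift away from zero:

```python
import numpy as np

def conv_single_filter(image, kernel, b):
    """Toy single-channel "valid" convolution (cross-correlation, as in
    most DL frameworks) with one scalar bias b shared by the whole filter."""
    ih, iw = image.shape
    kh, kw = kernel.shape
    out = np.zeros((ih - kh + 1, iw - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            # The same b is added at every spatial position of this
            # filter's output, just like the b in Z = W . A + b
            out[i, j] = np.sum(image[i:i+kh, j:j+kw] * kernel) + b
    return out

image = np.arange(25, dtype=float).reshape(5, 5)
kernel = np.ones((3, 3))
print(conv_single_filter(image, kernel, b=0.0))
print(conv_single_filter(image, kernel, b=10.0))  # identical map, shifted by b
```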
Of course this is an Experimental Science: you can run the experiment yourself and check how often we end up with zeros as the bias values. Hold that thought as we go through the course and check it in some cases just to confirm the intuition here.
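If you want to run that check, one possible way (a Keras sketch; the tiny model here is just a made-up placeholder for whatever trained model you want to inspect) is to loop over the layers and look at the learned bias values:

```python
import numpy as np
import tensorflow as tf

# A tiny stand-in network just so the loop has something to inspect;
# substitute your own *trained* model (biases are initialized to zero,
# so this check is only meaningful after training).
model = tf.keras.Sequential([
    tf.keras.Input(shape=(28, 28, 1)),
    tf.keras.layers.Conv2D(8, 3, activation="relu"),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(10),
])

for layer in model.layers:
    weights = layer.get_weights()
    if len(weights) == 2:                  # layers holding a kernel and a bias
        _, bias = weights
        print(f"{layer.name}: mean |b| = {np.abs(bias).mean():.4f}, "
              f"{np.isclose(bias, 0.0, atol=1e-3).mean():.0%} of biases near zero")
```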
Thanks @paulinpaloalto and @TMosh - makes sense!