I have a question about 3D convolution. It was recommended here that the filter should have the same number of channels as the input, and I would like to understand in more detail why that is.
When convolving each pixel vertically and horizontally in 2D, we were able to detect vertical and horizontal edges in the small patch of the image covered by the filter. Similarly, in the 3D case, wouldn't it be possible to find something meaningful by detecting features in only one of the R, G, or B channels? This has turned out to be a somewhat abstract question, but I hope my intention comes across.
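To make my question concrete, here is a small sketch of what I mean (the shapes, the Sobel filter, and the channels-first layout are just my own illustration, not anything from the recommendation I mentioned). A filter that spans all 3 input channels can still put all of its weights on a single channel, which seems equivalent to "detecting only R":

```python
import numpy as np

# Fake RGB image, channels-first layout: (channels, height, width).
rng = np.random.default_rng(0)
image = rng.random((3, 8, 8))

# A 3x3 vertical-edge (Sobel-like) kernel.
sobel_x = np.array([[-1, 0, 1],
                    [-2, 0, 2],
                    [-1, 0, 1]], dtype=float)

# Filter shape (3, 3, 3) matches the input's 3 channels, but only the
# R channel (index 0) carries nonzero weights.
filt = np.zeros((3, 3, 3))
filt[0] = sobel_x

def conv_valid(img, f):
    """Naive 'valid' convolution: slide f over img and sum elementwise
    products across all channels, producing a single 2D feature map."""
    c, kh, kw = f.shape
    h, w = img.shape[1] - kh + 1, img.shape[2] - kw + 1
    out = np.zeros((h, w))
    for i in range(h):
        for j in range(w):
            out[i, j] = np.sum(img[:, i:i+kh, j:j+kw] * f)
    return out

out_full = conv_valid(image, filt)
# Convolving only the R channel with only the R slice of the filter
# gives the same result, since the G and B weights are all zero.
out_r = conv_valid(image[:1], filt[:1])
print(np.allclose(out_full, out_r))  # True
```

So a full-channel filter can already express a single-channel detector as a special case; my question is whether restricting the filter to one channel from the start would also be meaningful.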