Are there two meanings for 'channel' in Course 4?

Martinmin · February 10, 2023, 1:18am

Please see picture below.

input: n_h_L-1 * n_w_L-1 * n_c_L-1
output:  n_h_L* n_w_L * n_c_L

So the last dimension n_c_L-1 in the Input is the RGB channel (3), but in the output the n_c_L is the number of filters in the output volume, although in both cases they are called channels, or depths.

So the two “channels” have very different meanings. Is that right? In addition, for the channel in the input, is the RGB channel (3) is the most common value? are there any other possibilities for this value?

paulinpaloalto · February 10, 2023, 5:19am

Yes, at every internal layer of a ConvNet which is a “conv” layer, there are the input channels and the output channels. You will soon learn that there are other types of internal layers in a ConvNet besides “conv” layers: there can be pooling layers and fully connected layers as well. But convolution layers have an input which has a certain number of channels. If you’re dealing with the very first conv layer and the input is images, then there will typically be 3 channels in the input for RGB images. But if the inputs are greyscale images, then there will be only one input channel. If it is CMYK images or PNG images, there may be 4 input channels. Then the number of output channels is determined by the number of “filters” you define in that layer. Each filter matches the number of input channels and each filter (when applied to its input) produces one output channel. Therefore the number of filters determines the number of output channels. The number of filters is a “hyperparameter”, meaning a design choice made by the system designer. If the first layer outputs 8 channels, then the second layer will have 8 input channels. And so forth …

Prof Ng will discuss all this in more detail as we proceed through Week 1 of DLS Course 4.

Martinmin · February 10, 2023, 6:05pm

“ Each filter matches the number of input channels and each filter (when applied to its input) produces one output channel”
@paulinpaloalto What do you mean by ‘match’ here? You mean ‘element-wise multiplication’? I understand all other parts of your answers and thanks for that.

paulinpaloalto · February 10, 2023, 6:22pm

Prof Ng explains in the lectures what the atomic operation of convolution is. If you missed that, it would be a good idea to watch the lectures again from the beginning. It is elementwise multiplication between the filter and a particular position in the input (subject to stride) which includes the height and width and (input) channel dimensions, followed by the summation of those products and the addition of a bias term.

Martinmin · February 10, 2023, 6:34pm

Yes，thanks for the explanation.

Topic		Replies	Views
Number of channels not a multiple of the number of channels of the input Convolutional Neural Networks coursera-platform	5	611	March 5, 2022
Lecture - One Layer of CNN - Notations: # of filters should not be = # of channels Convolutional Neural Networks coursera-platform	4	534	December 3, 2022
Channels vs filters Convolutional Neural Networks coursera-platform	1	1014	July 1, 2021
Why different approach for setting filter channels? Convolutional Neural Networks coursera-platform	1	506	April 20, 2022
Simple Convolutional Network Example Convolutional Neural Networks coursera-platform	2	517	May 16, 2022

Are there two meanings for 'channel' in Course 4?

Related topics