This is how a 2-Layered Neural Network with a sample input of 3 Features is represented, and for which the equations are laid out.
3 Input Features resulted in a_[1]_1 to a_[1]_4, i.e. 4 Predictions instead of 3. Is 4 here the Number of Samples we are considering? In that case, will every Hidden Layer have exactly 4 Predictions, one for each of the samples in scope? If so, why does the Final Layer have only 1 Prediction?
If we were to extrapolate this understanding to n Features in the Input X, then the First Hidden Layer would have a_[1]_1 to a_[1]_(n+1) Predictions, and hence there would be "n+1" w and b parameters as well? Is this the right understanding?
If yes, then WHY are we adding one more parameter to the Predictions sent to the First Hidden Layer? Is there an underlying reason for this addition?
NOTE: Did we not create the equation z = w.T * X + b precisely so that we could do away with the θ.T * X calculation, which involves the θ_0 parameter with X_0 assumed to be 1, so that θ_0 * X_0 (i.e. θ_0 * 1) is replaced with b? Then what is the reason for adding 1 more entry to the First Hidden Layer's a_[1] Vector?
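As a side note, here is a minimal numpy sketch (the numbers and variable names are my own illustration, not from the course) showing that the two formulations compute the same z, with b playing the role of θ_0 * X_0:

```python
import numpy as np

x = np.array([0.5, -1.2, 2.0])            # one sample with n = 3 features

# Old ML-course style: prepend X_0 = 1 and fold the intercept into theta
theta = np.array([0.7, 0.1, -0.3, 0.9])   # [theta_0, theta_1, ..., theta_n]
x_aug = np.concatenate(([1.0], x))        # X_0 = 1
z_theta = theta @ x_aug                   # theta.T * X with the bias inside theta

# Deep-learning-course style: keep the bias as a separate term b
w = theta[1:]                             # [theta_1, ..., theta_n]
b = theta[0]                              # theta_0 * 1 becomes b
z_wb = w @ x + b                          # z = w.T * x + b

print(np.isclose(z_theta, z_wb))          # True: both give the same z
```

So the b term is only a notational replacement for θ_0; it does not by itself explain where the extra entries in a_[1] come from.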
I think my question has been answered in the C1W3 video "Gradient Descent for Neural Networks": for a 2-Layered Neural Network, the 1 Hidden Layer will have n_[1] output units, resulting in:
- w_[1] having the dimensions of n_[1] * n_[0], where n_[0] = n_X
- b_[1] having the dimensions of n_[1] * 1
If we were to extrapolate this understanding to a Multi-Layered N.N. with L Layers in addition to the Input Layer, with Layer L being the one that outputs y_hat, then:
- w_[l] will have the dimensions of n_[l] * n_[l-1]
- b_[l] will have the dimensions of n_[l] * 1
- a_[l] will have the dimensions of n_[l] * 1
where 1 <= l <= L, with L being the total number of layers (the hidden layers plus the output layer).
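To make these dimensions concrete, here is a minimal numpy sketch (the layer sizes are just my own example, not from the course) that initializes parameters for an arbitrary list of layer sizes and prints the shapes described above:

```python
import numpy as np

# layer_dims[0] = n_X (input features), the rest are n_[1], ..., n_[L]
# e.g. 3 input features, a hidden layer with 4 units, and 1 output unit
layer_dims = [3, 4, 1]
L = len(layer_dims) - 1   # number of layers, not counting the input layer

params = {}
for l in range(1, L + 1):
    # W_[l] is n_[l] x n_[l-1], b_[l] is n_[l] x 1
    params["W" + str(l)] = np.random.randn(layer_dims[l], layer_dims[l - 1]) * 0.01
    params["b" + str(l)] = np.zeros((layer_dims[l], 1))

for l in range(1, L + 1):
    print(f"W{l}: {params['W' + str(l)].shape}, b{l}: {params['b' + str(l)].shape}")
# W1: (4, 3), b1: (4, 1)
# W2: (1, 4), b2: (1, 1)
```

Changing layer_dims to, say, [3, 10, 1] would simply make W1 a (10, 3) matrix and b1 a (10, 1) vector; the hidden layer size is a choice we make, not something dictated by the number of input features.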
I was referring to my old notes from Andrew Ng's ML Course (which used GNU Octave & MATLAB), where N.N.s were covered extensively, and this video sort of provided the explanation as well. Keeping the question up in case anyone has a similar question and would like to look at the explanation for it.
Now, WHY a Hidden Layer has a different set of dimensions is answered by the reverse question: why not? We have no idea what a Hidden Layer looks like until we programmatically decide its dimensions, so it is when we get into the programming aspects that we get to decide these parameters (the number of units per layer is a design choice, a hyperparameter) - I think…