As I understand it, the bias term is added after W[l]*A[l-1] is computed (and summed up).
Therefore, I thought that in each hidden layer (if we do NOT use a convolutional layer) there is one bias term per layer. However, the exercise (quiz) in week 1 says that “There should be one per neuron”.
(I think it is true that there is one bias per filter…)
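For reference, here is a small sketch of how one could inspect the bias shapes directly (this assumes a TensorFlow/Keras setup, which is my own toy example and not from the course; the layer sizes are arbitrary):

```python
import tensorflow as tf

# Toy model just for inspecting parameter shapes; the layer sizes are arbitrary.
model = tf.keras.Sequential([
    tf.keras.layers.Conv2D(filters=8, kernel_size=3),  # conv layer with 8 filters
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(units=5),                     # dense layer with 5 neurons
])
model.build(input_shape=(None, 28, 28, 3))

conv, _, dense = model.layers
print(conv.get_weights()[1].shape)    # (8,)  -> one bias per filter
print(dense.get_weights()[1].shape)   # (5,)  -> one bias per neuron
```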
Hi TMosh, I share the same confusion and would like to confirm my understanding based on the conversation above:
So for a dense or fully connected network:

x[l] = a[l-1] * w[l] + b[l]
a[l] = g(x[l])
with b[l] ∈ ℝ,

the bias term b[l] is a single scalar value shared among all units/neurons in the layer (see the numpy sketch below),
while for a fully connected layer in a CNN:

x[l] = a[l-1] * w[l] + b[l]
a[l] = g(x[l])

in a fully connected layer with m neurons, the i-th neuron has its own scalar bias b_i[l] ∈ ℝ (so b[l] as a whole has m entries), and these biases are not shared among neurons.
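To make the question concrete, here is a minimal numpy sketch of the dense forward pass above (the sizes and variable names are made up for illustration). Note that both a per-neuron bias vector and a single shared scalar broadcast correctly in this formula, which is part of why I find it hard to tell the two cases apart from the equations alone:

```python
import numpy as np

n_prev, n_units = 4, 3                 # made-up layer sizes
a_prev = np.random.randn(n_prev)       # a[l-1]
w = np.random.randn(n_units, n_prev)   # w[l]

b_vector = np.random.randn(n_units)    # one bias per neuron ("one per neuron")
b_scalar = np.random.randn()           # one bias shared by the whole layer

x_vec = w @ a_prev + b_vector          # x[l] = a[l-1] * w[l] + b[l], per-neuron bias
x_sca = w @ a_prev + b_scalar          # same formula with a shared scalar bias
print(x_vec.shape, x_sca.shape)        # both (3,), so both versions run without error
```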