I am struggling to understand what the “LINEAR” part refers to in the Deep Neural Network - Application assignment (section 3.1). As far as I can tell, the first LINEAR layer takes the flattened input vector and the second takes the output of the first activation, but what does “LINEAR” itself mean?
INPUT → LINEAR → RELU → LINEAR → SIGMOID → OUTPUT
What would be the best way to conceptualize or understand this?
Thank you!
Prof Ng explained this in the lectures. The processing at each layer of a neural network consists of two steps:
1. The “linear activation”, expressed by this formula:

Z^{[l]} = W^{[l]} \cdot A^{[l-1]} + b^{[l]}

2. The “nonlinear activation”, which is the elementwise application of the layer’s non-linear activation function to the Z^{[l]} value:

A^{[l]} = g^{[l]}(Z^{[l]})
In the first layer of the diagram you show, g^{[1]}() is ReLU and in the second layer g^{[2]}() is sigmoid.
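The two-layer forward pass in your diagram can be sketched in a few lines of NumPy. This is just an illustrative sketch, not the assignment’s code: the variable names (A0, W1, b1, …) and the layer sizes are arbitrary choices for the example.

```python
import numpy as np

def relu(Z):
    # Elementwise ReLU: max(0, z)
    return np.maximum(0, Z)

def sigmoid(Z):
    # Elementwise sigmoid: 1 / (1 + e^{-z})
    return 1 / (1 + np.exp(-Z))

# Illustrative shapes: 4 input features, 3 hidden units, 1 output,
# batch of 2 examples (one example per column, as in the course).
rng = np.random.default_rng(0)
A0 = rng.standard_normal((4, 2))                 # flattened input X
W1 = rng.standard_normal((3, 4)); b1 = np.zeros((3, 1))
W2 = rng.standard_normal((1, 3)); b2 = np.zeros((1, 1))

# Layer 1: linear step, then ReLU activation
Z1 = W1 @ A0 + b1      # Z[1] = W[1] . A[0] + b[1]  <- the "LINEAR" part
A1 = relu(Z1)          # A[1] = g[1](Z[1])

# Layer 2: linear step, then sigmoid activation
Z2 = W2 @ A1 + b2      # Z[2] = W[2] . A[1] + b[2]  <- the "LINEAR" part
A2 = sigmoid(Z2)       # final output, squashed into (0, 1)
```

So “LINEAR” is not a property of the inputs themselves; it is the Z = WA + b computation that every layer performs before its activation function is applied.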
Note that step 1 is what is called a “linear transformation” in mathematical terms. Well, if you want to go “full terminology”, it is actually an “affine transformation”: a linear map (multiplication by W^{[l]}) followed by a translation (adding the bias b^{[l]}). Strictly speaking, an affine transformation is only linear when the bias term is zero.