How do different neurons and layers specialize?

Sashka_Warner · November 12, 2022, 4:48am

I have completed the first two courses in the machine learning specialization, and am almost done with the reinforcement learning section of the third course.

However, I still don’t understand how different neurons and layers focus on different parts of the input data.

For example, in the lesson Example: Recognizing Images, Dr. Ng states that in the first layer one neuron might look for lines oriented in one direction whereas another neuron might look for lines oriented in another direction. Dr. Ng goes on to say that in the second layer the neurons will focus on identifying parts of a face at a larger scale.

I understand the basics of forward and back propagation, and I understand how in a convolutional neural network convolutions can be used to detect edges.

However, I still don’t understand how the neurons and layers know how to specialize into focusing on different elements. For the neurons, I thought initially that if each neuron is created with different random weights and b values then that could help explain the specialization, but I thought I read somewhere else that you can initialize all neurons with the same weights.

If anyone can help me understand how the neurons and layers know to focus on different things I would really appreciate the help!

TMosh · November 12, 2022, 5:46am

The neurons don’t know what sort of pattern they are looking for. The weight values are simply adjusted to minimize the cost.

If detecting an edge, or a line orientation, is useful to minimize the cost, that’s just how the weights will turn out. There’s no pre-determined assumption about what the neurons will learn.

Andrew is just giving some intuitive context for how you might understand what the NN is doing, and why it gives useful results.

rmwkwok · November 12, 2022, 10:13am

Hello @Sashka_Warner,

I disagree that initializing neurons with the same set of weights guarantees neurons to capture different features. Please refer to this post for why neurons can differentiate. In the post, there is another link to a post with some mathematical explanations.

Raymond

Sashka_Warner · November 12, 2022, 11:01pm

Thank you so much for the reply, that’s helpful!

I might be missing the point, but if we continue the example of identifying edges and identifying facial components from an image:

Is it possible for neuron #1 in Layer 1 to identify edges, while at the same time neuron #2 in Layer 1 identifies facial components?

In other words, do neurons in a given layer always work at the same “scale”?

Sashka_Warner · November 12, 2022, 11:04pm

Hello @rmwkwok,

Thank you very much for the reply! That post you shared that shows the back propagation calculations is extremely helpful! I think that would be a great addition to the lecture slides!

Thanks,
Sashka

rmwkwok · November 12, 2022, 11:36pm

You are welcome, Sashka!

shanup · November 13, 2022, 6:54am

Hello @Sashka_Warner

Unfortunately, this is not something that we can consciously control. It is the magic of the math that decides all the pieces, so that they can all cohesively contribute to the output at the final layer.

We set the learning algorithm on a task to reduce the \frac {dJ} {dw}. We further allow it to apply the chain rule so that J can backpropogate to all the layers. From here on the math takes over. The end result being that different neurons learn different features (edges, parts of the image etc). In this manner, each neuron thereby contributes towards the creation of the final prediction.

TMosh · November 13, 2022, 7:48am

No guarantees about that. The layers learn whatever will give the minimum cost.

TMosh · November 13, 2022, 7:49am

It’s not really an example - it’s one of Andrew’s intuitive explanations for a very complicated topic.

Sashka_Warner · November 13, 2022, 6:02pm

Hello @shanup ,

Thank you very much for the reply! Got it - thank you for the help!

Best,
Sashka

Sashka_Warner · November 13, 2022, 6:03pm

Thank you very much for the follow-up reply! I see- thank you for answering my questions! I appreciate it!

Sashka_Warner · November 13, 2022, 6:03pm

Got it - makes sense! Thanks!

Topic		Replies	Views
Why do NN nodes tend to specialize in their behavior? Advanced Learning Algorithms week-module-2	5	540	April 23, 2023
C2W1 Individual Neurons and Classification Advanced Learning Algorithms week-module-1	18	1462	November 23, 2022
How do units within the same layer end up with different weights? Advanced Learning Algorithms week-module-2	3	710	July 28, 2022
Global Minima In Neural Network Improving Deep Neural Networks: Hyperparameter tun coursera-platform	2	500	May 9, 2023
Machine Learning specialization C2_W1,problem with tensor flow implementation Advanced Learning Algorithms week-module-1	3	554	April 21, 2023

How do different neurons and layers specialize?

Related topics