Hi everyone,
I encountered a question that bother me; Probably it will be answered during the course, but still:
While, for instance, the first layer outputs vector “a” with 5 values, and the second layer, which has 3 neuron units, takes it as input. What’s the difference between each logistic regression within the layer( how do those 3 neuron units differ)?
Since I thought they should be completely the same, since they all minimize cost function, no? Or do values of parameters (w,b) differ somehow from neuron units within one layer?
may be I misused some terminology, I would appreciate pointing it out as well as an answer to this question.
Thanks!