Hi
In the tensorflow implementation of coffee roasting problem, i can see the below plots, which show how each unit is responsible for different bad zones.
What i dont understand is, how does each unit learns to point to different bad zones?
I understand there are different initial weights, but how do the units not overlap ? Can someone please help to clarify?
This is the key to your question. Please check this post out for more explanations, in which you will find a link to another post that shows how you can make them “overlap” (more precisely, how to make the neurons to learn the same thing). If you get the idea of making them “overlap”, then you know how initialization avoid them from “overlapped”.