We are unable to understand the context here , please help to clarify
This is a lot of outputs, instead of just giving a single class label or maybe a class label and coordinates needed to specify bounding box the neural network unit in this case, has to generate a whole matrix of labels. What’s the right neural network architecture to do that? Let’s start with the object recognition neural network architecture that you’re familiar with and let’s figure how to modify this in order to make this new network output,a whole matrix of class labels.
Does the above statement meaning, final result of the U-net would be whole matrix of class labels right sir?