For multi-label problem (not multi-class)
Should we use
- Output layer :
sigmoid for label 1
sigmoid for label 2
…
- Then what should be the loss function?
sum(logistic_regrssion_loss_function(I)) ?
Also, what will be the TF name for this loss function?
Thanks,
Lizhang