Week 3 Assignment Binary cross entropy

nramon · June 16, 2021, 10:12am

In the exercise, y_pred is a tensor of logits, so the loss explicitly computes sigmoid(y_pred) first. Other than that, it is the same binary cross-entropy formula.
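To make that concrete, here is a minimal sketch in plain NumPy (not the assignment's TensorFlow code). The helper names `sigmoid` and `bce_from_probs` and the sample values are made up for illustration; the only point is that the logits go through a sigmoid before the usual formula is applied.

```python
import numpy as np

def sigmoid(x):
    # Map logits to probabilities in (0, 1).
    return 1.0 / (1.0 + np.exp(-x))

def bce_from_probs(labels, probs):
    # Textbook binary cross-entropy on probabilities, element-wise.
    return -(labels * np.log(probs) + (1.0 - labels) * np.log(1.0 - probs))

# Illustrative values only; not taken from the assignment.
logits = np.array([2.0, -1.0, 0.5])
labels = np.array([1.0, 0.0, 1.0])

# Logits must be converted to probabilities before using the formula.
losses = bce_from_probs(labels, sigmoid(logits))
print(losses.mean())
```

This direct form is fine for moderate logits, but it can lose precision or overflow for logits of large magnitude.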

Here is the derivation (from the source code):

  For brevity, let `x = logits`, `z = labels`.  The logistic loss is
        z * -log(sigmoid(x)) + (1 - z) * -log(1 - sigmoid(x))
      = z * -log(1 / (1 + exp(-x))) + (1 - z) * -log(exp(-x) / (1 + exp(-x)))
      = z * log(1 + exp(-x)) + (1 - z) * (-log(exp(-x)) + log(1 + exp(-x)))
      = z * log(1 + exp(-x)) + (1 - z) * (x + log(1 + exp(-x)))
      = (1 - z) * x + log(1 + exp(-x))
      = x - x * z + log(1 + exp(-x))
  For x < 0, to avoid overflow in exp(-x), we reformulate the above
        x - x * z + log(1 + exp(-x))
      = log(exp(x)) - x * z + log(1 + exp(-x))
      = - x * z + log(1 + exp(x))
  Hence, to ensure stability and avoid overflow, the implementation uses this
  equivalent formulation
      max(x, 0) - x * z + log(1 + exp(-abs(x)))
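The difference between the two forms can be checked numerically. The following is a rough NumPy sketch, not the library's actual implementation; the function names and test values are assumptions chosen for illustration. It compares the unsimplified form `x - x * z + log(1 + exp(-x))` with the stable form from the derivation.

```python
import numpy as np

def naive_bce_with_logits(labels, logits):
    # x - x * z + log(1 + exp(-x)); exp(-x) overflows for very negative x.
    return logits - logits * labels + np.log1p(np.exp(-logits))

def stable_bce_with_logits(labels, logits):
    # max(x, 0) - x * z + log(1 + exp(-|x|)); the exponent is never positive.
    return (np.maximum(logits, 0.0)
            - logits * labels
            + np.log1p(np.exp(-np.abs(logits))))

# Illustrative values, including two extreme logits.
labels = np.array([1.0, 0.0, 1.0, 0.0])
logits = np.array([3.0, -2.0, -1000.0, 1000.0])

stable = stable_bce_with_logits(labels, logits)

# Silence the overflow warning that the naive form triggers at x = -1000.
with np.errstate(over="ignore"):
    naive = naive_bce_with_logits(labels, logits)

print("stable:", stable)
print("naive: ", naive)
```

For the moderate logits the two versions agree. For the logit of -1000 with label 1, the naive form returns inf because exp(1000) overflows, while the stable form returns 1000.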

Good luck with the assignment!
