Hello, I am going through the 2nd week and I had a quick question about why Andrew mentions that log y_hat should be large? https://www.coursera.org/learn/neural-networks-deep-learning/lecture/yWaRd/logistic-regression-cost-function
Shouldn’t we want the loss function to be as close to zero as possible? Then why do we want log(y_hat) to be large?
Thanks