C2-W3 Why do we have to transpose "logits" and "labels"

paulinpaloalto · November 5, 2023, 3:41pm

The particular case of compute_total_loss and why the transpose is required is discussed on this thread.

Each case will be determined by the particular circumstances: how the data is formatted and what the operations being used require. One other case I can think of was in the Logistic Regression discussions in DLS C1 W2. There we needed to transpose the weight vector w in order to make the linear activation work:

Z = w^T \cdot X + b

That was because Prof Ng chooses to use the convention that standalone vectors are column vectors. So w has dimensions n_x x 1 and then because X is defined to have dimensions n_x x m in that case (also related to the previous link) we need the transpose in order for the dot product to work.

Topic		Replies	Views
Question with C2W3 assignment Improving Deep Neural Networks: Hyperparameter tun week-module-3 , coursera-platform	1	237	January 23, 2024
Deep learning specialization, Course 2 week 3, coding exercise compute_total_loss Improving Deep Neural Networks: Hyperparameter tun coursera-platform	2	431	October 19, 2023
Course 2 Week 3 EX6 Improving Deep Neural Networks: Hyperparameter tun coursera-platform	2	427	December 12, 2023
Week 3 - Exercise 6 - Compute Total Loss Improving Deep Neural Networks: Hyperparameter tun coursera-platform	8	1349	April 3, 2024
Wk 3, Prog. exercise 6: do I have to reshape the "logits" and "labels"? Improving Deep Neural Networks: Hyperparameter tun coursera-platform	7	555	October 11, 2022

C2-W3 Why do we have to transpose "logits" and "labels"

Related topics