Matrix Multiplication Question

Amit_Misra1 · August 16, 2023, 11:54am

In the matrix multiplication example where Andrew is showing multiplying Atranspose [200 17] by a 3x2 matrix W, the output is a 1 by 3 matrix which then has the sigmoid function applied to give [1 0 1]. My question is, what does that mean practically? I thought the point was to come to the a conclusion of 1 or 0, not a 1 by 3 matrix?

I’m assuming it needs to go into the 2nd layer to output a scalar but I don’t see how that’s done in the course thus far. Thanks in advance.

TMosh · August 16, 2023, 6:13pm

If there are three units in the hidden layer, you’re going to get three outputs.

Note that technically they are floating point values between 0 and 1 - not just 0 or 1.

The hidden layer outputs are combined in the output layer.

Amit_Misra1 · August 17, 2023, 9:48am

Thanks TMosh. Does that output of [1 0 1] go into the final/purple layer then to give a scalar? I’m assuming this example does not end at [1 0 1] is what I’m getting at.

Ilya_Albrecht · August 17, 2023, 2:07pm

Layer 1 output is a 1x3 matrix of float values between 0 and 1 each, which are “predictions” from each neuron on layer 1. Those floats are then fed to level 2 to produce the final prediction, which is a single float value between 0 to 1, that can be converted to boolean true or false or just fixed point 1 and 0 values using some threshold.

TMosh · August 17, 2023, 2:46pm

Yes.

Nilanjan_Paul · August 18, 2023, 2:41pm

Imagine each neuron as a judge and the layers as a jury of members. Each time they are given a question or a situation, they give a response. Imagine the juries sitting in their benches and their reply is the value they present - the value presenting is the matrix based on their position.

In response to how the matrix data goes from one layer to the other and if they flow, when the verdict is given from a set of juries (neurons) - they can be passed to a court of higher importance (maybe the High Court). Now the verdict can be from a single court or from multiple courts. Thus the resultant matrix to the output layer can be a scalar matrix or a vector matrix having both rows and columns.

Amit_Misra1 · August 18, 2023, 2:58pm

Beautifully said thank you.

Topic		Replies	Views
The output of a neural network layer Advanced Learning Algorithms week-1	8	522	September 28, 2023
Sigmoid function within each layer Advanced Learning Algorithms week-1	6	572	November 27, 2022
What the heck is W? And how do you multiply it? Neural Networks and Deep Learning coursera-platform	4	602	March 23, 2022
Week 3, Video 3: Understanding matrix size Neural Networks and Deep Learning coursera-platform	7	657	January 29, 2025
Matrix multiplication lecture clarification - NN - Why do we transpose at all Advanced Learning Algorithms week-1	1	350	September 17, 2023

Matrix Multiplication Question

Related topics