Context: This code is based on a 3-layer fully connected neural network coded from scratch (no libraries), trained on handwritten digits 0-9. This back-query code takes an ideal output vector (0.99 for the "0" node and 0.01 for the rest) and runs it backward through the network to recover the pixel values at the input, to see what the network's definition of a 0 is.
So my question is: after the inverse sigmoid is applied and that vector is multiplied by the transposed weight matrix, how is that supposed to give me the activation values from the previous layer? Because if I do a matrix product W·X = A, and then compute W.T·A, that does not give me X. So then how could back query be useful? It clearly is useful, because when I run the code it shows me the network's idea of a 0, but I can't piece together how it works in my head.
So, in short: how does back-query give me this visual representation if it gives completely different values for the activations going backward than going forward?
It's training a neural network like normal, but after training is done, just inputting an ideal output into the end of the network and going backward to see what the network has learned as its definition of a 0, in this case. There is no encoder/decoder.
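For reference, here is a minimal sketch of the back-query loop being described, not your actual code. The weight names `W_ih` and `W_ho` are hypothetical, and the clipping inside `logit` plus the rescaling step are assumptions needed to keep the inverse sigmoid defined at every layer:

```python
import numpy as np

def logit(y):
    # inverse of sigmoid; clip so log() never sees 0 or 1
    y = np.clip(y, 0.01, 0.99)
    return np.log(y / (1.0 - y))

def rescale(v):
    # squeeze values back into (0.01, 0.99) so logit() stays defined
    return 0.01 + 0.98 * (v - v.min()) / (v.max() - v.min())

def back_query(W_ih, W_ho, target):
    # target: ideal output column vector, e.g. 0.99 for the "0" node, 0.01 elsewhere
    hidden = rescale(W_ho.T @ logit(target))  # transpose, not a true inverse
    pixels = rescale(W_ih.T @ logit(hidden))
    return pixels
```

Note that the transpose step and the rescaling both throw information away, which is exactly where the "is this really an inverse?" question below comes in.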
I know you don't have a full autoencoder. I think you need one.
Back-feeding the output to re-compute an input is probably not useful. I've tried it before and it always looks like mush. I've never looked into it in detail.
When I plot the X with matplotlib, the image sometimes roughly resembles the number that was input at the back of the network and back-queried, but sometimes it looks like a complete mush of pixels.
I need to first know whether your maths are correct. If you can't prove that your maths are correct, why would you believe that your code was correct, given that your code is nothing but implementing the maths?
I don't skip the maths here. It's your choice whether you actually want me to have a look. If you do, please share what those question marks are in the form of math equations (not code).
The challenge here is that you want to reverse the forward propagation equation, so that you can provide the output label, and compute the most likely corresponding image.
So you need to write out the equation for forward propagation; it's going to look something like:
a_out = softmax(sigmoid(X * W1 + b1) * W2 + b2)
Then solve that equation so you have X = on the left side of the equation, with W1, W2, b1, b2, and a_out on the right.
If you can write out the math, then you can implement it.
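For concreteness, that forward pass can be sketched like this (the shapes and the exact bias handling are assumptions, since the thread only gives the one-line equation):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    e = np.exp(z - z.max())  # shift for numerical stability
    return e / e.sum()

def forward(X, W1, b1, W2, b2):
    # a_out = softmax(sigmoid(X * W1 + b1) * W2 + b2)
    a1 = sigmoid(X @ W1 + b1)
    return softmax(a1 @ W2 + b2)
```

One thing the algebra surfaces immediately: softmax maps many different pre-activation vectors to the same output (it is shift-invariant), so solving for X is already non-unique at the very first step, before the weight matrices even enter.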
@TMosh@rmwkwok Sorry for being confused but how was the response I sent not the math ? I wrote the equations that I used to get my answer. Z2= then a1= then z1= etc⌠Thank you guys for helping me just trying to understand what exact response you want.
Is it that you donât want for example a1=W2.TZ2 but instead a1=W2.Tinverse sigmoid (A2) ? So like all the steps for each and not shorten it buy using the variable from the previous calculation in the next one ?
Are you sure that transpose is the correct operation here?
Neglecting the bias for the moment.
In general, you have some A = Z * W from forward propagation.
Now you're trying to reverse that operation in order to get Z based on having A and W.
So in concept you want to multiply both sides by the "inverse of W", so you have Z = A * "W inverse".
But in practice, since we're using matrices, W only has an inverse if it is square. But W is almost never going to be square, except in very limited circumstances.
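A quick numeric check of this point, using the A = Z * W convention from above with arbitrary shapes:

```python
import numpy as np

rng = np.random.default_rng(0)
Z = rng.standard_normal((1, 5))
W = rng.standard_normal((5, 3))  # non-square: no true inverse exists
A = Z @ W                        # forward: A = Z * W
Z_back = A @ W.T                 # "reverse" with the transpose
# Z_back has the right shape but generally differs from Z
print(np.allclose(Z_back, Z))    # prints False
```

So the transpose gets you something with the right dimensions, which is why the plots are sometimes recognizable, but it is not an inverse.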
I still recommend going through the maths, because then you will be able to realize the limitations that can cause unreasonable results.
Going through the maths does not mean showing how the code does the math, but how you justify it. Writing the equations down is only the first step. You correctly pointed out the problem with the inverse, but how does that justify the transpose? You said the transpose can give you an OK result; is your result OK? You shared someone's code for "back query"; is that code always giving you an OK result? I can't defend the use of the transpose for someone else. You said the code comes from a book, so how did the book justify it?
Tom's pseudo-inverse is an inverse that works for non-square matrices, but it has its problems too, because a full-rank non-square matrix can be either over-determined or under-determined, and if you have studied linear algebra, you should know what that means (although a problem can also be a source of interesting findings). If you experiment carefully, you will understand more about that problem and probably find more problems along the way (e.g., what happens if the inverse sigmoid receives numbers outside the range between 0 and 1? What justifies just clipping the numbers off? If you can't justify it, what if you change the sigmoid to something else to avoid this problem?).
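One way to see the under-determined case concretely, using NumPy's Moore-Penrose pseudo-inverse `np.linalg.pinv` (shapes chosen for illustration):

```python
import numpy as np

rng = np.random.default_rng(1)
Z = rng.standard_normal((1, 5))
W = rng.standard_normal((5, 3))  # reversing gives 5 unknowns from 3 equations
A = Z @ W
Z_pinv = A @ np.linalg.pinv(W)   # minimum-norm solution, one of infinitely many
# Z_pinv reproduces A exactly, yet is not the Z we started from
print(np.allclose(Z_pinv @ W, A), np.allclose(Z_pinv, Z))  # prints: True False
```

The pseudo-inverse picks the smallest-norm Z among all vectors that map to A; the network's actual hidden activations need not be that particular one, which is one reason a back-queried image can look plausible without being "the" input.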
I can't guarantee you that "back query" will work, but you can study what problems it can face. After understanding them, maybe you can think about how to tackle those problems (if possible), or maybe you can be relieved and think about: (1) why don't I just display the training samples that rank highest? (2) should I search for another way?
This process may not yield a working "back query", but it will yield a more solid understanding of the problem, and THIS is what you can definitely get.
Wow, thanks @rmwkwok, that helped a lot. I need to find the reasons why the back query should or should not work, and then start changing things to solve or discover new problems. The back query might be good for the purpose in the book but bad in the context of other things. Thank you for this very insightful response; I will continue to explore this from different angles. I think going super deep into these fundamentals, without any libraries to abstract anything, will do wonders for me when I am manipulating or coming up with my own new NN architectures in the future, because I will be able to see the numbers flow through the networks perfectly and see how to iterate and come up with my own methods!