So in the youtube link: My Computer Taught Itself to Play Minecraft - YouTube. I saw very interesting about how Deep Neural Networks are able to make AI smarter. There’s something that i didn’t really understand though. So it says that we received inputs like the rarest ore which is Dirt, Stone, and Iron Ore. Then the output will be the movement, like moving right, left, front, and backward. I want to summarize how it works, and please correct me.
So it received inputs, and since this is a multi-class, we’ll use Softmax Regression. So it says that based on the weights, there are probabilities in the output. And i’m assuming that this youtuber used Hard Max since, when i see that, every time the weight initialize randomly, it only do certain output. Like there are 7 units in the output layer, but only 2 used. So does my summary seems to be true? If not, please do tell me.
Oh and also one more thing is that, i forgot that i’ve learned in this course, but how exactly that the NN knows the Y answer? Thanks.