I’ve had a hard time figuring out what exactly neural networks actually do.
Andrew told us that this is a kind of automatic feature engineering, and since then I’ve been wondering how exactly each layer figures out its new features.
Let’s say we have the initial features x1, x2, x3.
I used to think that what the neural network does is come up with more sophisticated features - x1 * x2 * x3, x1^2 and so on. It turns out that this is not actually the case.
What actually happens is that each layer is just a transformation of space.
So with each layer we are kind of squishing and stretching space, trying to get the data into a shape where we can finally draw a line that separates it, as in the case below:
Now there are cases where your neural network is badly designed and the transformations you are applying are not appropriate. Example:
It is beautiful how we can generalize such a complex model, don’t you think?
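In case it helps to see the "each layer is a transformation of space" idea in code rather than pictures, here is a minimal sketch in Python/NumPy. It is my own toy example, not code from the article or the labs: the ring-shaped data and the random weights are assumptions purely for illustration, and a trained network would learn W and b so that the transformed points become separable by a straight line.

```python
import numpy as np

# Hypothetical 2-D points from two classes arranged as concentric rings,
# which no straight line can separate in the original space.
rng = np.random.default_rng(0)
theta = rng.uniform(0, 2 * np.pi, 200)
inner = np.c_[0.5 * np.cos(theta), 0.5 * np.sin(theta)]   # class 0
outer = np.c_[1.5 * np.cos(theta), 1.5 * np.sin(theta)]   # class 1
X = np.vstack([inner, outer])

# One hidden layer = an affine map (W, b) followed by a nonlinearity.
# These weights are random just to show the mechanics; in practice they
# are learned by gradient descent.
W = rng.normal(size=(2, 4))
b = rng.normal(size=(4,))

def layer(X, W, b):
    # Affine transformation: rotate, stretch and shift the space ...
    Z = X @ W + b
    # ... then ReLU, which folds the space along hyperplanes.
    return np.maximum(0, Z)

H = layer(X, W, b)   # the same points, now living in a new 4-D space
print(X.shape, "->", H.shape)
```

A classifier on top of H then only needs a linear boundary, which is the "finally draw a line" step described above.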
Amazing article! Thanks! But I have a question: is making such topological transformations of the original dataset actually equivalent to finding sophisticated features like the combinations of x^n that you mentioned?
Please do play with the course 2 week 2 lab for “ReLU activation”, because there you will see how we can use ReLU to approximate a curve with a piecewise linear function.
Wow! This is just awesome!
Also, were you able to find any visualizations (like the ones in your link) that show how a sigmoid function transforms the input space?
The neural network in the lab takes in one feature x, and with 3 neurons in the first layer, as shown on the left, it approximates an x^2 feature over a limited range of x with 3 piecewise linear segments.
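If it helps, here is a tiny sketch of that idea with hand-picked numbers (these are my own illustrative weights, not the values the lab actually learns): three ReLU units that switch on at x = 0, 1 and 2, summed with positive output weights, give a piecewise linear curve that matches x^2 at those points.

```python
import numpy as np

def relu(z):
    return np.maximum(0, z)

# Hand-picked weights, assumed purely for illustration (not the lab's values):
# unit i computes relu(x + b[i]), i.e. it switches on once x passes -b[i].
b = np.array([0.0, -1.0, -2.0])   # units turn on at x = 0, 1, 2
v = np.array([1.0, 2.0, 2.0])     # output-layer weights

def f(x):
    # Sum of 3 ReLU units: a piecewise linear function whose slope
    # increases by v[i] each time unit i becomes active.
    return sum(v[i] * relu(x + b[i]) for i in range(3))

x = np.linspace(0, 3, 7)
print(np.round(f(x), 2))     # the piecewise linear approximation
print(np.round(x ** 2, 2))   # the target curve; they agree at x = 0, 1, 2, 3
```

Every time another unit becomes active the overall slope jumps up, which is how a handful of ReLU neurons can bend a straight line into something that tracks a curve like x^2 over a limited range.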