Yes, your observations are correct: because of the way these “feed forward” (fully connected) networks work, we need to “unroll” the 3D images (height x width x colors) into vectors. You might think that you lose the geometric information when you do that, but it turns out that the network can still learn to recognize the patterns even in the “flattened” form. You’re also right that there are several ways you could do the “unrolling”. Any of them will work as long as you are consistent and handle all the samples in the same way, just as you say.
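To make that concrete, here’s a small NumPy sketch (the shapes and data are just made up for illustration) showing two different unrolling orders. Both are valid; they just have to be applied uniformly to every sample:

```python
import numpy as np

# A tiny hypothetical batch: 2 samples, each a 4x4 "image" with 3 color channels.
images = np.arange(2 * 4 * 4 * 3).reshape(2, 4, 4, 3)

# Method 1: unroll each image row by row, with the channels interleaved per pixel.
flat_a = images.reshape(2, -1)                         # shape (2, 48)

# Method 2: group all the red values, then all green, then all blue, per image.
flat_b = images.transpose(0, 3, 1, 2).reshape(2, -1)   # also shape (2, 48)

# The two orderings produce different vectors for the same image...
print(np.array_equal(flat_a, flat_b))                  # False
# ...but each is lossless and invertible, so no pixel information is destroyed:
print(np.array_equal(flat_a.reshape(2, 4, 4, 3), images))   # True
```

The key point is that flattening is just a fixed, reversible reordering: the geometric relationships aren’t deleted, only no longer explicit in the layout, which is why the network can still learn them.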
Here’s a thread that discusses the mechanics of flattening the images and also addresses your point about the different unrolling methods.
Later in Course 4, we will learn about Convolutional Networks, which can actually process the 3D images in their original form with more powerful results. Stay tuned for that!