Since we primarily aim to recognize objects in images, why don’t we just use one of the image’s color channels, or a grayscale version of the image, as input and training data for the neural network?
I believe that just as humans can recognize objects in grayscale images, a neural network should be able to do the same. If we use all three RGB channels, the data in those channels are quite similar, which might lead to overfitting. Is that right?
Using grayscale images for object recognition is feasible and reduces computational cost, but the three RGB channels (Red, Green, Blue) carry additional information that can improve accuracy.
While grayscale may suffice for some tasks, RGB inputs often improve performance and generalization by exposing a richer set of features, thereby reducing, rather than increasing, the risk of overfitting relative to grayscale.
Right! And of course it also matters critically what your goal is. If the stated goal is to distinguish between grey cats and brown cats, then grayscale images will not suffice.
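To make that concrete, here is a small illustrative sketch (not from the thread itself): many libraries convert RGB to grayscale with the BT.601 luma weights, and under that mapping two clearly different colors can collapse to the exact same gray value, which is precisely the information you throw away.

```python
import numpy as np

# BT.601 luma weights, commonly used for RGB -> grayscale conversion.
LUMA = np.array([0.299, 0.587, 0.114])

# Two visibly different colors (values in [0, 1])...
reddish  = np.array([0.587, 0.0, 0.0])
greenish = np.array([0.0, 0.299, 0.0])

g1 = reddish @ LUMA   # 0.299 * 0.587
g2 = greenish @ LUMA  # 0.587 * 0.299

# ...map to the identical gray value, so no grayscale-only model
# can tell them apart.
assert np.isclose(g1, g2)
```

So whether grayscale "suffices" really does depend on whether the classes you care about differ only in color.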
@zonehuang personally (and I am not sure, I haven’t tried or tested it) I do think you have an interesting idea here. For the way ‘we’ do it, color adds something, but I would not necessarily call it the ‘crucial’ component of what exists in the image, apart from certain specific classifiers like the ones others have described.
So I wonder whether you could train on grayscale, which would probably save you a lot of time, and then later ‘add back in color’ via transfer learning.
I mean, upfront, you’d face the problem that the input layers are not the same size: a first layer trained on 1-channel input won’t accept 3-channel input (or perhaps you could just add null channels during training?).
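One common workaround for the size mismatch, sketched below with made-up weights rather than a real trained network: tile the grayscale-trained first-layer filter across three channels and divide by 3. An RGB image whose channels happen to be identical then produces exactly the same response as the original 1-channel filter, so the grayscale pretraining carries over as a starting point for color fine-tuning.

```python
import numpy as np

rng = np.random.default_rng(0)

# Pretend this 3x3 filter was learned on grayscale input: shape (1, 3, 3).
w_gray = rng.standard_normal((1, 3, 3))

# "Add back in color": replicate the filter over 3 channels, scaled by 1/3,
# giving an RGB-compatible filter of shape (3, 3, 3).
w_rgb = np.repeat(w_gray, 3, axis=0) / 3.0

# A tiny grayscale patch, and the same patch replicated into 3 channels.
gray = rng.standard_normal((3, 3))
rgb = np.stack([gray] * 3, axis=0)

resp_gray = np.sum(w_gray[0] * gray)          # 1-channel response
resp_rgb = np.einsum('chw,chw->', w_rgb, rgb)  # 3-channel response

# The adapted filter reproduces the grayscale behavior exactly on
# color-free input; fine-tuning can then learn true color features.
assert np.allclose(resp_gray, resp_rgb)
```

In frameworks like PyTorch or Keras the same trick is applied to the first convolution’s weight tensor before fine-tuning; the rest of the network is unchanged, since only the input layer sees the channel count.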