Straight Up Stuck

JamesRiley · June 22, 2023, 5:23pm

I’ve done quite a bit of Googling and I’m just flat stuck on how to split my data… My python knowledge isn’t good enough. I understand the need to shuffle, and then create a for loop to iterate through each directory to establish that a given file size is greater than zero, and if it is to move it from source to training directory… and once 90% has been moved, move the rest to validation (also providing file size is greater than zero)… I just simply don’t know how to write that in Python!

balaji.ambresh · June 22, 2023, 5:51pm

Knowledge of python is assumed for this specialization. Please become familiar with python before moving forward.

Happy learning.

TMosh · June 22, 2023, 6:39pm

You could try creating a list of n integers randomly shuffled, then use just the first 10 percent of them as index values.

balaji.ambresh · June 26, 2023, 5:46am

Did you read the original post by @JamesRiley at the top of this topic?
No tensorflow code is required in implementing the functionality for splitting the dataset.

Nicola_Port · June 26, 2023, 6:04am

@JamesRiley I think this will help. It provides instructions for splitting data to create training and validation sets in python for machine learning. Split Your Dataset With scikit-learn's train_test_split() – Real Python

balaji.ambresh · June 26, 2023, 6:11am

Firstly, you’ll have to filter the images that are invalid and then perform the split. Again, you don’t need train_test_split to get the results. Relying on random module and list indexing is sufficient (do look at the imports at start of the notebook).

If you need help with basic python constructs, please confirm and I’d be happy to get the moderator involved. It’s possible that you might be right (although the course assumes knowledge of python) in asking for help at that level and if the moderator agrees, they can change the level of the course from intermediate and the instructor can add a python tutorial as well.

Topic		Replies	Views
Convolutional Neural Networks in TensorFlow - week 1 Convolutional Neural Networks in TensorFlow week-4	6	695	November 29, 2023
My split_data is taking more time to execute Convolutional Neural Networks in TensorFlow week-1	2	534	December 19, 2022
Programming Assignment - Exercise 1 and 2 Convolutional Neural Networks in TensorFlow week-1	6	509	March 18, 2023
C2 W1 problem on creating the split function Convolutional Neural Networks in TensorFlow week-1	1	403	August 12, 2023
Course 2 week 1 GRADED FUNCTION: split_data Convolutional Neural Networks in TensorFlow week-1	3	282	January 23, 2024

Straight Up Stuck

Related topics