Labels for tf multiclass problems

Tom_Fuller · February 7, 2023, 2:40am

In the digit recognizer example, how do we know that each index of the output activation corresponds to a particular symbol (digit)? That is, how do we know, or how did tf organize the units, such that the second index of the output activation vector corresponds to the z/probability of the symbol being “2”? What if the classes were “dog”, “cats”, “other” rather than digits?

TMosh · February 7, 2023, 2:42am

It’s based on prior knowledge of how the labels were applied to the data set.

For this assignment it’s just lucky that the information conveyed in the images of the digits happens to match some handy numerical codes.

If you were trying to identify other sorts of things, you’d have to know the mapping between the object names and their numerical codes.
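As a minimal sketch of such a mapping, using the “dog”, “cats”, “other” classes from the question: the order of `class_names` and the helper `decode` are assumptions made up for illustration, not something TensorFlow decides for you.

```python
# Hypothetical class list: the order is chosen by whoever labels the data set.
class_names = ["dog", "cats", "other"]

# Name -> integer code used as the label y[i] during training.
name_to_code = {name: code for code, name in enumerate(class_names)}

def decode(probabilities):
    """Return the class name whose output unit has the highest value."""
    best = max(range(len(probabilities)), key=probabilities.__getitem__)
    return class_names[best]

# Made-up output activations for one example, one value per class.
example_output = [0.1, 0.7, 0.2]
predicted = decode(example_output)
print(name_to_code, predicted)
```

The same list has to be used both when the labels are written and when predictions are decoded, otherwise the indices no longer mean the same thing.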

Tom_Fuller · February 7, 2023, 2:54am

Thanks for the quick response! I’m still not sure I completely understand. Is tf, in the background, creating a set of unique label values, with the order in which it discovers them becoming the order of the output vector values? For example, say the labels were strings 0…9 and the data set just happened to be ordered so that the first unique label discovered was “9” and the last was “0”. Would a[0] be assumed to relate to “9”, or is tf doing some other sorting of the labels?

TMosh · February 7, 2023, 3:18am

The labels are applied by whoever prepared the data set. TensorFlow is just a method for performing machine learning - it doesn’t do anything to create data sets.

Fundamentally, for each example X[i], someone created a corresponding y[i] value as its label. Then they saved the data set (X and y) as data files, and packaged it with the notebook.
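To make the link between y[i] and the output index concrete, here is a small sketch in plain Python. It assumes the labels are integer codes and the loss is a sparse categorical cross-entropy, which this thread does not state explicitly; the function names are illustrative, not TensorFlow's. Under that assumption the label value itself selects the output unit, so no "order of discovery" is involved.

```python
import math

def softmax(z):
    """Turn raw output values z into probabilities that sum to 1."""
    m = max(z)  # subtract the max for numerical stability
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]

def sparse_cross_entropy(z, label):
    """Loss for one example: the integer label indexes the output vector."""
    return -math.log(softmax(z)[label])

# Made-up output values for a 3-class model on one example.
z = [2.0, 0.5, -1.0]

# Unit 0 has the largest value, so a label of 0 gives the smaller loss.
loss_if_label_0 = sparse_cross_entropy(z, 0)
loss_if_label_2 = sparse_cross_entropy(z, 2)
print(loss_if_label_0, loss_if_label_2)
```

Training pushes unit k up for examples labeled k, which is why a[k] ends up meaning "class k" in whatever coding the data set author used.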

ai_curious · February 7, 2023, 2:39pm

You might want to take a look at the TensorFlow v2.11.0 documentation for tf.keras.utils.image_dataset_from_directory. From the docs, if your directory structure is:

main_directory/
...class_a/
......a_image_1.jpg
......a_image_2.jpg
...class_b/
......b_image_1.jpg
......b_image_2.jpg

Then calling image_dataset_from_directory(main_directory, labels='inferred') will return a tf.data.Dataset that yields batches of images from the subdirectories class_a and class_b, together with labels 0 and 1 (0 corresponding to class_a and 1 corresponding to class_b).
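As I understand the documentation, the inferred class order is alphanumeric by subdirectory name unless an explicit `class_names` list is passed. The sketch below only imitates that behavior with the standard library so the resulting codes can be inspected; it does not call TensorFlow, and `infer_labels` is a hypothetical helper.

```python
import os
import tempfile

def infer_labels(main_directory):
    """Assign integer codes to subdirectories in sorted (alphanumeric) order."""
    class_names = sorted(
        entry for entry in os.listdir(main_directory)
        if os.path.isdir(os.path.join(main_directory, entry))
    )
    samples = []
    for code, name in enumerate(class_names):
        class_dir = os.path.join(main_directory, name)
        for filename in sorted(os.listdir(class_dir)):
            samples.append((os.path.join(name, filename), code))
    return class_names, samples

# Build the example layout with empty placeholder files.
layout = {
    "class_b": ["b_image_1.jpg", "b_image_2.jpg"],
    "class_a": ["a_image_1.jpg", "a_image_2.jpg"],
}
with tempfile.TemporaryDirectory() as main_directory:
    for name, files in layout.items():
        os.makedirs(os.path.join(main_directory, name))
        for filename in files:
            open(os.path.join(main_directory, name, filename), "w").close()
    class_names, samples = infer_labels(main_directory)

print(class_names)
print(samples)
```

Note that class_b is created first here, yet class_a still gets code 0: the code comes from the sorted names, not from the order in which the folders or files were encountered.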

Works for larger sets of classes, too. You can do that every time and read the images into X and have TensorFlow create the labels, Y, automagically. Or, do it once and write your X and Y back out as .npy files or TF datasets, ready for quick reload. HTH
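A rough sketch of the "do it once and write it back out" option, assuming NumPy is installed. The arrays are made-up stand-ins for real images and labels, and the file names X.npy and y.npy are arbitrary choices.

```python
import os
import tempfile

import numpy as np

# Stand-in data: 4 blank 8x8 "images" and their integer class codes.
X = np.zeros((4, 8, 8), dtype=np.float32)
y = np.array([0, 1, 1, 0], dtype=np.int64)

with tempfile.TemporaryDirectory() as out_dir:
    # Write both arrays once...
    np.save(os.path.join(out_dir, "X.npy"), X)
    np.save(os.path.join(out_dir, "y.npy"), y)

    # ...and reload them later without re-reading the image folders.
    X_loaded = np.load(os.path.join(out_dir, "X.npy"))
    y_loaded = np.load(os.path.join(out_dir, "y.npy"))

print(X_loaded.shape, y_loaded)
```

The .npy files store only the integer codes, so the list of class names that gives those codes their meaning needs to be kept alongside them.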
