VideoInference in Code : TensorFlow implementation

Rajasekaran_Iyanu · July 9, 2022, 1:51pm

Hi All,

Having a doubt in Handwritten Digit classification" using TensorFlow. It was explained before that the typical input will be 8x8 matrix of Pixel intensity values. But why do we define just 25 Units in Layer1? Could you please explain?

Thanks
Raj
gotoshekar@gmail.com

balaji.ambresh · July 9, 2022, 3:20pm

The number of units in a layer is a hyperparameter that should be tuned based on the dataset and the problem constraints. This is a demo. Do play with the configuration to see how results vary.

Rajasekaran_Iyanu · July 9, 2022, 3:45pm

Thanks Balaji. I was thinking that in Layer1, 64 units might have been specified as the input, 1 per pixel, but it was just 25.

Few more questions:

Looks like the values for Wj, Bj will be changing during forward propagation. In Tensor-Keras framework, by chance, does the “Gradient Descent” (as per the activation of the previous layer) gets computed during forward propagation and the newer w, b values gets passed on to the layers to the right, so as to minimize the cost function ? Pls chime in.
Could you let me know the rationale to make decision about the number of Dense layers required for the final prediction? For eg: For Coffee Roasting prediction, 2 layers were opted, whereas, for Digit Rocognition prediction, 3 layers were opted.

balaji.ambresh · July 9, 2022, 5:30pm

Model parameters (i.e. weights and biases) change only during backward pass. Forward pass is used to compute the loss based on model prediction.

The goal is to maximize the predictive power of the model while obeying the constraints of memory & compute. Hyperparameters like number of layers and number of units per layer are decided on a trial and error basis.

Topic		Replies	Views
MLS course 2 week1 - layers and units in a layer Advanced Learning Algorithms week-1	4	537	November 13, 2022
C2_W1_Lab02_CoffeeRoasting Advanced Learning Algorithms week-1	3	507	February 7, 2023
Hand Written Image Recognition From Scratch Improving Deep Neural Networks: Hyperparameter tun	4	525	November 17, 2021
Course 2 week 3 tensorflow assignment Improving Deep Neural Networks: Hyperparameter tun	1	498	November 1, 2022
C2W1_Lab_Coffee Roasting in Tensorflow Advanced Learning Algorithms week-1	7	494	January 2, 2023

VideoInference in Code : TensorFlow implementation

Related topics