Week 2: Optimization Methods

starboy · September 2, 2021, 11:19am

I have been trying to figure out this slicing done here.
Can you explain fig. 1 and 2 respectively?

Note: mini_batch_size (here 64)
Thank You.

kampamocha · September 2, 2021, 2:50pm

Maybe would be helpful if you compute those expressions in a python interpreter and see what values are you getting to better understand the slicing. However, I’m gonna try to explain it in simple terms.

The : in the first part says give me all elements from this dimension. The k in the second part is the mini_batch number, so for the first batch k=0, and the slice goes from 0 (0*64) to 64 (1*64), when k=1 tha batch goes from 64 (1*64) to 128 (2*64) and so on. In that way, for each iteration you have a batch of the corresponding batch size, until you arrive at the last batch which doesn’t necessarily have sufficient elements for a complete batch, which lead us to:
Assuming the previous iteration goes num_complete_minibatches times, then the slicing num_complete_minibatches * mini_batch_size: goes from the first element after the processed mini batches to the end of the array (the residual elements that doesn’t make a complete mini batch).

Please let me know if I understood your question correctly and if the response addresses it.

starboy · September 4, 2021, 8:26am

@kampamocha thank you so much for such a beautiful explanation.

Topic		Replies	Views
Week 2, exercise 2 Improving Deep Neural Networks: Hyperparameter tun	13	610	March 1, 2022
Mini batch Gradient Descent: Handling the last mini batch Improving Deep Neural Networks: Hyperparameter tun	6	618	June 10, 2021
Doubt in random_mini_batches (week 2 - exercise 2) Improving Deep Neural Networks: Hyperparameter tun	16	1187	January 10, 2022
Week 2 - Exercise 2 - Mini Batch Improving Deep Neural Networks: Hyperparameter tun	3	509	April 11, 2023
Week 2, Exercise 2 Right Output Improving Deep Neural Networks: Hyperparameter tun	7	534	July 1, 2022

Week 2: Optimization Methods

Related topics