Mini-batch working principle

Stpwlf · September 6, 2021, 10:58am

Hello, I have a question about how mini-batch gradient descent works. If I understand correctly, in each epoch I end up with several cost functions J{t} as well as several weight matrices W[l]{t} and biases b[l]{t}. How do I combine them?

Thank you for your help in advance!

kampamocha · September 6, 2021, 4:11pm

Hi @Stpwlf,

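It’s not that you need to combine different cost functions, weight matrices, and biases. It’s that you update them more often: once per mini-batch instead of once per epoch. There is only one set of W[l] and b[l]; the cost J{t} of mini-batch t is used to take a single gradient step on that same set, and the next mini-batch starts from the updated values.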
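
To make that concrete, here is a minimal sketch in plain Python. It is not the course's implementation: it assumes a toy one-feature linear model y ≈ w*x + b with a squared-error cost, and the names `minibatch_gd`, `batch_size`, `lr`, and `updates` are made up for illustration. The point is only that `w` and `b` exist once and are overwritten after every mini-batch, so nothing is left to combine at the end of an epoch.

```python
import random


def minibatch_gd(xs, ys, batch_size=4, epochs=200, lr=0.2, seed=0):
    """Fit y ~ w*x + b with mini-batch gradient descent (illustrative toy)."""
    rng = random.Random(seed)
    w, b = 0.0, 0.0              # a single shared set of parameters
    updates = 0                  # counts gradient steps taken
    idx = list(range(len(xs)))

    for _ in range(epochs):
        rng.shuffle(idx)         # new random split into mini-batches each epoch
        for start in range(0, len(idx), batch_size):
            batch = idx[start:start + batch_size]
            m = len(batch)
            # Gradients of this mini-batch's cost
            # J{t} = (1 / 2m) * sum((w*x + b - y)^2)
            dw = sum((w * xs[i] + b - ys[i]) * xs[i] for i in batch) / m
            db = sum((w * xs[i] + b - ys[i]) for i in batch) / m
            # Update the SAME w and b right away; the next mini-batch
            # starts from these new values.
            w -= lr * dw
            b -= lr * db
            updates += 1
    return w, b, updates


# Toy data generated from y = 3x + 1 (assumed for the example).
xs = [i / 10 for i in range(20)]
ys = [3.0 * x + 1.0 for x in xs]
w, b, updates = minibatch_gd(xs, ys)
print(w, b, updates)
```

With 20 examples and a mini-batch size of 4, each epoch makes 5 updates, whereas batch gradient descent would make 1 update per epoch.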

I hope I understood your question correctly and that my answer helps.

Stpwlf · September 6, 2021, 7:40pm

Hi @kampamocha,

Yes, that helped a lot. Thank you!
