Gradient Descent update of variables have more than 1 path?

Phan_Phuoc · May 6, 2025, 6:08pm

Hi everyone,
I just read an article about VAEs and i can see that after go through the encoder, we will have means and standard deviations, then it will pass through two path, one for KL loss and one to compute reconstruction loss.
My question is, if we have two or more path to compute, then how we can update the variables ? In this case, KL loss and retcon loss have different space (or norm) to measure so we can just average it or sum it up cause too big.

balaji.ambresh · May 7, 2025, 3:10am

Please see Define the VAE as a Model with a custom train_step
section.

Gerald_Wainaina · May 9, 2025, 2:22pm

The two KL and Reconstruction losses are combined into a single loss. Since the losses are in ndifferent spaces, most implementations weight the KL term using a hyperparameter, B. During training the gradients of the combined losses with respect to the network parameters are calculated using backpropagation. The optimizer uses this gradients to update the parameters in a way that minimizes the total loss…

Topic		Replies	Views
Gradients of Multi output models Custom and Distributed Training with TF week-1	3	581	May 22, 2022
About gradient descent with multioutput networks Custom Models, Layers and Loss Functions with TF week-1	1	520	September 22, 2022
Simultaneous update of parameters w & b in Gradient Descent in Multiple Linear Regression Supervised ML: Regression and Classification week-2	5	472	May 30, 2024
Calculating loss with Multiple Input Single Output scenario AI Discussions ai-discussions	2	76	March 1, 2024
Why does the function for KL Reconstruction Loss in the Variational Autoencoder Lab contain input args `inputs` and `outputs` but not use them? Generative Deep Learning with TensorFlow week-3	2	567	April 28, 2023

Gradient Descent update of variables have more than 1 path?

Related topics