In this picture of the distribution strategy, num_batches is seen to increase by 1 per step. What about the case where we have more than one device? For instance, with three devices, each global batch is split into three per-replica batches, one for each device.
Does num_batches still increase by one for every batch in train_dist_dataset?
If it does, is total_loss the total loss summed up across all the replicas?
Edit: I think I have figured it out now. total_loss does not receive each replica's loss added separately; instead, on each step it receives the reduced (summed) loss across all the replicas, added once per distributed batch.
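To make this concrete, here is a minimal sketch of the usual custom-training-loop pattern with tf.distribute.MirroredStrategy. The model, loss, and dataset are hypothetical placeholders, not the code from the picture; the point is only to show that the loop iterates once per global batch, so num_batches increases by 1 regardless of the number of devices, while total_loss accumulates the strategy.reduce(SUM) of the per-replica losses:

```python
import tensorflow as tf

strategy = tf.distribute.MirroredStrategy()
GLOBAL_BATCH_SIZE = 64  # with 3 devices, this is split ~evenly across replicas

with strategy.scope():
    # Hypothetical model and optimizer, just to make the sketch runnable.
    model = tf.keras.Sequential([tf.keras.layers.Dense(1)])
    optimizer = tf.keras.optimizers.SGD()
    loss_object = tf.keras.losses.MeanSquaredError(
        reduction=tf.keras.losses.Reduction.NONE)

def compute_loss(labels, predictions):
    per_example_loss = loss_object(labels, predictions)
    # Scale by the GLOBAL batch size, so that summing the per-replica
    # losses yields the correct mean loss for the whole global batch.
    return tf.nn.compute_average_loss(
        per_example_loss, global_batch_size=GLOBAL_BATCH_SIZE)

def train_step(inputs):
    features, labels = inputs
    with tf.GradientTape() as tape:
        predictions = model(features, training=True)
        loss = compute_loss(labels, predictions)
    grads = tape.gradient(loss, model.trainable_variables)
    optimizer.apply_gradients(zip(grads, model.trainable_variables))
    return loss

@tf.function
def distributed_train_step(dist_inputs):
    # Runs train_step once on EACH replica, returning a PerReplica of losses.
    per_replica_losses = strategy.run(train_step, args=(dist_inputs,))
    # Reduce (sum) the per-replica losses into a single scalar.
    return strategy.reduce(
        tf.distribute.ReduceOp.SUM, per_replica_losses, axis=None)

# Hypothetical in-memory dataset.
x = tf.random.normal((640, 8))
y = tf.random.normal((640, 1))
dataset = tf.data.Dataset.from_tensor_slices((x, y)).batch(GLOBAL_BATCH_SIZE)
train_dist_dataset = strategy.experimental_distribute_dataset(dataset)

total_loss, num_batches = 0.0, 0
for dist_inputs in train_dist_dataset:
    # One iteration per GLOBAL batch: total_loss grows by the reduced
    # (summed) loss over all replicas, and num_batches grows by exactly 1,
    # no matter how many devices the batch was split across.
    total_loss += distributed_train_step(dist_inputs)
    num_batches += 1

train_loss = total_loss / num_batches
```

So with three devices, the loop body still runs once per element of train_dist_dataset: num_batches += 1 happens once per global batch, and the three replica losses arrive already summed into one scalar by strategy.reduce.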