Hey there,
Why does get_batched_dataset use BATCH_SIZE for the batch size, rather than BATCH_SIZE * strategy.num_replicas_in_sync, in lab C2_W4_Lab_3_using-TPU-strategy? Is this a bug in C2_W4_Lab_3_using-TPU-strategy? Compare with the other labs, C2_W4_Lab_2_multi-GPU-mirrored-strategy and C2_W4_Lab_1_basic-mirrored-strategy.
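To make it concrete, this is roughly the difference I mean. It is my own simplified paraphrase, not a verbatim copy of the lab code, and the BATCH_SIZE value is only illustrative:

```python
import tensorflow as tf

BATCH_SIZE = 64  # illustrative per-replica value, not necessarily the lab's

# C2_W4_Lab_3_using-TPU-strategy (as I read it): batch with BATCH_SIZE only
def get_batched_dataset(dataset):
    return dataset.shuffle(2048).batch(BATCH_SIZE, drop_remainder=True)

# C2_W4_Lab_1 / C2_W4_Lab_2: batch with the global batch size instead
strategy = tf.distribute.MirroredStrategy()
GLOBAL_BATCH_SIZE = BATCH_SIZE * strategy.num_replicas_in_sync
train_dataset = tf.data.Dataset.range(1024).batch(GLOBAL_BATCH_SIZE)
```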
thank you
Hi there,
The compute_loss function in this lab returns the loss with global_batch_size=BATCH_SIZE * strategy.num_replicas_in_sync, which is basically what the previous labs do when they set the GLOBAL_BATCH_SIZE parameter above.
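Roughly, that part looks like this. It is a sketch of the standard TF 2.x pattern rather than a verbatim copy of the lab, and it assumes BATCH_SIZE and strategy are already defined as in the notebook:

```python
import tensorflow as tf

# Reduction.NONE keeps the per-example losses so we can average them ourselves
loss_object = tf.keras.losses.SparseCategoricalCrossentropy(
    from_logits=True, reduction=tf.keras.losses.Reduction.NONE)

def compute_loss(labels, predictions):
    per_example_loss = loss_object(labels, predictions)
    # Divide by the GLOBAL batch size (per-replica batch * number of replicas),
    # so the gradients summed across replicas come out correctly scaled.
    return tf.nn.compute_average_loss(
        per_example_loss,
        global_batch_size=BATCH_SIZE * strategy.num_replicas_in_sync)
```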
@gent.spah thanks, but…
I see the same approach to computing the loss in those labs.
My point is about creating the dataset. The two approaches are different there, or is the batch size chosen arbitrarily?
As far as I can see, with two different computational strategies the way the data is fed in is a bit different, but the principle is the same. The dataset is the same: it is shuffled and then a batch size is chosen for feeding (per replica or global, depending on the strategy), and the rest of the computations add up to account for all of the resources used. The batch size in itself can be anything, but normally a power of 2 is chosen because of the binary logic of the hardware.
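Schematically, the two feeding patterns look something like this. It is a minimal sketch using the general TF 2.x distribution APIs, assuming the same per-replica BATCH_SIZE in both cases; the labs may wire things up slightly differently:

```python
import tensorflow as tf

BATCH_SIZE = 64  # per-replica batch size (illustrative value)
strategy = tf.distribute.MirroredStrategy()
GLOBAL_BATCH_SIZE = BATCH_SIZE * strategy.num_replicas_in_sync

# Global batching (mirrored-strategy labs): batch with GLOBAL_BATCH_SIZE and
# let the strategy split each global batch across the replicas.
global_ds = tf.data.Dataset.range(1024).shuffle(1024).batch(GLOBAL_BATCH_SIZE)
dist_global = strategy.experimental_distribute_dataset(global_ds)

# Per-replica batching (the style this thread says the TPU lab uses): each
# replica's input pipeline batches with BATCH_SIZE, so one training step still
# consumes BATCH_SIZE * num_replicas_in_sync examples overall.
def dataset_fn(input_context):
    ds = tf.data.Dataset.range(1024).shuffle(1024)
    ds = ds.shard(input_context.num_input_pipelines,
                  input_context.input_pipeline_id)
    return ds.batch(BATCH_SIZE, drop_remainder=True)

dist_per_replica = strategy.distribute_datasets_from_function(dataset_fn)
```

Either way, as long as compute_loss divides by the global batch size, each training step sees the same total number of examples, which is why the two conventions end up equivalent.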