Why do we use the np.tile function?

Hi,
In C2_W1_Lab02_CoffeeRoasting_TF, it says: ‘Tile/copy our data to increase the training set size and reduce the number of training epochs.’
Why do we need to just copy and paste the training data? After all, it is the same data. How can training on the same data again assist the training process?

2 Likes

By increasing the data set size you increase the number of times the parameters are adjusted within one epoch. Say you have 10 training examples. If you duplicate them 10 times you get 100 examples in your training set, so you get 100 adjustments to w and b per epoch (assuming one update per example). Otherwise you would have to run 10 epochs to get the same number of adjustments to w and b.
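For example, here is a tiny sketch with made-up numbers (not the lab's data) of what tiling does to the training set:

```python
import numpy as np

# A tiny example: 10 training examples with 2 features each (made-up values).
X = np.arange(20).reshape(10, 2)
Y = np.arange(10).reshape(10, 1)

# Duplicate the data 10 times along the first axis (np.tile keeps the column count).
Xt = np.tile(X, (10, 1))   # shape (100, 2)
Yt = np.tile(Y, (10, 1))   # shape (100, 1)

print(Xt.shape, Yt.shape)  # (100, 2) (100, 1)

# With one update per example, a single epoch over Xt gives 100 parameter
# updates, the same number as 10 epochs over the original X.
```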

3 Likes

Thank you for the response.

1 Like

I had the same question, and multiplying the same data 1000 times still does not make sense to me. It is the same data; it will not provide any new information. And what does “reduce the number of training epochs” mean in this context? I am guessing that we will learn more about it next week when we learn more about the “compile” step?

1 Like

Hello @ealtan,

I think the reason for np.tile in that particular lab is simply to save some epochs. The idea is that, instead of running our original (before tiling) samples for 1000 epochs so that the algorithm sees them 1000 times, we tile the samples so that the algorithm sees them 1000 times within 1 epoch.

I believe the reason for saving those epochs is speed. There is overhead in TensorFlow when it switches from one epoch to the next. With the np.tile arrangement we avoid paying that overhead roughly 1000 times.

We can do this when we already know that repeating samples that many times won’t hurt the final performance.
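For illustration, here is a minimal sketch of the two arrangements, using random placeholder data and a made-up two-layer model standing in for the lab's network (batch size is left at the Keras default of 32, so both runs make roughly the same total number of gradient updates):

```python
import time
import numpy as np
import tensorflow as tf

# Hypothetical stand-ins for the lab's data: 200 examples with 2 features
# (the real lab loads and normalizes the coffee-roasting data first).
X = np.random.rand(200, 2).astype(np.float32)
Y = np.random.randint(0, 2, size=(200, 1)).astype(np.float32)

def make_model():
    model = tf.keras.Sequential([
        tf.keras.layers.Dense(3, activation="sigmoid", input_shape=(2,)),
        tf.keras.layers.Dense(1, activation="sigmoid"),
    ])
    model.compile(loss=tf.keras.losses.BinaryCrossentropy(),
                  optimizer=tf.keras.optimizers.Adam(learning_rate=0.01))
    return model

# Option A: original data, 1000 epochs -> the per-epoch overhead is paid 1000 times.
t0 = time.perf_counter()
make_model().fit(X, Y, epochs=1000, verbose=0)
print("1000 epochs on original data:", time.perf_counter() - t0, "s")

# Option B: tiled data, 1 epoch -> the same 1000 passes over the data,
# but only one epoch boundary, so the epoch-switching overhead is paid once.
Xt, Yt = np.tile(X, (1000, 1)), np.tile(Y, (1000, 1))
t0 = time.perf_counter()
make_model().fit(Xt, Yt, epochs=1, verbose=0)
print("1 epoch on tiled data:", time.perf_counter() - t0, "s")
```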

Cheers,
Raymond

2 Likes

Hello @rmwkwok,
I have run this lab keeping m=200 (without using np.tile()) and setting epochs=10,000, versus m=200,000 and epochs=10. I got loss=0.0016 for the first option against loss=0.002 for the second one. I was penalized by the running time, but the result was better. I do not know whether this time increase would be unacceptable for a large dataset. How can I know when to use each option (increasing the dataset and decreasing epochs, or vice versa)?
Thank you.

1 Like

Hello @Fac_Port,

The core idea of my last reply is only about time. That, I believe, is what the lab was considering when it decided to do that.

For your cases, I have no particular suggestion as to when to do what. I recommend that you study your problem and algorithm on a case-by-case basis and decide what to do on your own. It is also crucial for you to understand what is going on there. For example, why would the loss be different? Should they be different? What have you done to generate explanations, and what have you done to convince yourself that your explanation is right?

Good luck!
Raymond

1 Like

Thank you.

1 Like