TensorFlow (Keras) model.predict() is much slower than numpy implementation

In the Coffee Roasting labs (C2_W1_Lab02_CoffeeRoasting_TF and C2_W1_Lab03_CoffeeRoasting_Numpy), the TF implementation of prediction (the model.predict function) is much slower than the custom NumPy implementation (my_predict). This is clearly visible in the last cells of both labs, which generate the final graphs: on my computer the TF version takes 69 seconds, while the NumPy one takes 1 second.

Why is that?

It might be because the NumPy package is better optimized for matrix operations than TensorFlow tensors are!


The “Coffee roasting using numpy” lab doesn’t do any training, it uses pre-computed weights (which came from the TensorFlow solution) and only does predictions.

When TensorFlow did the NN training, it took over 6000 iterations. The numpy lab wasn’t asked to do that level of work.

I was running only the last step with the charts, which calls model.predict in the TF version - by that point the model has already been trained by the earlier model.fit step.

To make it easier, I removed the plotting code; after all the training I left only one call each to model.predict and my_predict, and measured their times on the same test data using %timeit.

NumPy version - about 1 millisecond per call (screenshot of %timeit output).

TF version - about 50 milliseconds per call, 50 times more! (screenshot of %timeit output).
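For anyone who wants to reproduce this outside a notebook (where the %timeit magic isn't available), here is a rough sketch using the standard timeit module. The network here is a made-up stand-in with the lab's 2 → 3 → 1 sigmoid architecture and random weights, since the real trained weights come from the lab itself:

```python
import timeit
import numpy as np

# Toy stand-in for the lab's network: 2 inputs -> 3 sigmoid units
# -> 1 sigmoid unit. Weight values are random placeholders.
rng = np.random.default_rng(0)
W1, b1 = rng.standard_normal((2, 3)), rng.standard_normal(3)
W2, b2 = rng.standard_normal((3, 1)), rng.standard_normal(1)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def my_predict(X):
    # Plain NumPy forward pass through both layers.
    a1 = sigmoid(X @ W1 + b1)
    return sigmoid(a1 @ W2 + b2)

X_test = rng.standard_normal((400, 2))

# timeit.timeit is the script equivalent of the %timeit magic:
# average seconds per call over 100 runs.
t = timeit.timeit(lambda: my_predict(X_test), number=100) / 100
print(f"my_predict: {t * 1e3:.3f} ms per call")
```

The same timing wrapper around model.predict gives the TF-side number for comparison.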

Found this discussion on Stack Overflow: "TF.Keras model.predict is slower than straight Numpy?"

It seems the problem appears for some datasets and models because of TF's "eager mode".

Thank you all - I thought there was some more obvious reason for it.

Here is some information on eager mode:
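One practical consequence: in TF 2, model.predict wraps the input in a tf.data pipeline and runs a traced graph, which adds a lot of per-call overhead for small inputs, while calling the model object directly runs eagerly with much less setup cost. A small sketch (the 2 → 3 → 1 architecture is assumed to match the lab, the weights here are just the random initial ones):

```python
import numpy as np
import tensorflow as tf

# Assumed stand-in for the lab's model: 2 -> 3 -> 1, all sigmoid.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(2,)),
    tf.keras.layers.Dense(3, activation="sigmoid"),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])

X = np.random.randn(8, 2).astype(np.float32)

# model.predict() builds a tf.data pipeline and dispatches a traced
# graph -- convenient for large batches, heavy per call.
p1 = model.predict(X, verbose=0)

# Calling the model directly skips that machinery, which the Keras
# docs recommend for small arrays that fit in one batch.
p2 = model(X, training=False).numpy()

print(np.allclose(p1, p2, atol=1e-5))
```

Both paths compute the same predictions; only the dispatch overhead differs, which is why the gap shows up in per-call timings like the ones above.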


@TMosh thanks a lot! Now everything seems to be clear.

Also, I think it shouldn't be a big deal: we can train a model in TF, then build the same model in NumPy with the weights from TF and use it for prediction (at least for feed-forward networks - I haven't gotten to more complex ones yet).
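That is essentially what the NumPy lab does with its pre-computed weights. A sketch of the idea, assuming the lab's 2 → 3 → 1 sigmoid architecture (here the weights are just the untrained initial ones, but the mechanics are identical after model.fit):

```python
import numpy as np
import tensorflow as tf

# Assumed architecture matching the lab: 2 -> 3 -> 1, all sigmoid.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(2,)),
    tf.keras.layers.Dense(3, activation="sigmoid"),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])

# Pull the layer weights out of Keras; after training these would
# be the learned parameters.
W1, b1, W2, b2 = model.get_weights()

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def my_predict(X):
    # Same forward pass as the Keras model, in plain NumPy.
    a1 = sigmoid(X @ W1 + b1)
    return sigmoid(a1 @ W2 + b2)

X = np.random.randn(8, 2).astype(np.float32)

# The NumPy reimplementation matches model.predict to float precision.
print(np.allclose(my_predict(X), model.predict(X, verbose=0), atol=1e-4))
```

For deeper feed-forward stacks the same pattern applies: get_weights() returns the kernel/bias pairs in layer order, and the forward pass is just a chain of matmul-plus-activation steps.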