Just found that, due to the deprecated “preprocessing” module, the Tokenizer and other useful functions no longer work. The developers replaced them with the TextVectorization layer, which can be called only from a model.
After seeing this, I realized I don’t completely understand the concept of TensorFlow. Do I understand correctly that the developers don’t want individual steps to be usable without a model? Why is that? It seems really excessive that when I just want to try out some feature/API, I have to build a model around it. Also, couldn’t any function be defined by the TF developers both as a step in a model and as a potential layer?
The exam expects you to upload the fitted model, so the grading system doesn’t care whether you used a deprecated API. That said, the recommended APIs use the GPU effectively and therefore perform better than the deprecated APIs, which run on the CPU.
The staff are aware of the use of deprecated APIs in the notebooks and will update them.
Performance is one of the main reasons for the deprecation of the tf.keras.preprocessing APIs.
For instance, when using image_dataset_from_directory, augmentations are done in layers within the model, which run on the GPU. ImageDataGenerator, by contrast, uses the CPU to perform augmentations before feeding data into the model.
I’ve observed a significant difference in training time when using the newer recommended APIs.
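To make the contrast concrete, here is a minimal sketch of the recommended approach: augmentation expressed as Keras layers at the front of the model, so it executes on the GPU together with the rest of the graph (the layer choices and shapes below are illustrative, not from the exam notebooks):

```python
import tensorflow as tf

# Augmentation as layers inside the model: these run on the same device
# as the rest of the forward pass, and are active only during training.
model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(64, 64, 3)),
    tf.keras.layers.RandomFlip("horizontal"),   # replaces horizontal_flip=True
    tf.keras.layers.RandomRotation(0.1),        # replaces rotation_range
    tf.keras.layers.Rescaling(1.0 / 255),       # replaces rescale=1./255
    tf.keras.layers.Conv2D(8, 3, activation="relu"),
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(10),
])
```

With ImageDataGenerator, the equivalent flips, rotations, and rescaling would all happen on the CPU in the input pipeline, before each batch ever reaches the model.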
In general, you should seek to do data preprocessing as part of your model as much as possible, not via an external data preprocessing pipeline. That’s because external data preprocessing makes your models less portable when it’s time to use them in production. Consider a model that processes text: it uses a specific tokenization algorithm and a specific vocabulary index. When you want to ship your model to a mobile app or a JavaScript app, you will need to recreate the exact same preprocessing setup in the target language. This can get very tricky: any small discrepancy between the original pipeline and the one you recreate has the potential to completely invalidate your model, or at least severely degrade its performance.
It would be much easier to be able to simply export an end-to-end model that already includes preprocessing. The ideal model should expect as input something as close as possible to raw data: an image model should expect RGB pixel values in the [0, 255] range, and a text model should accept strings of utf-8 characters. That way, the consumer of the exported model doesn’t have to know about the preprocessing pipeline.
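As a minimal sketch of such an end-to-end text model (the vocabulary, layer sizes, and training corpus here are made up for illustration): the TextVectorization layer lives inside the model, so the exported model accepts raw strings directly and the consumer never needs to reproduce the tokenization.

```python
import tensorflow as tf

# Build the vocabulary from a (toy) corpus.
vectorizer = tf.keras.layers.TextVectorization(
    max_tokens=1000, output_sequence_length=10)
vectorizer.adapt(["good movie", "bad movie", "great film"])

# The vectorization step is part of the model itself:
# raw utf-8 strings in, prediction out.
model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(1,), dtype=tf.string),
    vectorizer,
    tf.keras.layers.Embedding(1000, 8),
    tf.keras.layers.GlobalAveragePooling1D(),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])

preds = model(tf.constant([["great movie"]]))
```

When this model is saved and shipped, the tokenization algorithm and vocabulary index travel with it, so there is no separate preprocessing pipeline to recreate in the target environment.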