Week 3 - Assignments 1 & 2: Why Do We Input Data into the Models Differently?

Harvey_Wang · November 23, 2022, 9:00am

In Assignment 1 (using YOLO), we load a pretrained model and then input the input data into the model. However, in Assignment 2 (using U-Net), we train a model and then use the predict() method on the input data.

Why is the model applied to the data in two different ways?

kchong37 · November 23, 2022, 4:30pm

In the 1st assignment, we make use of results from a pre-trained YOLO model (already built and tuned by others with high accuracy) and implement non-max suppression (IOU) to select the right boxes to achieve object detection, this is an example of transfer learning.
While in the 2nd assignment, we build our own U-Net, then train it to achieve image segmentation.

Harvey_Wang · November 23, 2022, 5:45pm

So, I’m still confused.

From what I can understand (and I’m not sure if this is right), we only use predict() if we train the model ourselves. And if so, why can’t we use it on a pretrained model? We still imported that model.

kchong37 · November 23, 2022, 6:16pm

Hi there, I think we can use predict() as long as we have the corresponding model parameters. We train the model parameters with the aim of achieving high accuracy, such that we can use the model to perform good predictions.
For the YOLO assignment, the call of the tf.keras.models.load_model() function (see more about the function usage here) loads the pre-trained model parameters into our workspace. Then we can use the predictions (output) from the YOLO model to identify and filter bounding boxes.
For the U-Net assignment, we build the model from scratch and then train it ourselves to get parameters that can achieve high enough accuracy, then perform predictions.
Let me know if you have questions!

Harvey_Wang · November 23, 2022, 6:39pm

So, writing yolo_model.predict(image__data) is just as valid as what we did in the assignment yolo_model(image_data)?
And likewise, unet(image) is just as valid as unet.predict(image)?

kchong37 · November 23, 2022, 7:03pm

There is a difference between __call__() (i.e. model(inputs)) and predict().
predict() is designed for batch processing of large numbers of inputs, while the __call__() function can give faster execution for small numbers of inputs, please see details here and in this FAQ entry.

Harvey_Wang · November 23, 2022, 7:18pm

Thank you so much! This is what I was confused about. So, there are subtle differences between the methods functions.

Topic		Replies	Views
Transfer Learning of U-Net Convolutional Neural Networks coursera-platform	4	712	April 27, 2022
Keras.predict giving different results depending on how used Convolutional Neural Networks coursera-platform	4	835	January 21, 2022
Course 4 - Week 3 - MobileNet Convolutional Neural Networks coursera-platform	2	587	June 17, 2021
Question on Train/Test Split and Data Handling in Pre-Trained Models for NMT Assignment Deep Learning Resources ai-discussions , project , coursera-platform	3	22	December 4, 2024
Same Dataset for Classification and Object Detection? Convolutional Neural Networks coursera-platform	2	529	March 13, 2023

Week 3 - Assignments 1 & 2: Why Do We Input Data into the Models Differently?

Related topics