YOLO assignment exercise 1 shape question

Peeta_Li · December 22, 2021, 4:48am

Hello,

For exercise 1 in the YOLO assignment, the tensor box_class_scores is of the shape (19,19,5) and the boolean mask filtering_mask is also of the shape (19,19,5). However, after doing scores = tf.boolean_mask(box_class_scores, filtering_mask), the output shape is (1789, ) and I’m not sure where this 1789 came from. I was expecting the shape to be (1805,) where 1805 = 19*19*5.
I must missed something here and thank you in advance for helping out !

isaac.casm · December 22, 2021, 12:04pm

Hi Peeta_Li,

tf.boolean_mask extracts the positions in box_class_scores where filtering_mask is True. So, in your case filtering_mask has 1789 Trues and 16 Falses.

Cheers

Peeta_Li · December 23, 2021, 4:59am

Thank you, that totally makes sense!

Kaiju · December 24, 2021, 8:21pm

thank you for the clarification. This means that the output is position where the mask value is true. then it unrolled into (1789,). Is this flattening the consequence of leaving the axis to default of 0?

What was the logic for not somehow keeping the original tensor dimensions?

ai_curious · December 25, 2021, 2:12pm

I don’t love the use of position there. The output does contain the elements of the input that satisfy the filter mask, but position information is not retained. Maybe think of it almost as whitespace removed?

The TF doc describes the output as a tensor populated by entries in [input tensor] corresponding to True values in [mask] . it is not sparse, meaning the False values are dropped entirely.

isaac.casm · December 25, 2021, 2:28pm

What was the logic for not somehow keeping the original tensor dimensions?

The problem is that you do not have the same number of elements as before. Imagine a simple 3x3 tensor, so 9 elements. If you remove one element, there are 8 elements remaining so you cannot get a 3x3 tensor anymore. You may be able to reshape it to 4x2, 2x4, 1x8, 8x1, but it is arguably better to leave that decision to the user.
If you are using tf.boolean_mask the assumption is that you want to extract the elements and therefore you do not care about the shape anymore. If you want to specify the locations of interest and keep the dimension of the original tensor, then a mask would give you that (actually filtering_mask has that information).

Hope it was clear.

Kaiju · December 25, 2021, 2:58pm

Thank you for the explanation.

arpit04 · January 1, 2022, 5:04pm

@isaac.casm @ai_curious
Why are there 1789 Trues and 16 False??

isaac.casm · February 2, 2022, 9:41am

I totally missed this message with the Christmas holidays.
To be absolutely fair, I am not sure, I don’t know where that code comes from. My guess is that it is the class probabilities of the model output. If that is the case, then most likely the model was still being trained and that is the reason for having such a large number of Trues. But this number will change depending on the input image.

ai_curious · February 2, 2022, 10:53pm

At this point in the class exercise the values are just random numbers, so the results depend entirely on the sample distribution specified in the call to the generator. It’s not realistic, so you can ignore the numeric values here; only the shapes are reasonable.

Topic		Replies	Views
Car detection with YOLO: mask operation and dimensions in yolo_filter_boxes function Convolutional Neural Networks	2	570	December 24, 2021
W3A1 YOLO assignment boolean masking Convolutional Neural Networks	7	622	May 10, 2023
Week 3 YOLO Assignment - yolo_filter_boxes confusion Convolutional Neural Networks	1	864	April 26, 2022
Course 4 week 3 Yolo assignment Convolutional Neural Networks	2	565	November 20, 2021
Course 4 week 3 assignment 1: yolo - error in yolo_filter_boxes Convolutional Neural Networks	4	559	September 16, 2022

YOLO assignment exercise 1 shape question

Related topics