Caltech Bird Detector

BrutalCaeser · December 4, 2021, 8:10am

I have multiple doubts:

What is the use of the 3rd bounding_boxes_on_image_array function or the 2nd function?
What is the number N in the dimensions of boxes, [N, 4]?
In read_image_tfds, why do we divide by 127.5 and then subtract 1 from the image?
Why is training_dataset an object but visualization_training_dataset used for length measurements?
The boxes dimension does not signify whether the order is ymin, then xmin, and so on.

jackliu333 · December 6, 2021, 10:04pm

Please find my reply below.

What is the use of the 3rd bounding_boxes_on_image_array function or the 2nd function? - These are utility functions that draw bounding boxes (passed as a 2D numpy array) on an image.
What is the number N in the dimensions of boxes, [N, 4]? - N is the number of bounding boxes passed for the image, with one row of four coordinates per box.
In read_image_tfds, why do we divide by 127.5 and then subtract 1 from the image? - This normalizes the image: dividing by 127.5 maps pixel values from [0, 255] to [0, 2], and subtracting 1 shifts them to [-1, 1].
Why is training_dataset an object but visualization_training_dataset used for length measurements? - Could you clarify the question? I didn’t find the function visualization_training_dataset.
The boxes dimension does not signify whether the order is ymin, then xmin, and so on. - Could you rephrase the question?
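
To make the normalization point concrete, here is a minimal sketch of the arithmetic in plain NumPy. The function names normalize_image and denormalize_image are hypothetical and are not taken from the assignment; the only thing assumed from the thread is the divide-by-127.5-then-subtract-1 step in read_image_tfds.

```python
import numpy as np


def normalize_image(image):
    """Scale pixel values from [0, 255] to [-1, 1] (hypothetical helper)."""
    image = np.asarray(image, dtype=np.float32)
    # Dividing by 127.5 maps [0, 255] to [0, 2]; subtracting 1 centers it on 0.
    return image / 127.5 - 1.0


def denormalize_image(image):
    """Invert normalize_image, mapping [-1, 1] back to [0, 255]."""
    return (np.asarray(image, dtype=np.float32) + 1.0) * 127.5


# Darkest, mid-gray, and brightest pixel values.
pixels = np.array([0.0, 127.5, 255.0], dtype=np.float32)
normalized = normalize_image(pixels)    # values -1, 0 and 1
restored = denormalize_image(normalized)  # back to 0, 127.5 and 255
print(normalized, restored)
```

Centering inputs around zero like this is a common convention for pretrained image models, and the inverse step is useful when you want to display a normalized image again.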
