I haven’t yet had time to read the full article that Reinoud has given us, but it looks excellent and gives detailed explanations of the differences between the two approaches.
But the short answer is that in the object detection case, you’re not trying to reconstruct a labelled version of the input image, right? That is what requires the skip connections in the Semantic Segmentation case. In Object Detection and Localization the location information is handled differently: it is expressed by the bounding boxes that are part of the output. That’s a fundamentally different approach. YOLO is very deep water. If you want to understand it in more detail, you should go through some of the excellent explanations put together by ai_curious on these forums. Start with this post and the earlier ones that it links to.
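To make that contrast concrete, here is a quick sketch in plain NumPy with made-up sizes (a 416 x 416 input, a 13 x 13 grid, 3 boxes per cell, 3 classes; none of those numbers come from the posts above) showing how differently the two kinds of output are shaped:

```python
import numpy as np

H, W, NUM_CLASSES = 416, 416, 3

# Semantic segmentation: the network predicts a label for every pixel, so the
# output has the same spatial resolution as the input. Recovering that full
# resolution is what the U-Net style skip connections help with.
segmentation_output = np.zeros((H, W, NUM_CLASSES))       # per-pixel class scores

# Object detection (YOLO-style): the image is divided into a coarse grid and
# each cell predicts a fixed number of boxes. Location is carried by the
# (x, y, w, h) numbers in each prediction, not by pixel positions.
GRID, BOXES_PER_CELL = 13, 3
# each box prediction: [x, y, w, h, objectness, class scores...]
detection_output = np.zeros((GRID, GRID, BOXES_PER_CELL, 5 + NUM_CLASSES))

print(segmentation_output.shape)   # (416, 416, 3)  -> dense, pixel-level
print(detection_output.shape)      # (13, 13, 3, 8) -> sparse, box-level
```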
Who says it isn’t? Joseph Redmon, for example, writes …
Our new network is a hybrid approach between the network used in YOLOv2, Darknet-19, and that newfangled residual network stuff. Our network uses successive 3 × 3 and 1 × 1 convolutional layers but now has some shortcut connections as well…
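Just to illustrate what that looks like, here is a rough sketch of a 1 x 1 / 3 x 3 block with a shortcut connection, written with Keras layers. The filter counts and input size are made up, and this is not the actual Darknet-53 code, just the general pattern the quote describes:

```python
import tensorflow as tf
from tensorflow.keras import layers

def residual_block(x, filters):
    """Sketch of a Darknet-style block: 1x1 then 3x3 convolution,
    added back onto the input (the 'shortcut connection')."""
    shortcut = x
    y = layers.Conv2D(filters // 2, 1, padding="same", use_bias=False)(x)
    y = layers.BatchNormalization()(y)
    y = layers.LeakyReLU(0.1)(y)
    y = layers.Conv2D(filters, 3, padding="same", use_bias=False)(y)
    y = layers.BatchNormalization()(y)
    y = layers.LeakyReLU(0.1)(y)
    return layers.Add()([shortcut, y])

inputs = tf.keras.Input(shape=(256, 256, 64))
outputs = residual_block(inputs, 64)
model = tf.keras.Model(inputs, outputs)
model.summary()
```

The Add() at the end is the shortcut: the block learns a residual on top of its input, which is what lets a network like this go much deeper than Darknet-19 without the training problems that plain stacked convolutions run into.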
One observation I’d make is that semantic segmentation only supports a single classification per pixel, whereas YOLO object detection allows multiple objects of different shapes to overlap: the classic example is a person standing in front of a car. Depending on your use case, that may or may not be significant.
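Here is a toy illustration of that point, with made-up coordinates and class ids: in a segmentation mask the person’s pixels overwrite the car’s, while in box-based detection the two objects simply overlap and both are still reported.

```python
import numpy as np

# Toy example (made-up coordinates): a person standing in front of a car.
PERSON, CAR = 1, 2

# Segmentation mask: each pixel holds exactly one class id, so where the
# person occludes the car, the car disappears from the mask.
mask = np.zeros((100, 100), dtype=int)
mask[20:90, 30:80] = CAR          # car region
mask[10:95, 45:60] = PERSON       # person overwrites the car pixels it covers

# Box-based detection: the two objects coexist, their boxes just overlap.
boxes = [
    {"cls": "car",    "box": (30, 20, 80, 90)},   # (x1, y1, x2, y2)
    {"cls": "person", "box": (45, 10, 60, 95)},
]

print(np.unique(mask[20:90, 45:60]))  # only PERSON survives in the overlap
print(boxes)                          # both objects are still reported
```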