U-Net Architecture "up-conv" operation

It’s also worth noting that it is completely common and “plain vanilla” for algorithms to get improved over time from what was in the original paper. As things get deployed and used at scale, people come up with improvements. E.g., how many versions of YOLO have there been now since the original 2015 paper?

Another interesting and nice clear example of such an improvement is “inverted” dropout. Go back and read the original Srivastava, Hinton, Krizhevsky, Sutskever and Salakhutdinov paper that introduced dropout and notice that they hadn’t thought of the “inverted” idea yet, so they had to downscale the weights at inference time instead. The way it is done now, where we multiply by 1/keep_prob at training time so the expected values already match, is just so much cleaner and simpler. I’ll bet Hinton does a big Homer Simpson “D’oh!” every time he remembers that oversight. :laughing: Here’s a thread that discusses this point in more detail.
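Here’s a minimal NumPy sketch of the difference, just to make it concrete (the array shape, keep_prob value, and variable names are purely illustrative, not from any particular implementation):

```python
import numpy as np

rng = np.random.default_rng(0)
keep_prob = 0.8
a = rng.standard_normal((4, 5))      # activations of some hidden layer

mask = rng.random(a.shape) < keep_prob

# Original dropout: drop units at training time, then downscale the
# activations (equivalently, the weights) by keep_prob at inference time.
a_train_original = a * mask           # training forward pass
a_test_original = a * keep_prob       # inference needs this extra rescaling

# Inverted dropout: rescale by 1/keep_prob at training time, so the
# expected value already matches and inference needs no special handling.
a_train_inverted = (a * mask) / keep_prob   # training forward pass
a_test_inverted = a                          # inference: use activations as-is
```

Both versions give the same expected activations at test time; inverted dropout just moves the bookkeeping into the training pass, which keeps the inference path completely clean.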
