Hi all,

I didn’t understand the skip approach from ResNets.

Given that:

If I skip the calculation of Layer [l+1], then a[l+1] isn’t calculated, but a[l+2] depends on this value.

What did I miss in this approach?

Thanks in advance.

Hi all,

I didn’t understand the skip approach from ResNets.

Given that:

If I skip the calculation of Layer [l+1], then a[l+1] isn’t calculated, but a[l+2] depends on this value.

What did I miss in this approach?

Thanks in advance.

Hi,

no you do not skip layers, but rather you can understand the formula as adding an extra term which come from 2 layers before.

For example:

a[1] = g( W[1]a[0] + b[1])

a[2] = g(W[2]a[1] + b[2] + **a[0]**)

a[3] = g(W[3]a[2] + b[3] + **a[1]**)

a[4] = g(W[4]a[3] + b[4] + **a[2]**)

etc.

all the bold terms are the “residues”, namely extras, if you remove them then you have “standard FClayers”