Why do we pass X from the previous layer into BatchNormalization layer like this X = BatchNormalization(axis = 3)(X). can’t we directly pass X as a argument? Also does X = BatchNormalization(axis = 3) return a function to which we pass X??
Maybe take a look at this thread and let us know what you discover ?