Please try the following :
- Define a function
get_sigmoid(pixel_value, weight)
where you return the sigmoid for thepixel_value * weight
value. - Create a bunch of weights using
np.linspace
between 0 and 1e-5. - See how sigmoid varies when pixel value is 255 and 1.
This should tell you which scale of values will help speed up learning keeping backpropagation in mind.
Have you seen this?