Please try the following:
- Define a function `get_sigmoid(pixel_value, weight)` that returns the sigmoid of `pixel_value * weight`.
- Create a bunch of weights using `np.linspace` between 0 and 1e-5.
- See how the sigmoid varies when the pixel value is 255 and when it is 1.
This should tell you which scale of input values helps speed up learning, keeping backpropagation in mind.
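If it helps, here is a minimal sketch of that experiment. The number of weights (5) and the `spreads` dictionary are my own choices for illustration, not something the exercise prescribes; adjust them as you like.

```python
import numpy as np


def get_sigmoid(pixel_value, weight):
    """Return the sigmoid of pixel_value * weight."""
    return 1.0 / (1.0 + np.exp(-pixel_value * weight))


# Assumption: 5 evenly spaced weights between 0 and 1e-5 (the count is arbitrary).
weights = np.linspace(0, 1e-5, 5)

# For each pixel value, record how far the sigmoid moves across the weight range.
spreads = {}
for pixel_value in (255, 1):
    outputs = get_sigmoid(pixel_value, weights)
    spreads[pixel_value] = outputs.max() - outputs.min()
    print(f"pixel_value={pixel_value}")
    for w, s in zip(weights, outputs):
        print(f"  weight={w:.2e}  sigmoid={s:.8f}")
    print(f"  spread across weights: {spreads[pixel_value]:.3e}")
```

Compare the two spreads and think about what each one implies for the size of the gradient that flows back to the weight.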
Have you seen this?