Normalization(axis=-1)

Utsav_Sharma1 · January 18, 2023, 5:22pm

I read the post by AshutoshSahu and also the answers given for that but still i am confused. In that post i followed the answer given by raymond but didn’t get the last part.

ary = np.array([
    [ [1, 2, 3,], [2, 3, 4,], ],
    [ [5, 4, 3,], [3, 2, 1,], ],
])

suppose we have a numpy array of shape (2,2,3). if we put axis=-1 then is it taking mean and variance of [1,2,3],[2,3,4],[5,4,3],[3,2,1] separately and then calculating the final values?

Juan_Olano · January 18, 2023, 5:38pm

This array has 3 axis: 0, 1, and 2. So axis=-1 would be doing the operation over the axis 2. If we do a sum over the axis 2 (or axis = -1) then we would convert the resulting array into a 2-dim array with values equal to the operation over the values of 2.

For instance:

ary.sum(axis=-1) = ary.sum(axis=2) =

[[ 6 9]
[12 6]]

See how we arrived to a shape=(2,2) from a shape=(2,2,3) because we ‘consolidated’ everything on axis=-1 (the axis with index 2 starting from 0).

Same principle would apply to other operations like mean.

You can do some tests by doing things like:

print(ary.sum(axis=0))
print(ary.sum(axis=1))
print(ary.sum(axis=2))
print(ary.sum(axis=-1))
print(ary.sum(axis=-2))

Running these tests may shed light to your question.

Hope it helps,

Juan

Utsav_Sharma1 · January 18, 2023, 6:38pm

tell me if am right.
the normalization process is basically calculating the mean and variance for each column because each column represent a features and that is why we have given axis = -1 because we want the mean variance values for each feature.

Juan_Olano · January 18, 2023, 8:29pm

Normalization is scaling the input variables so that they have similar ranges of values. We don’t want, for instance, some variables with values under 100 and some other values with values over 100,000. With normalization we, well, normalize these inputs to prevent one variable from dominating the others.

One way to achieve normalization is by subtracting the mean of each variable and dividing by the standard deviation.

rmwkwok · January 19, 2023, 3:20am

Hello @Utsav_Sharma1,

You need to tell us which function you are setting axis=-1 to, and what dataset you are talking about.

Function: I suppose it is tf.keras.layers.Normalization.
Dataset: I suppose the dataset has a shape of (m, n), where m is number of samples, and n is number of features.

Given the above two, your description is correct. You can also find a very similar description by reading the documentation of tf.keras.layers.Normalization, in which it says:

For example, if shape is (None, 5) and axis=1 , the layer will track 5 separate mean and variance values for the last axis.

Cheers,
Raymond

Topic		Replies	Views
What does mean -1 in Normalization? Advanced Learning Algorithms week-module-1	8	442	April 12, 2024
Normalization in keras Advanced Learning Algorithms week-module-1	3	564	December 3, 2022
How does axis=-1 make sense in tf.keras.layers.Normalization? Advanced Learning Algorithms week-module-1	4	1009	December 1, 2022
C2_W1_Lab02_CoffeeRoasting_normalization Advanced Learning Algorithms week-module-1	4	93	April 3, 2025
Understanding error because of axis parameter in Normalization Advanced Learning Algorithms week-module-1	5	669	August 12, 2023

Normalization(axis=-1)

Related topics