Scaling multivariate LSTM time-series forecasting

astriagustius · July 9, 2025, 8:34am

Hello everyone, I have just started learning about multivariate time series forecasting using LSTM. I am confused about the data preprocessing part where we need to scale the data. Do we need to scale all data (input and target) or just the input?

In some cases, there is a distinction between the input scaler (usually named scaler_X) and the target scaler (scaler_Y). Can’t we just use one scaler if all the data needs to be scaled?

TMosh · July 9, 2025, 2:27pm

In general (this is not unique to LSTMs), scaling the features allows the optimizer (which finds the weights that minimize the cost) to work more efficiently.

Scaling the outputs is usually not necessary.

That said, depending on the dataset (in particular the range of magnitudes of the outputs), you might scale the output labels as well.
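
On the scaler_X / scaler_Y question, here is a minimal sketch of why the two are often kept separate. A scaler stores statistics for each column it was fitted on, so a scaler fitted on the targets alone can convert predictions back to original units directly. The `ColumnMinMaxScaler` class below is a hand-rolled stand-in written for illustration (it is not a library API), and the data values are made up.

```python
class ColumnMinMaxScaler:
    """Hand-rolled per-column min-max scaler (illustration only, hypothetical helper)."""

    def fit(self, rows):
        # Remember the minimum and maximum of every column.
        cols = list(zip(*rows))
        self.lo = [min(c) for c in cols]
        self.hi = [max(c) for c in cols]
        return self

    def transform(self, rows):
        # Map each column to [0, 1]; constant columns map to 0.0.
        return [
            [(v - lo) / (hi - lo) if hi > lo else 0.0
             for v, lo, hi in zip(row, self.lo, self.hi)]
            for row in rows
        ]

    def inverse_transform(self, rows):
        # Undo the scaling, column by column.
        return [
            [v * (hi - lo) + lo for v, lo, hi in zip(row, self.lo, self.hi)]
            for row in rows
        ]


# Made-up data: two input features and one target per time step.
X_train = [[10.0, 200.0], [20.0, 400.0], [30.0, 600.0]]
y_train = [[1000.0], [3000.0], [5000.0]]

scaler_X = ColumnMinMaxScaler().fit(X_train)  # fitted on training inputs only
scaler_Y = ColumnMinMaxScaler().fit(y_train)  # fitted on training targets only

X_scaled = scaler_X.transform(X_train)
y_scaled = scaler_Y.transform(y_train)

# Assume the model produced this prediction in scaled units.
pred_scaled = [[0.25]]

# scaler_Y only knows the target column, so it can invert the prediction as-is.
pred = scaler_Y.inverse_transform(pred_scaled)
print(pred)
```

With a single scaler fitted on inputs and targets together, the prediction (one column) would not match the number of columns the scaler was fitted on, so you would have to pad it or pick out the target's statistics by hand before inverting.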

astriagustius · July 15, 2025, 12:41am

Thank you for your answer, but could you explain more about what you mean by “depending on the dataset”?
What kind of dataset requires us to scale the output labels?

TMosh · July 15, 2025, 4:52am

Only if the output values vary over a large range, or if they are numerically very large.
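
As a rough illustration of why magnitude matters, the sketch below compares the mean squared error at the start of training for raw and min-max scaled targets. The target values are made up, and the all-zero initial prediction is an assumption standing in for an untrained network whose outputs are near zero.

```python
def mse(y_true, y_pred):
    """Mean squared error between two equal-length lists."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


# Made-up targets that are numerically large.
y_raw = [20000.0, 40000.0, 60000.0]

# Min-max scale the targets to [0, 1].
lo, hi = min(y_raw), max(y_raw)
y_scaled = [(v - lo) / (hi - lo) for v in y_raw]

# Assumption: an untrained model predicts roughly zero for every sample.
initial_pred = [0.0] * len(y_raw)

loss_raw = mse(y_raw, initial_pred)
loss_scaled = mse(y_scaled, initial_pred)

print(f"initial loss, raw targets:    {loss_raw:.3g}")
print(f"initial loss, scaled targets: {loss_scaled:.3g}")
```

The raw-target loss is many orders of magnitude larger than the scaled one, and the gradients grow with it, which tends to force a much smaller learning rate or make training unstable. When the targets are already small and in a narrow range, the two losses are comparable and scaling the outputs buys little.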

astriagustius · July 15, 2025, 8:15am

Okay, I understand.
Thank you very much, @TMosh.
