LSTM for oil, gas, and water production

Samuel_Chazy · November 22, 2023, 11:34am

Looking at the distribution plots, you can see that many features are heavily skewed. The box plots also show a lot of outliers.

All of this heavily affects both the machine learning and the deep learning models. Fixing it takes a lot of work, which I can’t possibly detail here.

I will give you some suggestions. Try to normalize the distributions wherever possible. Some distributions have a spike of 0 values followed by a normal distribution, some are bimodal, etc., and you need to deal with each of these.

Remove the outliers (use quantiles), which are definitely affecting the model performance and probably driving the bias in the outcome.
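As a rough illustration of the two suggestions above, here is a minimal NumPy sketch. The data is synthetic, the 1%/99% cut-offs are arbitrary, and the helper names `clip_by_quantiles` and `log_transform` are made up for this example; `log1p` is just one common way to reduce right skew while keeping zeros at zero, not necessarily the right choice for every feature.

```python
import numpy as np

def clip_by_quantiles(x, lower=0.01, upper=0.99):
    """Keep only values inside the [lower, upper] quantile range.

    Returns the filtered values and the boolean mask that selected them,
    so the same mask can be applied to other columns of the same rows.
    """
    lo, hi = np.quantile(x, [lower, upper])
    mask = (x >= lo) & (x <= hi)
    return x[mask], mask

def log_transform(x):
    """log(1 + x): compresses a long right tail and maps 0 to 0."""
    return np.log1p(x)

# Synthetic, right-skewed "rate" feature with a block of zeros
# (purely illustrative, not the data from this thread).
rng = np.random.default_rng(0)
rates = rng.lognormal(mean=3.0, sigma=1.0, size=1000)
rates[:50] = 0.0

kept, mask = clip_by_quantiles(rates)
transformed = log_transform(kept)

print(f"rows kept: {kept.size} of {rates.size}")
print(f"max before: {rates.max():.1f}, max after filtering: {kept.max():.1f}")
```

Note that dropping rows leaves gaps in a time series, so for sequence models you may prefer to cap the values at the quantile limits (for example with `np.clip`) instead of removing them.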

I hope this helps.

Regards,
Samuel

saifkhanengr · November 22, 2023, 1:43pm

Thank you for your insights, Samuel…

saifkhanengr · November 27, 2023, 3:51pm

Hello Everyone! I hope you all are doing well.

After spending more time on hyperparameter tuning, this is what I got:

One Conv1D layer with filters=284 and kernel_size=3, one LSTM layer with 284 units, relu activation, dropout of 0.5, and one dense layer with sigmoid activation. The loss is MeanSquaredLogarithmicError rather than mse.

However, my intended professor said that this is still the wrong approach to using AI in the petroleum industry. He said that for commercial purposes you must include spatiotemporal data, that is, both spatial and temporal data and not only one of them. And he is absolutely right.
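To show why the choice of loss matters here, below is a small NumPy sketch comparing mse with mean squared logarithmic error. It uses the usual definition, the mean of `(log(1 + y_true) - log(1 + y_pred))**2`, and assumes non-negative targets; the numbers are made up for illustration.

```python
import numpy as np

def mse(y_true, y_pred):
    """Mean squared error: penalizes absolute differences."""
    return float(np.mean((y_true - y_pred) ** 2))

def msle(y_true, y_pred):
    """Mean squared logarithmic error: penalizes relative differences.

    Assumes y_true and y_pred are non-negative.
    """
    return float(np.mean((np.log1p(y_true) - np.log1p(y_pred)) ** 2))

# Two predictions that are each off by 10 units, at very different scales.
y_true = np.array([10.0, 1000.0])
y_pred = np.array([20.0, 1010.0])

per_sample_sq = (y_true - y_pred) ** 2
per_sample_sq_log = (np.log1p(y_true) - np.log1p(y_pred)) ** 2

print("squared error per sample:    ", per_sample_sq)
print("squared log error per sample:", per_sample_sq_log)
print("mse :", mse(y_true, y_pred))
print("msle:", msle(y_true, y_pred))
```

Both samples contribute equally to mse, while the squared log error is much larger for the small-valued sample, so this loss is less dominated by the largest values in a skewed target.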

Christian_Simonis · December 3, 2023, 5:31pm

Thanks for posting, @saifkhanengr!

Looks like a really exciting project! Congrats!

How long is the prediction horizon (in your plot)?
I have always had good experience with adding a (naive) prediction (e.g. derived from a linear regression model, or even a constant prediction) as a benchmark, to evaluate how well the model of interest performs over that prediction horizon in comparison; see also this thread.
Did you try that already?
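For reference, a minimal sketch of the two naive benchmarks mentioned above (a constant prediction and a linear regression on time), using only NumPy. The series is synthetic and the function names are made up for this example; the idea is simply that any model of interest should beat these scores on the same holdout window.

```python
import numpy as np

def last_value_forecast(train, horizon):
    """Constant prediction: repeat the last observed value."""
    return np.full(horizon, train[-1], dtype=float)

def linear_trend_forecast(train, horizon):
    """Fit a straight line to the training window and extrapolate it."""
    t = np.arange(len(train))
    slope, intercept = np.polyfit(t, train, deg=1)
    t_future = np.arange(len(train), len(train) + horizon)
    return slope * t_future + intercept

def mae(y_true, y_pred):
    """Mean absolute error over the prediction horizon."""
    return float(np.mean(np.abs(y_true - y_pred)))

# Synthetic monthly series with an exponential decline (illustrative only).
months = np.arange(60)
series = 1000.0 * np.exp(-0.03 * months)

# Hold out the last 14 steps as the prediction horizon.
train, test = series[:-14], series[-14:]
horizon = len(test)

baseline_scores = {
    "last_value": mae(test, last_value_forecast(train, horizon)),
    "linear_trend": mae(test, linear_trend_forecast(train, horizon)),
}

for name, score in baseline_scores.items():
    print(f"{name}: MAE = {score:.2f}")
```

Reporting the model's error next to these numbers makes it easy to see how much of the performance comes from the model itself rather than from the trend in the data.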

Keep it up, @saifkhanengr!

Best regards
Christian

saifkhanengr · December 4, 2023, 11:04am

Thank you, Christian!

It’s 14 months for the test set.

Yes, I set up a simple RNN as a benchmark (baseline model), then trained many different models and compared them with that baseline. The best one is LSTNet (Long- and Short-term Time-series Network), a combination of LSTM and convolution.
