Implementation of CNN-LSTM according to a research paper

owaisniaz · June 13, 2024, 10:02am

Hi guys, I recently completed the course and have been trying to replicate the results of an open-access research paper (Air-pollution prediction in smart city, deep learning approach), but I have not been able to match them and I am not sure what I am doing wrong.

I am confused about the following points:

How and when did they impute the WD column? They mention spline interpolation before encoding the WD values, but WD also has missing values. Apart from that, I am getting exactly the correlation values for all the other features shown in Figures 8 and 10. (Right now I encode first and then impute with spline interpolation.)
I am not sure which features were dropped from the total of 23, since Figure 12 mentions 20 features. (I dropped NO2, SO2, and O3.)
I created the train and test sets as described in the paper (80/20), but I am not sure what was used for validation. There is an EarlyStopping callback with min_delta and patience, and I think a validation set is needed for that to work. (Currently I am passing test_ds as validation_data.)
I don't know whether they shuffled the data. I am getting better results with shuffling.
They mention a 1-day lag, so I created tensors of shape [samples, 24 hours (for the 1-day lag), 20 features (as in Figure 12)] with a batch size of 32, using the build dataset function.
But after all this, I am NOT getting 0.989 R2, 6.x MAE, or 12.x RMSE on the test or train set. Instead, my evaluation gives 0.8 or 0.9 R2, 10.x or 9.x MAE, and 18.x or 16.x RMSE.
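
For reference, the split and windowing described above can be sketched roughly as follows. This is only a minimal illustration on synthetic data, not the paper's pipeline: the helper names `chronological_split` and `make_windows`, the 10% validation share carved out of the training portion, the target column index, and the standardization step are all assumptions made for the example. It keeps the rows in time order and holds out a separate validation slice, so the test set is not needed as validation_data for EarlyStopping.

```python
import numpy as np

def chronological_split(data, train_frac=0.8, val_frac=0.1):
    """Split rows in time order into train, validation, and test.

    val_frac is the share of the 80% training portion held out for
    validation (an assumption; the paper's choice is unknown).
    """
    n_train_full = int(len(data) * train_frac)
    n_val = int(n_train_full * val_frac)
    train = data[: n_train_full - n_val]
    val = data[n_train_full - n_val : n_train_full]
    test = data[n_train_full:]
    return train, val, test

def make_windows(data, target_col, window=24):
    """Build (X, y): X[i] is `window` consecutive hourly rows,
    y[i] is the target value in the row right after that window."""
    n = len(data) - window
    X = np.stack([data[i : i + window] for i in range(n)])
    y = data[window:, target_col]
    return X.astype(np.float32), y.astype(np.float32)

# Synthetic stand-in for the real table: 1000 hourly rows, 20 features.
rng = np.random.default_rng(0)
data = rng.normal(size=(1000, 20))

train, val, test = chronological_split(data)

# Scale with statistics from the training rows only, to avoid leaking
# information from the validation and test periods.
mean, std = train.mean(axis=0), train.std(axis=0)
train, val, test = [(part - mean) / std for part in (train, val, test)]

# target_col=0 is a hypothetical position for the pollutant column.
X_train, y_train = make_windows(train, target_col=0)
X_val, y_val = make_windows(val, target_col=0)
X_test, y_test = make_windows(test, target_col=0)

print(X_train.shape, X_val.shape, X_test.shape)
```

Windows are built after the split so that no window spans two subsets. If shuffling is wanted, shuffling the finished training windows (rather than the raw rows) keeps each 24-hour window intact.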

This is my first implementation and I am using the code from the lectures. I have tried hard to replicate the results reported in the paper but could not. If you have any ideas, please help.

balaji.ambresh · June 16, 2024, 4:47am

Please share the following details so that someone with the bandwidth can help you with this non-course-related issue:

Link to the code repository
Link to paper as pdf with public access
Usually, those who author a paper share their implementation. Is there any information you can share about this?
