Question regarding SHAP values

Hi! I have a question regarding SHAP values. I trained a neural network that predicts a score (regression) and I want to derive the variable importance using SHAP values. My model was trained on scaled (normalized) input features. Now my question is: when I calculate the SHAP values, do my input features also have to be normalized, or is it allowed to use the original inputs? I tried both approaches, once with normalized inputs and once with the original inputs. Interestingly, from a logical and practical standpoint, the importances make much more sense when I use the non-normalized variables.

The code I used is:
explainer = shap.Explainer(model.predict, X_t_scaled)
shap_values = explainer(X_t_scaled)

Looking forward to any advice! Thanks and best regards!

model.predict expects scaled data, because that is what the network was trained on, so please use X_t_scaled when computing the SHAP values.
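
For illustration, a minimal sketch of that setup, assuming scaler is the fitted sklearn scaler from training and X_t holds the original (unscaled) features (both names are my assumptions, not from your post):

import pandas as pd
import shap

# Apply the *same* fitted scaler that was used for training, then wrap the
# result in a DataFrame so the original column names are preserved.
X_t_scaled = pd.DataFrame(scaler.transform(X_t), columns=X_t.columns)

# model.predict consumes scaled data, so both the background data and the
# data being explained are passed in scaled form.
explainer = shap.Explainer(model.predict, X_t_scaled)
shap_values = explainer(X_t_scaled)

Keeping the original column names makes the plots readable even though the feature axis is in scaled units; the SHAP values themselves are in the units of the predicted score either way.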

See this example as well.

For deep learning models, you can use shap.DeepExplainer like this:

explainer = shap.DeepExplainer(model, X_train_scaled)
shap_values = explainer.shap_values(X_test_scaled)
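
A practical note, sketched under the assumption that X_train_scaled and X_test_scaled are NumPy arrays: DeepExplainer takes the model object itself (not model.predict) plus background data, and a small random background sample is usually sufficient and much faster than passing the full training set.

import numpy as np
import shap

# Draw a modest background sample (here ~100 rows) for DeepExplainer.
rng = np.random.default_rng(0)
idx = rng.choice(len(X_train_scaled), size=100, replace=False)
background = X_train_scaled[idx]

explainer = shap.DeepExplainer(model, background)
shap_values = explainer.shap_values(X_test_scaled)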

Thanks for the help! When using your provided code I get the following warning:

UserWarning: Your TensorFlow version is newer than 2.4.0 and so graph support has been removed in eager mode and some static graphs may not be supported. See PR #1483 for discussion.

Is there a solution to that?

Did you see this?

Yes, I tried it, but unfortunately it only led to further error messages. Are there any other solutions to this problem?

Have you tried using a TensorFlow version < 2.4?

Yes, that works, but somehow when I try to plot it, the plot is "empty": it says 0 features.

For plotting I used:

shap.summary_plot(shap_values)

The second parameter of shap.summary_plot is features, which should be the data for which you computed the SHAP values; passing it gives the plot the feature values and names.
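
Untested against your exact setup, but roughly like this, assuming shap_values came from the DeepExplainer snippet above and X_test_scaled is the data you explained:

import shap

# Older DeepExplainer versions return a list with one array per model
# output; for a single-output regression take the first element.
vals = shap_values[0] if isinstance(shap_values, list) else shap_values

# Pass the explained data as the second argument so the plot has the
# feature values (for coloring) and the feature names; if X_test_scaled is
# a plain array, also pass feature_names=<your column names>.
shap.summary_plot(vals, X_test_scaled)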