This repository has been archived by the owner on Jul 17, 2023. It is now read-only.

Interpretability of Models #18

Open
mtanco opened this issue Mar 23, 2021 · 2 comments
Labels
area/ml Machine learning related issues type/design

Comments

@mtanco

mtanco commented Mar 23, 2021

For H2O-3, this code snippet may be helpful. For a specific row in the dataset, it builds a table showing how the model's prediction changes as different values are substituted for a feature, which is important for explainability. The snippet creates partial dependence plots for the feature with the largest positive contribution and the feature with the largest negative contribution for that row:

import h2o
from h2o.automl import H2OAutoML
h2o.init()

# Load data into H2O
df = h2o.import_file('https://h2o-internal-release.s3-us-west-2.amazonaws.com/data/Splunk/churn.csv')
y = 'Churn?'
x = df.columns
x.remove(y)

# Build models
aml = H2OAutoML(max_models = 2, seed = 1)
aml.train(x = x, y = y, training_frame = df)

# Save the best model
model = aml.leader

# Get how much each feature contributed for each person
pred_contribs = model.predict_contributions(df).drop('BiasTerm').as_data_frame()

# Row index of the phone number (customer) to explain
row_id = 77


# Columns that are important for this user
min_contrib = pred_contribs.idxmin(axis=1)[row_id]
max_contrib = pred_contribs.idxmax(axis=1)[row_id]


min_pdp = model.partial_plot(
    df,
    cols=[min_contrib],
    plot=False,  # return the data table only; set True to also render the plot
    nbins=20 if not df[min_contrib].isfactor()[0] else 1 + df[min_contrib].nlevels()[0],
    row_index=row_id
)
display(min_pdp)


max_pdp = model.partial_plot(
    df,
    cols=[max_contrib],
    plot=False,  # return the data table only; set True to also render the plot
    nbins=20 if not df[max_contrib].isfactor()[0] else 1 + df[max_contrib].nlevels()[0],
    row_index=row_id
)
display(max_pdp)
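The per-row feature selection above (the `idxmin`/`idxmax` over the contribution table) can be illustrated without an H2O cluster. A minimal pure-Python sketch, assuming the SHAP contributions have already been exported as a plain dict of column → per-row values (the column names and numbers below are made up for illustration):

```python
# Pick the top positive and negative SHAP contributions for one row,
# given contributions exported as {column: [value per row]}.

def top_contributions(contribs, row_id):
    """Return (most_negative_column, most_positive_column) for a row."""
    row = {col: values[row_id] for col, values in contribs.items()}
    min_col = min(row, key=row.get)  # pushes the prediction down the most
    max_col = max(row, key=row.get)  # pushes the prediction up the most
    return min_col, max_col

# Hypothetical contributions for three rows and three features
contribs = {
    'Day Mins':  [0.8, -0.1, 0.3],
    'Intl Plan': [-0.5, 0.6, -0.2],
    'CustServ':  [0.1, 0.2, 0.9],
}

print(top_contributions(contribs, 0))  # ('Intl Plan', 'Day Mins')
```

These two column names are then exactly what gets passed to `cols=` in the partial dependence calls above.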
@mtanco mtanco added the type/feature Feature request label Mar 23, 2021
@geomodular geomodular added area/ml Machine learning related issues and removed type/feature Feature request labels Apr 28, 2021
@vopani
Contributor

vopani commented Jun 14, 2021

This example: https://wave.h2o.ai/docs/examples/ml-h2o-shap shows how to get the SHAP values from a WaveML model. I think that is good enough for developers to build custom downstream plots/cards.

@geomodular Is there anything more we want to accomplish with WaveML regarding this?

@geomodular
Collaborator

The ideal goal would be to explain the model through the Wave ML interface without touching the .model param, i.e. m.explain(). The same should be doable with a DAI model as well.

That's the general idea; we can bend it as needed.
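One hypothetical shape for such an interface (the class, method names, and backend protocol here are assumptions for illustration, not the actual Wave ML API) would delegate to the underlying engine's contribution method behind a single explain() call:

```python
# Hypothetical sketch of an m.explain() wrapper; names and signatures
# are assumptions, not the actual Wave ML API.

class Model:
    def __init__(self, backend):
        self._backend = backend  # e.g. an H2O-3 or DAI model underneath

    def explain(self, data):
        """Return per-row feature contributions from the backend."""
        return self._backend.predict_contributions(data)

class FakeH2OBackend:
    """Stand-in backend: echoes each feature's value as its contribution."""
    def predict_contributions(self, data):
        return [dict(row) for row in data]

m = Model(FakeH2OBackend())
print(m.explain([{'Day Mins': 0.5}]))  # [{'Day Mins': 0.5}]
```

The point of the wrapper is that callers never reach into .model directly; swapping the fake backend for an H2O-3 or DAI model would not change the calling code.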
