Auto choose most appropriate explainable model #355

gaugup · 2020-12-17T01:36:08Z

This PR chooses the best surrogate model by training several candidate surrogate models and comparing them on accuracy or r2_score.
If training the candidate surrogate models fails for some reason, we train the explainable model passed in by the user.
We compute a replication metric (accuracy for classification, r2_score for regression) that indicates which of the surrogate models is the better fit.
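
As a rough illustration of the selection described above, here is a minimal, dependency-free sketch. It is not the code in this PR: `select_surrogate`, `MajorityClass`, `ThresholdStump`, and `FailingModel` are hypothetical names, the metrics are hand-written stand-ins for accuracy and r2_score, and the fallback is assumed to trigger only when every candidate fails.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of surrogate predictions that match the original model's labels."""
    return sum(1 for t, p in zip(y_true, y_pred) if t == p) / len(y_true)


def r2(y_true, y_pred):
    """Coefficient of determination; the regression counterpart of accuracy here."""
    mean = sum(y_true) / len(y_true)
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    if ss_tot == 0:
        return 1.0 if ss_res == 0 else 0.0
    return 1 - ss_res / ss_tot


class MajorityClass:
    """Toy surrogate: always predicts the most common training label."""
    def fit(self, X, y):
        self.label = Counter(y).most_common(1)[0][0]
        return self

    def predict(self, X):
        return [self.label for _ in X]


class ThresholdStump:
    """Toy surrogate: splits on the mean of feature 0."""
    def fit(self, X, y):
        self.threshold = sum(row[0] for row in X) / len(X)
        left = [lab for row, lab in zip(X, y) if row[0] <= self.threshold]
        right = [lab for row, lab in zip(X, y) if row[0] > self.threshold]
        self.left_label = Counter(left).most_common(1)[0][0] if left else y[0]
        self.right_label = Counter(right).most_common(1)[0][0] if right else y[0]
        return self

    def predict(self, X):
        return [self.left_label if row[0] <= self.threshold else self.right_label
                for row in X]


class FailingModel:
    """Toy surrogate whose training always fails, to exercise the fallback."""
    def fit(self, X, y):
        raise RuntimeError("training failed")


def select_surrogate(candidates, fallback, X, teacher_preds, metric):
    """Train each candidate on the original model's predictions and keep the
    one with the highest replication metric. Assumption: the user-supplied
    fallback is trained only if no candidate trains successfully."""
    best_model, best_score = None, None
    for make_model in candidates:
        try:
            model = make_model().fit(X, teacher_preds)
            score = metric(teacher_preds, model.predict(X))
        except Exception:
            continue  # skip candidates that fail to train or predict
        if best_score is None or score > best_score:
            best_model, best_score = model, score
    if best_model is None:
        best_model = fallback().fit(X, teacher_preds)
        best_score = metric(teacher_preds, best_model.predict(X))
    return best_model, best_score


X = [[0.0], [1.0], [2.0], [3.0]]
teacher_preds = [0, 0, 1, 1]  # labels predicted by the original (black-box) model

best, score = select_surrogate(
    [MajorityClass, ThresholdStump], MajorityClass, X, teacher_preds, accuracy)
fallback_model, fallback_score = select_surrogate(
    [FailingModel], MajorityClass, X, teacher_preds, accuracy)
```

For a regression task the same function would be called with `r2` as the metric and the original model's numeric predictions as `teacher_preds`.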

Signed-off-by: Gaurav Gupta <gaugup@microsoft.com>


imatiach-msft

I think the code itself looks good, but I'm concerned about structure and complexity. Maybe we can discuss these changes more before moving forward with this PR.

imatiach-msft · 2020-12-31T18:21:27Z

python/interpret_community/mimic/mimic_explainer.py

@@ -133,14 +134,19 @@ class MimicExplainer(BlackBoxExplainer):
    :param reset_index: Uses the pandas DataFrame index column as part of the features when training
        the surrogate model.
    :type reset_index: str
+    :param auto_select_explainable_model: Set this to 'True' if you want to use the MimicExplainer with an


I wonder if this should be a separate explainer or function: mimic explainer takes a specific surrogate model, not a list. This also seems like something that complicates the mimic explainer logic. Maybe we can discuss more.

Thinking of other libraries, there is usually a distinction between hyperparameter tuning and training (e.g. both v1 studio and designer have a Train Model module and a separate Tune Hyperparameters or Cross Validate module, in Spark ML the hyperparameter tuner is a separate estimator, and in scikit-learn grid search CV is similarly separate). I feel that, for users who want to do this, we should have a separate function or class instead of complicating the current mimic explainer.
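
One way to picture the separation suggested above is a wrapper that exposes the same `fit`/`predict` interface as a single surrogate and does the comparison internally, so the explainer keeps accepting exactly one model. This is only a sketch under assumptions: `BestOfSurrogates`, `SimpleMimic`, `ConstantMean`, and `LinearFit` are hypothetical names and do not exist in interpret_community.

```python
def r2(y_true, y_pred):
    """Hand-written coefficient of determination used as the replication metric."""
    mean = sum(y_true) / len(y_true)
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    if ss_tot == 0:
        return 1.0 if ss_res == 0 else 0.0
    return 1 - ss_res / ss_tot


class ConstantMean:
    """Toy regressor: always predicts the training mean."""
    def fit(self, X, y):
        self.mean = sum(y) / len(y)
        return self

    def predict(self, X):
        return [self.mean for _ in X]


class LinearFit:
    """Toy regressor: least-squares line on feature 0."""
    def fit(self, X, y):
        xs = [row[0] for row in X]
        mx, my = sum(xs) / len(xs), sum(y) / len(y)
        var = sum((x - mx) ** 2 for x in xs)
        cov = sum((x - mx) * (t - my) for x, t in zip(xs, y))
        self.slope = cov / var if var else 0.0
        self.intercept = my - self.slope * mx
        return self

    def predict(self, X):
        return [self.slope * row[0] + self.intercept for row in X]


class BestOfSurrogates:
    """Hypothetical selector: behaves like one surrogate, tries several inside."""
    def __init__(self, factories, metric):
        self.factories = factories
        self.metric = metric
        self.best_model_ = None
        self.best_score_ = None

    def fit(self, X, y):
        self.best_model_, self.best_score_ = None, None
        for factory in self.factories:
            model = factory().fit(X, y)
            score = self.metric(y, model.predict(X))
            if self.best_score_ is None or score > self.best_score_:
                self.best_model_, self.best_score_ = model, score
        if self.best_model_ is None:
            raise ValueError("no candidate surrogates were supplied")
        return self

    def predict(self, X):
        return self.best_model_.predict(X)


class SimpleMimic:
    """Stand-in for an explainer that accepts exactly one surrogate model."""
    def __init__(self, teacher_predict, surrogate, X):
        # The explainer itself has no list handling or selection logic.
        self.surrogate = surrogate.fit(X, teacher_predict(X))


def teacher_predict(X):
    """Stand-in for the original model being explained."""
    return [2.0 * row[0] + 1.0 for row in X]


X = [[0.0], [1.0], [2.0], [3.0]]
selector = BestOfSurrogates([ConstantMean, LinearFit], r2)
explainer = SimpleMimic(teacher_predict, selector, X)
```

With this shape, users who want automatic selection opt in by passing the wrapper, and users who pass a single surrogate see no change in behavior.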

imatiach-msft · 2020-12-31T18:22:47Z

python/interpret_community/mimic/mimic_explainer.py

@@ -304,14 +313,86 @@ def __init__(self, model, initialization_examples, explainable_model, explainabl
        if isinstance(training_data, DenseData):
            training_data = training_data.data

+        self._original_eval_examples = None


This is quite a bit of logic to put inside mimic explainer. I'm really wondering how we could simplify this, as mimic explainer is already quite complicated.

gaugup added 4 commits December 14, 2020 14:57

[WIP] Choose explainable model MimicExplainer

807aa44

Signed-off-by: Gaurav Gupta <gaugup@microsoft.com>

Add training of multiple explainable models and choosing the best one

767cc5e

Signed-off-by: Gaurav Gupta <gaugup@microsoft.com>

Add unit tests

9f19106

Signed-off-by: Gaurav Gupta <gaugup@microsoft.com>

Merge branch 'master' into gaugup/AutoChooseMostAppropriateExplainabl…

739f95b

…eModel Signed-off-by: Gaurav Gupta <gaugup@microsoft.com>

gaugup requested review from imatiach-msft and gregorybchris December 17, 2020 01:36

Fix flake8 error

641f159

Signed-off-by: Gaurav Gupta <gaugup@microsoft.com>

imatiach-msft requested changes Dec 31, 2020


gaugup mentioned this pull request Jan 7, 2021

Add replication metric computation in MimicExplainer #364

Merged
