initial shell implementation of the raitools package #427
Conversation
        pass

    @staticmethod
    def load(path):
This is a potential problem. If this is going to load a fully reusable RAIAnalyzer, then the saved file is going to have to embed the model.
Maybe we can take in a model serializer object that defines how to serialize the model? Otherwise we could allow the user to set the model on the RAIAnalyzer, but that risks breaking data consistency, since they may set the wrong model, so having a serializer seems safer.
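To make the serializer idea concrete, here is a minimal sketch of what such a hook could look like; ModelSerializer, PickleModelSerializer, and their methods are hypothetical names for illustration and are not part of this PR:

import pickle
from abc import ABC, abstractmethod


class ModelSerializer(ABC):
    """Hypothetical interface a user could supply to control model persistence."""

    @abstractmethod
    def save(self, model, path):
        """Write the model to the given path."""

    @abstractmethod
    def load(self, path):
        """Read the model back from the given path."""


class PickleModelSerializer(ModelSerializer):
    """Default sketch implementation backed by pickle."""

    def save(self, model, path):
        with open(path, 'wb') as f:
            pickle.dump(model, f)

    def load(self, path):
        with open(path, 'rb') as f:
            return pickle.load(f)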
I'll leave this question to a future PR, as I won't implement save/load here; this PR is already quite large for the functionality it contains.
force-pushed from b79c0f7 to 7e01207
        :type classes: list
        """
        self._model = model
        self._initialization_examples = \
I have a feeling that all the managers will be doing something like this... might want to refactor
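For illustration, the shared wiring the managers repeat could move into a common base class; this is only a sketch, and the attribute names are assumptions based on the snippet above:

class BaseManager:
    """Sketch of the setup each manager currently repeats."""

    def __init__(self, model, initialization_examples, evaluation_examples):
        # Keep references only; no copies of the data are made here.
        self._model = model
        self._initialization_examples = initialization_examples
        self._evaluation_examples = evaluation_examples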
doesn't fairness use the whole dataset?
That's actually something which popped up in the latest update to the AzureML API proposal. The add() method for the FairnessManager has acquired a dataset argument which can be initialisation, evaluation, or both (the last of which would concatenate the first two).
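As a sketch of what that could look like (the literal values and the concatenation behaviour are taken from this thread, not from committed API; the class and method bodies are hypothetical):

import pandas as pd


class FairnessManager:
    def __init__(self, initialization_examples, evaluation_examples):
        self._initialization_examples = initialization_examples
        self._evaluation_examples = evaluation_examples

    def add(self, sensitive_features, dataset='evaluation'):
        """Select which data the fairness assessment runs over.

        dataset may be 'initialization', 'evaluation', or 'both', where
        'both' concatenates the first two, per the proposal above.
        """
        if dataset == 'initialization':
            data = self._initialization_examples
        elif dataset == 'evaluation':
            data = self._evaluation_examples
        elif dataset == 'both':
            data = pd.concat([self._initialization_examples,
                              self._evaluation_examples])
        else:
            raise ValueError(
                "dataset must be 'initialization', 'evaluation' or 'both'")
        # ... run the fairness assessment over `data` ...
        return data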
Also, are these going to be copies or references? We don't want to be keeping multiple copies of the data around.
References, but removing datasets from the explanations means we need to make changes in interpret-community. I think that's out of scope for this PR, but it might be something we can do long-term.
Sorry I'm confused. Why would we need to reinsert them if we want to get rid of them from the explanation? And if we are ok with keeping them on the explanation, and they are references already, why would we want to reinsert them?
Will they still be references after we've saved to disk and reloaded?
If the user wants an explanation, then we would need to reinsert the datasets. However, if they're then firing up the ModelAssessmentDashboard, won't that know how to get the common datasets whenever required, rather than needing them embedded in the explanations themselves? Again, in the transition to the TS layer, will the data remain referenced, or would we end up with multiple copies?
I think you've highlighted multiple issues here, but that code hasn't been implemented -- when it is implemented, we should make sure to use references and not make copies.
But why not build it in from the beginning?
We certainly could make it so that each component is a literal copy of what we have in the individual dashboards now. For an absolute MVP, that will work. However, if we take that approach, then we carry technical debt going forward in removing the duplication. And if we've written files like that, then things get particularly painful, since we'll have to keep supporting the older files.
I wrote a new comment on Teams. Maybe we can allow users to return explanations without datasets on them. There was already a user request for this:
interpretml/interpret-community#368
I can add this as a new feature in interpret-community.
""" | ||
|
||
Classification = 'classification' | ||
Regression = 'regression' |
Probability? For when you have a classification problem but want to work with prediction probabilities for each class?
a probability task? Sorry I'm confused. Is there a corresponding list of tasks in fairlearn?
It's one of the options for Fairlearn, where instead of a predicted class, you have a 'probability of class 1'. Perhaps a PM question?
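For context, a 'probability' task here just means the metric consumes scores rather than hard labels. A minimal sketch using Fairlearn's MetricFrame with a score-based metric (the data below is made up for illustration):

import numpy as np
from sklearn.metrics import roc_auc_score
from fairlearn.metrics import MetricFrame

# Toy data: hard labels for y_true, class-1 probabilities for y_pred.
y_true = np.array([0, 1, 1, 0, 1, 0])
y_scores = np.array([0.2, 0.8, 0.6, 0.4, 0.9, 0.3])
sensitive = np.array(['a', 'a', 'b', 'b', 'a', 'b'])

# A 'probability' style metric: AUC consumes scores, not predicted classes.
mf = MetricFrame(metrics=roc_auc_score,
                 y_true=y_true,
                 y_pred=y_scores,
                 sensitive_features=sensitive)
print(mf.by_group)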
Asked PM to clarify, waiting for PM reply.
Actually, I'm a bit surprised by this issue, because the mock notebook from PM just says:
task_type = 'classification or regression'
It would definitely be great to get more clarity on this functionality.
"""Provide model task constants. Can be 'classification' or 'regression'. | ||
""" | ||
|
||
Classification = 'classification' |
BinaryClassification, surely?
We automatically recognize this in our explainers. I'm not sure about other libraries like fairlearn, but if they need this specified, I can add it.
We only have metrics for binary classification and regression (and the probability ones). The meaning of things like 'false positive rate' starts becoming a little fuzzy when you have multiclass classification. Again, perhaps a PM question.
Asked PM to clarify, waiting for PM reply.
        if self._is_run:
            return
        model_task = ModelTask.Unknown
        explainer = MimicExplainer(self._model,
Also, do the explainers embed copies of the data? I've never dug into the details of the structure of the returned objects. If so, then we ought to delete the data from this copy and refer to the centrally held data.
The explanation has a copy of the dataset, and the explainer does too. But, I'm not sure why we would want to delete the dataset from the explanation? Is that just to reduce memory usage?
RAIAnalyzer won't be modifying the input dataset internally, will it?
Well, yes - to reduce memory usage. On the fairness side, I was expecting that we would only store the metrics, and not copies of the y_true, y_pred, and sensitive_feature arrays. Those will all add up, for no good reason. Then, if the dashboard wanted the particular arrays, it could look them up in the central store in the Analyser object.
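A minimal sketch of that metrics-only idea, assuming the analyzer object holds the single shared copy of y_true, y_pred, and the sensitive features (all names here are assumptions, not this PR's API):

from sklearn.metrics import accuracy_score
from fairlearn.metrics import MetricFrame


class FairnessManager:
    """Sketch: retain computed metrics, not copies of the underlying arrays."""

    def __init__(self, analyzer):
        # The analyzer is assumed to hold the only copy of the data.
        self._analyzer = analyzer
        self._metrics = None

    def compute(self):
        mf = MetricFrame(metrics=accuracy_score,
                         y_true=self._analyzer.y_true,
                         y_pred=self._analyzer.y_pred,
                         sensitive_features=self._analyzer.sensitive_features)
        # Only the aggregated results are kept on the manager.
        self._metrics = {'overall': mf.overall, 'by_group': mf.by_group}
        return self._metrics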
force-pushed from 17f740d to 8b87f2d
    SparseNumFeaturesThreshold = 1000


class ExplainerManager(BaseManager):
If managers are going to be public we should deliberate a bit on the name of explainability/interpret that we want to use for all of these things. One reason to prefer interpret is that counterfactuals are also considered explanations. The distinction might start to get confusing if we don't have good names for these two components.
Agree, but FeatureExplainerManager seems too long, and 'interpret' is never used in code by industry; it's only used in product names. Counterfactual explanations are different from feature importance explanations, but since the latter are more prevalent, I would argue to keep it as just Explainer?
force-pushed from 8b87f2d to 6d95593
force-pushed from 6d95593 to 36aeaa7
force-pushed from 36aeaa7 to e62442f
The Responsible AI Tools SDK enables users to analyze their machine learning models through a single API: they will be able to analyze errors, explain the most important features, validate fairness, compute counterfactuals, and run causal analysis.
Highlights of the package include:
- fairness.add() allows users to run Fairlearn to assess model fairness
- explainer.add() allows users to explain their model
- counterfactuals.add() allows users to compute counterfactuals
- error_analysis.add() allows users to run error analysis
- causal.add() allows users to run causal analysis
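Taken together, a purely illustrative sketch of how these calls might compose; the constructor arguments, the compute() call, and any parameters beyond the add() names above are assumptions, not defined in this excerpt:

from raitools import RAIAnalyzer  # import path assumed from the package name

# Wrap a trained model and its data in a single analyzer object.
analyzer = RAIAnalyzer(model, train_data, test_data,
                       target_column='target', task_type='classification')

# Each component is queued through its manager's add() entry point.
analyzer.fairness.add()         # run Fairlearn to assess model fairness
analyzer.explainer.add()        # explain the model
analyzer.counterfactuals.add()  # compute counterfactuals
analyzer.error_analysis.add()   # run error analysis
analyzer.causal.add()           # run causal analysis

# Run everything; result inspection would follow from here.
analyzer.compute()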