[PARO-720] ENH Model sharing helpers #315

stephen-hoover · 2019-08-09T03:48:44Z

A CivisML model has File and Project run outputs, each of which need to be shared to effectively share the "model". This helper will transparently handle sharing the run outputs. Include JSONValue in case a future version of CivisML uses that. The functions are patterned after the autogenerated API sharing endpoints.

stephen-hoover · 2019-08-19T13:59:43Z

@elsander , see what you think of the naming and other design choices here. It's not obvious what the right interface would be. I considered adding a list_models_shares function as well, but doing that right would be really complex, and I wasn't sure it was worth it.

elsander · 2019-08-20T14:40:43Z

civis/ml/tests/test_helper.py

+
+
+def test_share_model_debug_log():
+    # Debug logs need "write" permission to be shared


This isn't necessarily the behavior I'd expect. Why do we only share debug logs to people with write permissions?

My reasoning was that the debug logs are more internal workings of the model. If you just want someone to be able to see your model, with the lowest levels of permissions, you might not want to expose all of the logging. I'm certainly open to discussion on that.

I think you're right in most cases, a person who only needs read permissions won't need access to the debug logs. But I don't think it would ever be a problem to expose the debug logs, and there are some cases where you might expect read permissions to provide log access. Probably the most common is CS sharing one of us on a client job so that we can examine the logs and debug, but I could also imagine clients sharing jobs to debug internally, before they escalate to CS. My preference is to share all run outputs for consistency, but I'm open to discussion.

Okay, this makes the code simpler, so I'm happy to make the change. I'm not thinking of anything sensitive in the logs which wouldn't also be in the other model artifacts which would be shared at read level.

elsander · 2019-08-20T20:20:34Z

civis/ml/_helper.py

+            elif _output['object_type'] == 'Project':
+                _func = getattr(client.projects, "put_shares_" + entity_type)
+                if permission_level == 'read':
+                    # Users must be able to add to projects to use the model


Just for my own understanding, why is this?

The CivisML training job keeps a "scoring jobs" project as a run output. CivisML scoring jobs add themselves to this project so that users (and the Civis Platform UI) can link from training jobs to all dependent scoring jobs. Users need "write" permission to add things to projects.

I reasoned that users would expect that "read" permission on a model would give them the ability to make their own scoring jobs based on it. Does that make sense? The project is the only thing which would prevent you from scoring with only read permissions, and the error you get is a bit cryptic.

Hmm, I didn't think about whether a "read" user should be able to score based off the original model rather than a clone. That reasoning makes sense to me, thanks.

Follow-up question: what permissions does the original user have on a scoring job created by the new user in this way?

The original user wouldn't have any special permissions on the scoring job by default. If you don't have "read" permission on the scoring job, it wouldn't be visible to you in the project. I think this is expected behavior -- I could imagine the "scoring" project being filled with scoring jobs, and no one user being able to see all of them.

Sounds good. I agree that this is expected behavior.

elsander · 2019-08-20T20:23:58Z

civis/ml/_helper.py

+                                "put_shares_" + entity_type)
+                obj_permission = permission_level
+            else:
+                continue


Would it make sense to issue a message to the debug log if you hit this condition?

Also, are there any outputs where we'd expect to hit this condition? If you write OOS scores to a table, does that show up as a run output?

Yes, a debug message definitely makes sense. I'll add that.

I don't expect to ever hit this condition. I believe that Tables are the only possible run output not covered in this loop, and you can't grant people permissions on tables through the API endpoints. CivisML doesn't add any tables as run outputs, AFAIK.

elsander · 2019-08-20T20:28:06Z

civis/ml/_helper.py

+                _func = getattr(client.projects, endpoint_name)
+            elif _output['object_type'] == 'JSONValue':
+                _func = getattr(client.json_values, endpoint_name)
+            else:


Same note here, would it be useful to log skipped outputs?

Yes, will do.

elsander · 2019-08-20T20:30:25Z

civis/ml/_helper.py

+
+    Returns
+    -------
+    readers : dict::


I'm confused by the type annotation here. It looks like "users" and "groups" are lists of dicts, right? Should this say list[dict] to specify that explicitly?

I copy-pasted this directly from the corresponding doc strings for other sharing endpoints. I'll take a look.

The readers is a single dictionary. It contains keys users and groups, each of which is a list of dictionaries. In Python notation, I think you're right that the types of users and groups would be list[dict], but I think that what's show here is also correct, and consistent with the type notation used in the other API endpoints.

elsander · 2019-08-20T20:38:53Z

I think the function names make a lot of sense. I could see the list function being useful, but if it's complex to implement it's probably not worth doing right now. If we hear requests for it we could reevaluate.

stephen-hoover · 2019-08-25T20:55:25Z

@elsander , I addressed your comments.

elsander

LGTM!

stephen-hoover added the enhancement label Aug 9, 2019

stephen-hoover added this to the Next Version milestone Aug 9, 2019

PEP8

873d177

stephen-hoover changed the title ~~ENH Model sharing helpers~~ [PARO-720] ENH Model sharing helpers Aug 9, 2019

Stephen Hoover added 6 commits August 8, 2019 23:10

COMPAT No f-strings

a666b69

Only share debug logs at write and above

9ac4a02

COMPAT Remove keyword-only arguments for Python 2 compat

8c28769

ENH Add delete shares functions for models

612a2fe

DOC Mention sharing in the user docs

dd5a8e8

REF PEP8

cb59bc9

stephen-hoover requested a review from elsander August 19, 2019 13:58

stephen-hoover assigned elsander Aug 19, 2019

elsander suggested changes Aug 20, 2019

View reviewed changes

elsander assigned stephen-hoover and unassigned elsander Aug 20, 2019

patr1ckm mentioned this pull request Aug 23, 2019

ENH helper function to share models civisanalytics/civis-r#198

Open

Stephen Hoover added 2 commits August 25, 2019 15:39

CR

df1a9f3

Merge remote-tracking branch 'upstream/master' into share-models

15e9f16

stephen-hoover assigned elsander and unassigned stephen-hoover Aug 25, 2019

elsander approved these changes Aug 26, 2019

View reviewed changes

elsander assigned stephen-hoover and unassigned elsander Aug 26, 2019

stephen-hoover merged commit eded881 into civisanalytics:master Aug 26, 2019

stephen-hoover deleted the share-models branch August 26, 2019 14:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[PARO-720] ENH Model sharing helpers #315

[PARO-720] ENH Model sharing helpers #315

stephen-hoover commented Aug 9, 2019

stephen-hoover commented Aug 19, 2019

elsander Aug 20, 2019

stephen-hoover Aug 23, 2019

elsander Aug 23, 2019

stephen-hoover Aug 23, 2019

elsander Aug 20, 2019

stephen-hoover Aug 23, 2019

elsander Aug 23, 2019

stephen-hoover Aug 23, 2019

elsander Aug 23, 2019

elsander Aug 20, 2019

stephen-hoover Aug 23, 2019

elsander Aug 20, 2019

stephen-hoover Aug 23, 2019

elsander Aug 20, 2019

stephen-hoover Aug 23, 2019

stephen-hoover Aug 25, 2019

elsander commented Aug 20, 2019

stephen-hoover commented Aug 25, 2019

elsander left a comment



		def test_share_model_debug_log():
		# Debug logs need "write" permission to be shared

[PARO-720] ENH Model sharing helpers #315

[PARO-720] ENH Model sharing helpers #315

Conversation

stephen-hoover commented Aug 9, 2019

stephen-hoover commented Aug 19, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

elsander commented Aug 20, 2019

stephen-hoover commented Aug 25, 2019

elsander left a comment

Choose a reason for hiding this comment