Update mdf_to_pytorch to current spec #10

pgleeson · 2021-04-07T18:33:37Z

@patrickstock The MDF JSON example you currently use differs from the format used by the current Python API:

https://github.com/ModECI/MDFTests/blob/main/mdf_to_pytorch/example_mdfs/mlp_classifier/mlp_classifier.json

Some differences:

It uses lists for functions. Can you update those to use dicts?
A "value" attribute is required on the output nodes
reciever_port -> receiver_port
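A minimal sketch of how the first and third changes could be applied to an existing file. The old list layout shown here (a list of single-key dicts) and the "function"/"args" keys are assumptions made for illustration, not taken from mlp_classifier.json, and the helper names functions_list_to_dict and rename_key are hypothetical:

```python
import json


def functions_list_to_dict(functions):
    """Merge a list of single-key {name: definition} dicts into one dict."""
    merged = {}
    for entry in functions:
        for name, definition in entry.items():
            if name in merged:
                raise ValueError(f"duplicate function name: {name}")
            merged[name] = definition
    return merged


def rename_key(obj, old, new):
    """Recursively rename a dict key anywhere inside nested dicts and lists."""
    if isinstance(obj, dict):
        return {(new if k == old else k): rename_key(v, old, new)
                for k, v in obj.items()}
    if isinstance(obj, list):
        return [rename_key(item, old, new) for item in obj]
    return obj


# Assumed shape of a node in the old, list-based format.
old_node = {
    "functions": [
        {"linear_1": {"function": "linear", "args": {"variable0": "input_port1"}}},
        {"logistic_1": {"function": "logistic", "args": {"variable0": "linear_1"}}},
    ]
}
# Assumed shape of an edge that still carries the misspelled key.
old_edge = {"sender_port": "output_1", "reciever_port": "input_port1"}

new_node = dict(old_node, functions=functions_list_to_dict(old_node["functions"]))
new_edge = rename_key(old_edge, "reciever_port", "receiver_port")

print(json.dumps(new_node, indent=2))
print(new_edge)
```

The second item (the "value" attribute on output nodes) is not covered, since what it should contain depends on the model.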

Ignoring for now the loading of the weights from h5 for params, with these changes it should already be possible to load this file and export it to other formats, e.g. graphviz. Using a hacked version of your file, I ran python -m modeci_mdf.export.graphviz mlp_classifier.json 2 with the latest main of mdf.


patrickstock · 2021-04-07T22:07:56Z

@pgleeson I fixed and pushed items 2 and 3, but I thought we had agreed on lists to contain a node's functions so that an order is implicitly specified by their order in the list?

pgleeson · 2021-04-08T17:08:20Z

@patrickstock This was under discussion here: ModECI/MDF#18, but @kmantel seemed to be coming round to using dictionaries for these. The main point was that putting the functions in a certain order does not guarantee that this alone is sufficient to work out the order of evaluation.

As a general point though it might be best to default to the usage/naming/conventions in the Python API if at all possible and wait until any differences with the written spec are hammered out in the issues: https://github.com/ModECI/MDF/labels/specification. Will make it easier to test compatibility of implementations across codebases.

patrickstock · 2021-04-12T18:31:56Z

@pgleeson Not using lists works for me. Although if we want to unambiguously specify the order of execution of multiple functions in a single node, it seems like a "WhenFinished" condition would be necessary or some other similar control flow. My concern though is that we have only defined NodeSpecific and termination conditions in our docs, and I don't know where the conditions to specify the execution of intra-node functions would go, without breaking these out to being only single-function nodes themselves?

It seems to me we would need to add a category like "IntraNode" conditions, and then use dot-notation naming for these, such as:
"Node.Function2":{"type": "WhenFinished", "kwargs":{"dependency":"Node.Function1"}}

If you agree with this I will refactor. If we want to postpone this idea, I can break everything out to single-function nodes for the purpose of this example.
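To make the proposal concrete, here is a rough sketch of how a loader might read such entries. "IntraNode" conditions are only the idea floated above and are not part of the MDF spec; the helper intra_node_dependencies is hypothetical and handles only the "WhenFinished" type from the example:

```python
def intra_node_dependencies(conditions):
    """Group WhenFinished conditions by node: {node: {function: dependency}}."""
    grouped = {}
    for target, condition in conditions.items():
        if condition["type"] != "WhenFinished":
            continue  # other condition types are out of scope for this sketch
        # Split "Node.Function" names at the first dot.
        node, _, function = target.partition(".")
        dep_node, _, dep_function = condition["kwargs"]["dependency"].partition(".")
        if not function or not dep_function:
            raise ValueError(f"expected 'Node.Function' names, got {target!r}")
        if node != dep_node:
            raise ValueError(f"{target!r} depends on a function in another node")
        grouped.setdefault(node, {})[function] = dep_function
    return grouped


conditions = {
    "Node.Function2": {
        "type": "WhenFinished",
        "kwargs": {"dependency": "Node.Function1"},
    },
}
deps = intra_node_dependencies(conditions)
print(deps)  # {'Node': {'Function2': 'Function1'}}
```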

patrickstock · 2021-04-13T01:24:40Z

Just to clarify, I see how multiple functions in a given node work here: https://github.com/ModECI/MDF/blob/main/examples/Simple.json
There, the output port only specifies returning logistic_1. My concern is how to specify the order if more than one function were to be run on a given node pass.

kmantel · 2021-04-13T02:57:39Z

For Simple.json, the lines

for linear_1:
"variable0": "input_port1",

for logistic_1:
"variable0": "linear_1",

give linear then logistic. This seems better than what I expected in ModECI/MDF#18, but I don't see how this is any more specific in terms of ordering than just ordering in a list. I think if (in the old version) the spec was roughly

[
    "custom_1": "3 * linear_1()",    
    "linear_1": {...},
]

then this is just an invalid model, rather than an ambiguous but valid model.

Either way, it seems fair to require any intra-node functions to execute in an explicit order, once each, specified using this variable0 key, and that if any more complex patterns are needed then the functions should belong to separate nodes.

pgleeson · 2021-04-13T17:30:52Z

The above is the graphical representation of the https://github.com/ModECI/MDF/blob/main/examples/Simple.yaml example.

Clearly there is an order in which variables inside the node need to be calculated: input_port1 -> linear_1 -> logistic_1 -> output_1. Even if the functions were mixed up in the yaml file, there would still be only one order in which a program can and should execute these (determined by the values of the attributes, i.e. linear_1 is needed by logistic_1, so evaluate linear_1 before it).

The main question is: If there is enough information in the set of function definitions to unambiguously determine the proper order, should we get the user to annotate/order them to make it easier for programs to execute them?

My personal preference is no; keep the specification files simple, and throw an error if the order can't be determined from the values of the arguments. Helper methods in the API can be used for determining the order (it just needs to be worked out once on model loading) and this can be used every time the node needs to be evaluated.
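As an illustration of what such a helper might look like, here is a minimal sketch that derives the order with a topological sort from the standard library's graphlib (Python 3.9+). It is not the MDF implementation: the function name evaluation_order is hypothetical, the "args" layout is assumed from the Simple example, and only arguments whose whole value is the name of another function in the same node are treated as dependencies (expressions are not parsed):

```python
from graphlib import CycleError, TopologicalSorter


def evaluation_order(functions):
    """Return function names in an order that satisfies their argument dependencies.

    `functions` maps a function name to a definition with an optional "args"
    dict. Raises ValueError if the dependencies are circular.
    """
    names = set(functions)
    graph = {}
    for name, definition in functions.items():
        args = definition.get("args", {})
        # Keep only argument values that name another function in this node.
        deps = {v for v in args.values() if isinstance(v, str) and v in names}
        if name in deps:
            raise ValueError(f"function {name!r} depends on itself")
        graph[name] = deps
    try:
        return list(TopologicalSorter(graph).static_order())
    except CycleError as err:
        raise ValueError(f"cannot determine evaluation order: {err}") from err


# The functions are deliberately listed out of order.
functions = {
    "logistic_1": {"function": "logistic", "args": {"variable0": "linear_1"}},
    "linear_1": {"function": "linear", "args": {"variable0": "input_port1"}},
}
order = evaluation_order(functions)
print(order)  # ['linear_1', 'logistic_1']
```

The order is computed once when the node is loaded and can then be reused on every evaluation; a circular definition is rejected at load time rather than at run time.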

In this first version of MDF I would see that the full sequence of evaluations from inputs to functions to outputs in a node is always run to completion, so there should be no issue of anything else in the graph influencing whether a function is run or the order.

pgleeson · 2021-04-14T17:41:51Z

FYI 1: I've added functionality to the simple scheduler here: https://github.com/ModECI/MDF/blob/cdae3be54e05253400c488758f23d0eb3989373a/src/modeci_mdf/simple_scheduler.py#L132-L157 to work out the proper sequence of function calls in a node. The functions can be written in the Graph in any order, but the EvaluableGraph that's returned here has the EvaluableFunctions in the correct order.

FYI 2: @patrickstock Even without incorporating these changes for functions, could you ensure the mlp_test.py example is working with your code and updated mdf script, and open a new PR? I can't get it running here, and would like to test running an equivalent version of the MLP in MDF built via the API.

patrickstock added a commit that referenced this issue Apr 7, 2021

Issue #10 matched spec

5d6c1c0

patrickstock mentioned this issue Apr 14, 2021

fixed mdf2pytorch to run mlp_test.py with new function format #11

Merged
