Proposal for WASI-nn: a machine learning module #272

Closed
abrown opened this issue May 7, 2020 · 12 comments
abrown (Contributor) commented May 7, 2020

This is a tracking issue for discussing the addition of a machine learning module to WASI. I created a very rough draft of what the API could look like, wasi_ephemeral_nn.witx:

  • it is loosely inspired by the WebNN API, hence the name WASI-nn
  • it is scoped only for inference; we can discuss further below, but removing the need to define and train the execution graph makes the API much simpler (e.g. WebNN has a graph-builder API, and keeping up with all the newest kernels is not something I wanted to tackle yet)
  • it accepts graphs as opaque byte sequences, so the graph/model encoding format is not understood by this API and is only indicated by the $graph_encoding flag

Please let me know what you think about this approach!
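To make the inference-only shape of the proposal concrete, here is a minimal guest-side sketch. It is a hedged illustration, not the actual witx definitions: `ExecutionContext`, `set_input`, and `compute` are invented names, and the "inference" is a stand-in elementwise op so the example actually runs.

```rust
// Hedged sketch (not the real witx API): an inference-only flow --
// bind an input tensor, run forward propagation, read the output.
struct ExecutionContext {
    input: Vec<f32>,
}

impl ExecutionContext {
    fn new() -> Self {
        ExecutionContext { input: Vec::new() }
    }

    /// Copy the guest's input tensor into the context.
    fn set_input(&mut self, tensor: &[f32]) {
        self.input = tensor.to_vec();
    }

    /// Stand-in for forward propagation: double each element.
    fn compute(&self) -> Vec<f32> {
        self.input.iter().map(|x| x * 2.0).collect()
    }
}

fn main() {
    let mut ctx = ExecutionContext::new();
    ctx.set_input(&[1.0, 2.0, 3.0]);
    let output = ctx.compute();
    assert_eq!(output, vec![2.0, 4.0, 6.0]);
    println!("output = {:?}", output);
}
```

The point of the shape is that the guest never builds the graph node-by-node; it only feeds tensors in and reads tensors out, which is what keeps the surface small compared to a graph-builder API.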

leonwanghui (Contributor) commented:

Hi @abrown, first of all, I think it's a great idea! As a machine learning framework contributor, I believe the portability of WASM/WASI is a good fit for training/inferencing across hardware platforms.

But from what I have learned, the WebNN API acts as a user-facing API, much like the Android NN API (see figure below), whereas the WASI-nn API should be more device-oriented, like the Android NN HAL in the figure. Please correct me if I have misunderstood.

[Figure: Android NN framework architecture]

mingqiusun commented:

@leonwanghui Thanks for the input. Our view is that most machine learning training work is, and will be, done in a high-level language such as Python. Once a model is built, it needs to be deployed to a multitude of devices. This inferencing part is where WASM shines with its portability, and it is the focus of our current proposal. The API is designed to be framework- and model-format-agnostic, and we expect those WASI calls to be implemented for CPU, GPU, TPU, etc.

bhack commented May 8, 2020

> Our view is that most machine learning training work is, and will be, done in a high-level language such as Python. Once a model is built, it needs to be deployed to a multitude of devices. This inferencing part is where WASM shines with its portability, and it is the focus of our current proposal. The API is designed to be framework- and model-format-agnostic, and we expect those WASI calls to be implemented for CPU, GPU, TPU, etc.

This is probably the dominant scenario today, but research is also moving toward a different role for edge nodes/devices, so I suppose we could try to be a little future-proof and keep the design extensible for upcoming scenarios and use cases.

See section "D. Practical Training Principles at Edge" in Convergence of Edge Computing and Deep Learning: A Comprehensive Survey.

EDIT:
On the same topic, see also the "IV. Edge Training" section in A Survey on Edge Intelligence.

arunetm commented May 8, 2020

Most of the web frameworks for ML already seem to take a staged design approach, focusing on inference first with future support for training. One of the reasons is that use cases for inference on the web are easier to come by and will aid a useful design.
Future-proofing the spec is ideal, for sure, but training on the web still seems to be in its early stages, and future-proofing the design well for training at this point could be challenging. Unless there is significant overlap between the features necessary for training and inference, treating them independently might help land features for inference use cases sooner.

mingqiusun commented May 8, 2020

@bhack Interesting survey articles! Even though we are focusing on inferencing now, future-proofing is definitely one of our design goals. For example, future APIs could be added to support back propagation without changing our current model-loading and forward-propagation APIs. But if you find a limitation in the current proposal that prevents future expansion, please let us know.
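The "add training later without touching inference" argument can be sketched in type terms. This is a hedged illustration of the layering, not anything from the proposal itself: the trait and method names (`Inference`, `Training`, `forward`, `backward`) and the toy gradient step are all invented for the example.

```rust
// Hedged sketch: the inference surface stays frozen while a training
// surface is layered on later. All names here are illustrative.
trait Inference {
    fn forward(&self, input: &[f32]) -> Vec<f32>;
}

// A future extension adds back propagation without changing the
// Inference methods that deployed guests already depend on.
trait Training: Inference {
    fn backward(&mut self, grad_output: &[f32]);
}

struct Linear {
    weight: f32,
}

impl Inference for Linear {
    fn forward(&self, input: &[f32]) -> Vec<f32> {
        input.iter().map(|x| x * self.weight).collect()
    }
}

impl Training for Linear {
    fn backward(&mut self, grad_output: &[f32]) {
        // Toy gradient step: nudge the weight by the mean output gradient.
        let mean: f32 = grad_output.iter().sum::<f32>() / grad_output.len() as f32;
        self.weight -= 0.1 * mean;
    }
}

fn main() {
    let mut layer = Linear { weight: 2.0 };
    // Existing inference calls are untouched by the training extension.
    assert_eq!(layer.forward(&[1.0, 2.0]), vec![2.0, 4.0]);
    layer.backward(&[1.0, 1.0]);
    assert!((layer.weight - 1.9).abs() < 1e-6);
}
```

Guests compiled against only the inference surface keep working unchanged when the training surface appears, which is the backward-compatibility property being claimed.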

bhack commented May 8, 2020

It was just to raise awareness of the general trends; it is fine to stage the design as long as we have a good general overview.

leonwanghui (Contributor) commented:

@mingqiusun Thanks for the elaboration; it's much clearer to me now. But one thing is still confusing: from what you mentioned, this WASI-nn API would come into play when compiling the model (with its network parameters) into *.wasm and executing the model in a wasm runtime (such as wasmtime). If so, does that mean we need both wasi-nn-sdk and wasm-nn-c-api to implement the WASI-nn API?

Another concern: if the current scope of the WASI-nn API is only the inference scenario, how can we ensure the compatibility of operator implementations across devices (CPU, GPU, TPU, NPU, etc.) compared to the training scenario?

mingqiusun commented:

@leonwanghui This WASI-nn API would standardize how a WASM program loads and executes a NN model, just like any other WASI system call.

We expect a device vendor to provide NN framework and graph-encoding support. The load method would return an error when an unsupported model encoding scheme is passed in. This approach is similar to how a browser deals with image or video encodings.
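That "reject unsupported encodings at load time" behavior can be sketched as a simple dispatch on the encoding tag. This is a hedged stand-in, not the actual witx symbols: the `GraphEncoding` variants, `NnError`, and `load` signature are invented, and which backends a given host supports is purely an assumption of the example.

```rust
// Hedged sketch: the host inspects only the encoding tag, never the
// opaque model bytes, and errors out on encodings it wasn't built with.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum GraphEncoding {
    Onnx,
    OpenVino,
    TensorFlow,
}

#[derive(Debug, PartialEq, Eq)]
enum NnError {
    InvalidEncoding,
}

/// Opaque handle returned to the guest; the host keeps the real graph.
#[derive(Debug, PartialEq, Eq)]
struct Graph(u32);

fn load(_model_bytes: &[u8], encoding: GraphEncoding) -> Result<Graph, NnError> {
    match encoding {
        GraphEncoding::Onnx | GraphEncoding::OpenVino => Ok(Graph(0)),
        // Assumed for the example: this host has no TensorFlow backend.
        GraphEncoding::TensorFlow => Err(NnError::InvalidEncoding),
    }
}

fn main() {
    let model_bytes = vec![0u8; 16]; // stand-in for a serialized model
    assert!(load(&model_bytes, GraphEncoding::Onnx).is_ok());
    assert_eq!(
        load(&model_bytes, GraphEncoding::TensorFlow),
        Err(NnError::InvalidEncoding)
    );
}
```

The analogy to browser media codecs holds: the format list is a property of the host implementation, not of the API, so new encodings can be supported without changing the call signature.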

leonwanghui (Contributor) commented:

@mingqiusun Cool! Actually, I'm working on prototyping a wasm backend for the MindSpore framework. Although this ms-backend-wasm project is at a very early stage, I'm really interested in implementing a PoC of the WASI-nn API.

abrown (Contributor, Author) commented May 14, 2020

For those of us who are less witx-savvy, here's the generated documentation: docs.md.

tqchen commented May 15, 2020

FYI for those interested in ML on wasm: https://tvm.apache.org/2020/05/14/compiling-machine-learning-to-webassembly-and-webgpu

linclark (Member) commented Dec 4, 2020

Development on wasi-nn has moved to its own repo and is making good progress, so I'm going to close this one out. Any follow-up questions can be asked in that repo.

linclark closed this as completed Dec 4, 2020
7 participants