
Input to Inference Engine #7580

Closed
sidgoyal78 opened this issue Jan 17, 2018 · 2 comments
Labels
预测 (Prediction; originally named Inference, covers C-API inference issues, etc.)

sidgoyal78 (Contributor) commented on Jan 17, 2018

Input protobuf for Inference Engine

Please refer to Xreki's excellent PR #7315 to get an overview of the design of the Inference Engine.

The design of the inference engine depends on:

how we store the protobuf message of the ProgramDesc of the main_program in fluid.

The aim of this document is to look at two different approaches for doing this, and to evaluate the pros and cons of each approach.

If we look at an existing training and inference example in fluid, for example recognize_digits, we see that there are two objects of the Program class in Python, each of which has a ProgramDesc as its member.

The ProgramDesc obtained from default_main_program() essentially represents the neural network during the training phase. For the testing/inference phase, however, as in the above example, we prune() and inference_optimize() the original ProgramDesc to remove irrelevant operators and obtain a ProgramDesc (in the inference_program) that is suitable for inference. (For more details, see the implementation: link.) To simplify the writing, we call the first ProgramDesc the "training ProgramDesc" and the latter the "inference ProgramDesc". An important thing to note is that the "inference ProgramDesc" contains less information than the "training ProgramDesc", since we prune out operators which aren't required for inference.
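For concreteness, here is a rough sketch of the two ProgramDescs described above, loosely following the recognize_digits example. The import path, the layer calls, and the exact prune()/inference_optimize() signatures are assumptions about the fluid Python API of the time, not copied from the example:

```python
import paddle.v2.fluid as fluid  # import path assumed for the fluid version current at the time

# Training ProgramDesc: the default main program built while the network is defined.
image = fluid.layers.data(name='image', shape=[784], dtype='float32')
label = fluid.layers.data(name='label', shape=[1], dtype='int64')
prediction = fluid.layers.fc(input=image, size=10, act='softmax')
cost = fluid.layers.cross_entropy(input=prediction, label=label)
avg_cost = fluid.layers.mean(x=cost)

train_program = fluid.default_main_program()

# Inference ProgramDesc: prune() keeps only the operators needed to compute
# `prediction`, and inference_optimize() strips training-only settings.
# The method names come from the issue text; their exact signatures are assumptions.
inference_program = train_program.prune(targets=[prediction])
inference_program = inference_program.inference_optimize()
```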

Approach 1: Use inference ProgramDesc

Under the subsection inference-program of PR #7315, it is proposed that the "protobuf message of the main_program is saved using fluid.io.save_inference_model."

Based on the current implementation of fluid.io.save_inference_model, we observe that the "inference ProgramDesc" is stored (in addition to the model parameters). Now, there are again two options:

  1. We can modify the current implementation to save an inference ProgramDesc that also contains the feed and fetch operators (which isn't done in the current implementation). This has the benefit that a user who wants to do inference doesn't need to worry about providing feed and fetch lists to the Inference Engine API.

  2. We use the current implementation as is, and save the inference ProgramDesc without feed and fetch operators. Here, the user must provide feed and fetch lists as input to the Inference Engine API. (A rough sketch of both options follows this list.)
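As a rough illustration of the two options above, assuming the save_inference_model/load_inference_model interface in fluid.io at the time; the import path and argument names are assumptions:

```python
import paddle.v2.fluid as fluid  # import path assumed

image = fluid.layers.data(name='image', shape=[784], dtype='float32')
prediction = fluid.layers.fc(input=image, size=10, act='softmax')

exe = fluid.Executor(fluid.CPUPlace())
exe.run(fluid.default_startup_program())  # in practice, parameters are trained first

# Option 1.2 (assumed current behaviour): the pruned inference ProgramDesc and
# the parameters are saved, but no feed/fetch operators are added to it.
fluid.io.save_inference_model(
    dirname='recognize_digits.inference.model',
    feeded_var_names=['image'],
    target_vars=[prediction],
    executor=exe)

# At inference time the model is read back. Under option 1.1 the returned
# program would already carry feed/fetch operators; under option 1.2 the
# user has to pass the feed/fetch lists below to the Inference Engine.
[inference_program, feed_names, fetch_targets] = fluid.io.load_inference_model(
    dirname='recognize_digits.inference.model', executor=exe)
```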

However, the main drawback of both of the above options, and of saving the inference ProgramDesc in general, is that we will need extra provisions to allow online training. More specifically, we will need to also save the "training ProgramDesc" and additionally read it to support online training. Moreover, we might face issues related to parameter sharing when we want to both run inference and modify the parameters.

Approach 2: Use training ProgramDesc

A way to address the limitation of the first approach, at the cost of extra computation, is to save the "training ProgramDesc". In order to do inference, we can perform pruning and optimization on this ProgramDesc to obtain the "inference ProgramDesc". However, in order to do the pruning, the user will need to specify feed and fetch lists to the Inference Engine.

Even though we will have to perform pruning on the "training ProgramDesc" inside the Inference Engine, we will still be able to support online training, since we won't have to worry about saving and reading an additional "training ProgramDesc".
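A corresponding sketch of Approach 2. The way the training ProgramDesc is serialized to a file (the __program_desc__ filename), the use of Program.parse_from_string, and the fetch-variable lookup are all hypothetical details for illustration, not an existing Inference Engine interface:

```python
import paddle.v2.fluid as fluid  # import path assumed

# --- Save side: persist the full training ProgramDesc plus the parameters. ---
image = fluid.layers.data(name='image', shape=[784], dtype='float32')
label = fluid.layers.data(name='label', shape=[1], dtype='int64')
prediction = fluid.layers.fc(input=image, size=10, act='softmax')
cost = fluid.layers.mean(x=fluid.layers.cross_entropy(input=prediction, label=label))

exe = fluid.Executor(fluid.CPUPlace())
exe.run(fluid.default_startup_program())  # training would normally happen here

fluid.io.save_persistables(exe, dirname='recognize_digits.train.model')
with open('recognize_digits.train.model/__program_desc__', 'wb') as f:  # hypothetical filename
    f.write(fluid.default_main_program().desc.serialize_to_string())

# --- Inference side: the engine (a Python stand-in here) reconstructs the
# training ProgramDesc and prunes it using the user-supplied feed/fetch lists. ---
with open('recognize_digits.train.model/__program_desc__', 'rb') as f:
    train_program = fluid.Program.parse_from_string(f.read())

fetch_vars = [train_program.global_block().var(prediction.name)]
inference_program = train_program.prune(targets=fetch_vars)
inference_program = inference_program.inference_optimize()
```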

sidgoyal78 changed the title from "Input protobuf to Inference Engine" to "Input to Inference Engine" on Jan 17, 2018
Xreki added the 预测 (Prediction; originally named Inference, covers C-API inference issues, etc.) label on Jan 17, 2018
kexinzhao (Contributor) commented:

@sidgoyal78, after discussing with @Xreki, we think that all three proposals are worth pursuing. The priorities are 1.1 > 1.2 > 2.

sidgoyal78 (Contributor, Author) commented:

So I think 1.2 is already done by @Xreki's original PR #7097.
And now @kexinzhao's PR #7636 does 1.1.
Maybe I will work on 2.

Xreki closed this as completed on Apr 12, 2018