
Warm up layers models when they are loaded. #224

Closed
nsthorat opened this issue Apr 24, 2018 · 8 comments
@nsthorat (Contributor)

If we pass a tensor of zeros through the model when it loads, it will compile all the shader programs and upload the weights to the GPU, making the first real inference much faster.

This should be an easy win.

@bileschi (Contributor)

Can you explain a little more? Something like calling `model.predict` internally as the last step in `model.compile`?

@nsthorat (Contributor, Author)

Actually, I was thinking `model.predict(tf.zeros(inputLayer.shape));` as the last step of `tf.loadModel`.
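The proposal above might look roughly like the sketch below. The `resolveWarmupShape` helper is hypothetical (not part of TF.js), and the commented-out lines assume TF.js-style APIs (`tf.loadModel`, `model.predict`, `tf.zeros`); the wiring is illustrative, not the actual implementation. Since an input's batch dimension is often unknown (`null`), the helper resolves it to a concrete size first:

```javascript
// Hypothetical helper: resolve unknown (null) dimensions in an input shape
// to a concrete batch size so a real zeros tensor can be built.
function resolveWarmupShape(inputShape, batchSize = 1) {
  return inputShape.map((dim) => (dim == null ? batchSize : dim));
}

// Sketch of the proposed warm-up step after loading (illustrative only):
//   const model = await tf.loadModel(url);
//   const shape = resolveWarmupShape(model.inputs[0].shape);
//   model.predict(tf.zeros(shape)).dispose(); // compiles shaders, uploads weights

console.log(resolveWarmupShape([null, 100, 100])); // [1, 100, 100]
```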

@bileschi (Contributor)

Seems reasonable. Quick question: in general a model might have input layer(s) with incomplete shapes, e.g. `[?, 100, 100]`, where the unknown dimension is typically the batch size. If we guess the batch size and are incorrect, does the warm-up still help, or does the GPU memory need to be re-allocated because of the shape change?

@dsmilkov (Contributor)

Good question. In that case, we should warm up with the most common inference case, batchSize = 1. If we happen to be wrong, some of the GPU programs will be re-compiled, but that's fine.

@nsthorat (Contributor, Author)

We should warm up with batchSize = 1 since it will be faster. Using batchSize > 1 is unlikely to change which programs are compiled or which weights are uploaded, and most users will be running inference with a batchSize of 1 anyway.

@nsthorat nsthorat added the P1 label Oct 24, 2018
@nsthorat nsthorat added P2 and removed P1 labels Oct 24, 2018
@caisq caisq added type:feature New feature or request type:performance labels Feb 12, 2019
nsthorat pushed a commit that referenced this issue Aug 19, 2019
* 0.3.x: Update the publish-npm script to allow publishing from the release branch. (#203)

DEV

* Upgrade 0.3.x to 0.15.3 (#210)



* Fix win GPU packaging. (#208) (#211)

Turns out that the Windows GPU builds for TensorFlow 1.12 lack the expected directory structure and the eager headers. A bug has been filed with core TF, but we should bake in some fixes for this.

This PR simply refactors the downloading logic into a new file. I'd like to use this logic in the node-gles package as well (maybe worth releasing it as a stand-alone package in the future).

After the refactoring, I check the directory structure on Windows. If the folder structure is missing but the required tensorflow.dll exists, I re-create the directory structure and move/download the proper header files.

The screenshot below shows the contents of TF 1.12 Windows GPU:
![capture](https://user-images.githubusercontent.com/306276/53048799-719f4b80-344a-11e9-9004-3eef2446a246.PNG)


* Bump 0.3.1

* Add TensorBoard callback for model training: tf.node.tensorBoard() (#202) (#213)

FEATURE

See example screenshot:
![image](https://user-images.githubusercontent.com/16824702/52491877-19d52a80-2b96-11e9-8c24-5a403c2450d3.png)

Fixes #686

* [0.3.x] Upgrade nyc package to fix lodash security issue. (#218) (#219)

Bump for 0.3.x so we can get a security release spun.

https://github.com/tensorflow/tfjs-node/network/alert/yarn.lock/lodash/open


* Bump to 0.3.2

* Upgrade TS libraries and change binding from typings file to plain
TypeScript definition.

* Upgrade TS dependencies

* save

* Fix deps-stage

* save

* Revert TS changes and keep binary staging fixes.

* Don't use a definition file for the bindings.

This causes many issues and doesn't help with redistribution. Exporting a local definition file on top of what else is exported doesn't appear to be a commonly supported TypeScript use case. This fix simply moves the definitions into a normal TypeScript file.

This should fix: #1092

* save

* Add typescript integration project.

* save

* Add license
@rikkitook

A little side note: we noticed that on full TensorFlow with GPU enabled, consecutive calls with different batch sizes lead to the session re-warming. For example, the first call with batchSize = 1 is slow, and the first call with batchSize = 5 on the same session is also slow.
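If inference is expected at several batch sizes, the observation above suggests each size may trigger its own (re)compilation, so each could be warmed up separately. A minimal sketch, assuming the same TF.js-style APIs as above; `warmupShapes` is a hypothetical helper, not part of any library:

```javascript
// Hypothetical helper: build one concrete input shape per expected batch
// size, replacing the unknown (null) dimension with each size in turn.
function warmupShapes(inputShape, batchSizes) {
  return batchSizes.map((b) =>
    inputShape.map((dim) => (dim == null ? b : dim))
  );
}

// Illustrative use with a loaded model (TF.js-style calls, not executed here):
//   for (const shape of warmupShapes(model.inputs[0].shape, [1, 5])) {
//     model.predict(tf.zeros(shape)).dispose();
//   }

console.log(warmupShapes([null, 100], [1, 5])); // [[1, 100], [5, 100]]
```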

@gaikwadrahul8 (Contributor)

Hi, @nsthorat

Thank you for opening this issue for tracking purposes. Since this issue has been open for a long time, the code/debug information in it may no longer be relevant to the current state of the code base.

The TensorFlow.js team is constantly improving the framework by fixing bugs and adding new features. We suggest you try the latest TensorFlow.js version with the latest compatible hardware configuration, which could potentially resolve the issue. We can keep the issue open if it is still relevant; please confirm whether we should.

Thank you for your support and cooperation.

@gaikwadrahul8 (Contributor)

Hi, @nsthorat

We have not received any confirmation from you, so we are closing this issue now. If the issue is still relevant, please let us know and we'll re-open it, or feel free to create a new issue after trying the latest version of TensorFlow.js. Thank you!
