Large latency on Tensor allocation


**System information**
- TensorFlow version: 0.3.1
- Doc Link: https://www.tensorflow.org/jvm/install


**Describe the documentation issue**

Not sure this is the correct forum but I would like to some guidance on how to setup sessions and resource management would be interesting. 

After two weeks trying to understand why latencies in 0.3.1 were completely uncontrollable (as compared to official 1.3.1) I ran into #208. This matches my observations. 

We are trying to run Prediction on models with thousands of data points in different tensor per prediction. Memory allocation on the threads are in 20MB/s and there seems to be a sync between JavaCCP Allocation thread and our worker threads. In addition to this allocation using Size(1) tensor seems to be very slow (in the 7ms range). 

After reading #208, it seems we are doing everything wrong. But I don't really have a clear picture of how it should be done: Would `EagerSession` help? Could I use a `Session` per HTTP request? Should I allocate larger multi-dimensional  tensors instead of a single one? How should I configure thread pools?  I understand that the API is work in progress, but current documentation is very light on this kind of documentation.

I don't think this is a bug, but I can convert into some other sort of issue. 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Large latency on Tensor allocation #313

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Large latency on Tensor allocation #313

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions