Decreasing inference time on CPU #31
Thanks for this awesome model; it evaluates well with the pretrained model. Right now I am getting a 15 s average inference time on CPU. Is there any way to reduce it to 2-3 s?

Comments
It is a good question, but we have not explored it yet. The most straightforward way is to deploy the model in a format like ONNX.
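For the serving side, a minimal sketch of what running an exported graph with ONNX Runtime's CPU execution provider could look like; the file name "groundingdino.onnx" and the input name "image" are assumptions, since no official export exists yet:

```python
# Untested sketch: run an exported Grounding DINO graph with ONNX Runtime on CPU.
# "groundingdino.onnx" and the input name "image" are placeholder assumptions.
import numpy as np
import onnxruntime as ort

sess = ort.InferenceSession("groundingdino.onnx", providers=["CPUExecutionProvider"])
image = np.random.randn(1, 3, 800, 1200).astype(np.float32)  # normalized NCHW batch
outputs = sess.run(None, {"image": image})  # None -> return all model outputs
```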
@SlongLiu should the operations in the model already support ONNX export? E.g., should code vaguely similar to the below work?
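Something along these lines, as an untested sketch: `load_model` and the config/checkpoint paths follow the repo's README, while the dummy image, caption, and export settings are assumptions.

```python
# Untested export sketch; paths follow the repo README, inputs are placeholders.
import torch
from groundingdino.util.inference import load_model

model = load_model(
    "groundingdino/config/GroundingDINO_SwinT_OGC.py",
    "weights/groundingdino_swint_ogc.pth",
    device="cpu",  # assumes load_model accepts a device argument
)
model.eval()

dummy_image = torch.randn(1, 3, 800, 1200)  # normalized NCHW image batch
torch.onnx.export(
    model,
    # a trailing dict in args is passed to forward() as keyword arguments
    (dummy_image, {"captions": ["a cat ."]}),
    "groundingdino.onnx",
    opset_version=16,
    input_names=["image"],
)
```

One caveat: tracing would presumably bake the caption (and its tokenization) into the graph as constants, so a real export would likely need the tokenized text passed as explicit tensor inputs.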
I'll look into optimising it with tools like OpenVINO if it does.
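For reference, the OpenVINO step itself would be short; a sketch assuming the ONNX file from the export above:

```python
# Untested sketch: compile an exported ONNX graph with OpenVINO for CPU inference.
# "groundingdino.onnx" is an assumption carried over from the export discussion.
from openvino.runtime import Core

core = Core()
compiled = core.compile_model(core.read_model("groundingdino.onnx"), "CPU")
```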
I tried to use torch.onnx.export to convert Grounding DINO to ONNX, but I ran into some problems; for example, several logical operators are not supported by ONNX export.
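For what it's worth, errors like that usually come from Python boolean operators applied to tensors; the common workaround is to rewrite them with torch's elementwise ops. A generic example of the pattern, not Grounding DINO's actual code:

```python
import torch

mask_a = torch.tensor([True, False])
mask_b = torch.tensor([True, True])

# Not exportable: `mask_a and mask_b` / `not mask_a` force a tensor-to-bool
# conversion at trace time and tend to fail during ONNX export.

# Exportable equivalents: stay elementwise with torch ops.
both = torch.logical_and(mask_a, mask_b)  # elementwise AND
inverted = ~mask_a                        # elementwise NOT
either = mask_a | mask_b                  # elementwise OR
```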
Have you solved it?
I'm also interested in ONNX export code for GDINO.