onnx model export for vision encoder and text encoder model to get embedding #34

1093842024 · 2024-02-26T09:30:16Z

could you update onnx model export scripts for vision encoder and text encoder model to get embedding, thanks

1093842024 · 2024-02-26T11:32:12Z

what'more, for model pretrain with frame_num=4, can I change frame_num to 6 or other numbers to load the model and do inference?

Andy1621 · 2024-04-07T08:35:05Z

For ONNX, it's not created by me. You can ask the authors for help.

For more frames, you need to interpolate the temporal position embedding.

Provide feedback