Full compatibility with Segment Anything #1544

astnmsn · 2024-03-27T21:18:40Z

I have not found any equivalent request

Feature description

Addition of support for necessary operators to utilize the vit_b SAM Model which can be found here:

I have inspected the model using Netron and compared the nodes to the support list here

Below is the list of operators in the network that are partially or fully missing from the support list
Operator (missing support)

Mul (Import)
MatMul (Import)
Sin (Import)
Shape (Import)
Expand (Import)
Not (Import)
ReduceMean (Import)
ConstantOfShape (Full)
Where (Import)
Slice (Import)
OneHot (Import)
Tile (Import)
LayerNormalization (Import)
Gemm (Import)
ReduceMax (Import)
Floor (Full)
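
The comparison described above (operators used by the model versus the support list) can be sketched as a small lookup. This is only an illustration: `Gap`, `find_gaps`, and the `(burn, import)` flag pair are hypothetical names, not part of Burn, and reading "(Full)" as "missing from both Burn and the ONNX import" is an assumption. The op names would come from whatever tool lists the model's nodes, such as Netron.

```rust
use std::collections::BTreeMap;

/// Kind of missing support for an operator (hypothetical classification).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Gap {
    /// Burn has the op, but the ONNX import does not handle it yet.
    Import,
    /// Assumed meaning of "(Full)": neither Burn nor the ONNX import has it.
    Full,
}

/// Compare the op types used by a model against a support table.
///
/// `support` maps an op type to `(burn_supported, import_supported)`.
/// Ops that are absent from the table are treated as `Gap::Full`.
/// Duplicate op names in `model_ops` are reported once.
fn find_gaps<'a>(
    model_ops: &[&'a str],
    support: &BTreeMap<&str, (bool, bool)>,
) -> BTreeMap<&'a str, Gap> {
    let mut gaps = BTreeMap::new();
    for &op in model_ops {
        match support.get(op).copied() {
            // Fully supported: nothing to report.
            Some((true, true)) => {}
            // Available in Burn but not importable from ONNX.
            Some((true, false)) => {
                gaps.insert(op, Gap::Import);
            }
            // Missing in Burn, or not listed at all.
            _ => {
                gaps.insert(op, Gap::Full);
            }
        }
    }
    gaps
}
```

Calling `find_gaps` with the node types from the model and a table built from the supported-ops document yields a sorted map from op name to the kind of gap, which is the shape of the list above.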

Feature motivation

I am currently trying to load and run the segment-anything ONNX model inside an image editing app, to provide a masking experience similar to the demo available here. I am integrating it into an existing Rust codebase that uses wgpu, compiles to wasm, and runs in the browser.

Before I can select Burn as the ML library to support this workflow, I need to be sure that it supports the operators used by the model.


antimora · 2024-03-27T21:40:59Z

Thanks for filing this. This is helpful as we prioritize ONNX ops. If you have a direct link to the ONNX file, can you also link this?

antimora · 2024-03-27T21:46:30Z

Updating Expand to Import, since we just added this op. I need to update the docs.

antimora · 2024-03-27T21:58:25Z

Submitted a PR to fix the supported OPs document: #1547

astnmsn · 2024-03-27T22:04:40Z

This is the model download link provided by the SAM repo - I have also added to the original post

antimora · 2024-03-28T04:11:20Z

I think it might be faster to implement the model manually in Burn and load the pth weights file, which we now support.

You can check out an existing model to see how it's done: https://github.com/tracel-ai/models/tree/main/resnet-burn

We also have a YOLOX object detection PR in the works: tracel-ai/models#24

@laggui has written a great tutorial on this subject: https://dev.to/laggui/transitioning-from-pytorch-to-burn-45m

Recently, we made tons of enhancements to the PyTorchFileRecorder: https://discord.com/channels/1038839012602941528/1144670451763785769/1216788417984335872

@laggui, @nathanielsimard, @ashdtu, would this be worth implementing ourselves? Should we move this ticket to the models repo?

laggui · 2024-03-28T12:16:11Z

The community is always one step ahead 😄

We've actually discussed adding SAM to our models and this was in the plans following the release.

We still haven't decided whether we want to reimplement it and use the PyTorch file recorder to import the weights or use the ONNX import.

antimora · 2024-03-28T15:44:24Z

@laggui, if we decide to work on this, I am more inclined to add the missing ONNX ops. That would give us the biggest bang for the buck compared with spending time writing the model by hand (although I am not sure how complex it is).

laggui · 2024-04-09T14:28:32Z

Btw, not sure if anyone has delved into the SAM code for ONNX export but it doesn't include all the operations to actually run the model for an input image. The encoder part is totally left out of the ONNX export and the exported ONNX model expects image embeddings as input.

In their example they still use their pytorch implementation to provide the embeddings to the ONNX runtime.

So even if we support the missing operations in this issue, SAM support will still not be complete. Is this what you expected @astnmsn?

astnmsn · 2024-04-09T15:29:05Z

@laggui Thanks for asking, and yes, that is expected. We plan to run the first half of the model on the backend using PyTorch to generate the embeddings. Only the second half, which produces the masks from the embeddings and the cursor/click positions, will run on the client.

antimora · 2024-04-12T18:52:54Z

Regarding the Tile op: we need to rename our current repeat op to repeat_dim and implement a proper repeat that handles all dimensions at once.
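
To make the difference concrete, here is a minimal sketch of the two behaviours on a plain row-major buffer. It is not Burn's tensor API: `tile` and `repeat_dim` below are hypothetical free functions written for illustration, and the indexing rule used (each output coordinate maps to the input coordinate modulo the input size) is the ONNX Tile semantics as I understand them.

```rust
/// Repeat a row-major tensor `repeats[d]` times along every dimension `d`.
/// Each output element is read from the input at (coordinate % input size).
/// Returns the tiled data together with the output shape.
fn tile(data: &[f32], shape: &[usize], repeats: &[usize]) -> (Vec<f32>, Vec<usize>) {
    assert_eq!(shape.len(), repeats.len(), "one repeat count per dimension");
    assert_eq!(
        data.len(),
        shape.iter().product::<usize>(),
        "data length must match shape"
    );

    let out_shape: Vec<usize> = shape.iter().zip(repeats).map(|(s, r)| s * r).collect();
    let out_len: usize = out_shape.iter().product();
    let mut out = Vec::with_capacity(out_len);

    for flat in 0..out_len {
        // Walk the dimensions from last (fastest varying) to first, turning
        // the flat output index into coordinates and then into a source offset.
        let mut rem = flat;
        let mut src = 0;
        let mut stride = 1;
        for d in (0..shape.len()).rev() {
            let coord = rem % out_shape[d];
            rem /= out_shape[d];
            src += (coord % shape[d]) * stride;
            stride *= shape[d];
        }
        out.push(data[src]);
    }
    (out, out_shape)
}

/// Repeat along a single dimension only: the special case of `tile`
/// where every other dimension has a repeat count of 1.
fn repeat_dim(data: &[f32], shape: &[usize], dim: usize, times: usize) -> (Vec<f32>, Vec<usize>) {
    let mut repeats = vec![1; shape.len()];
    repeats[dim] = times;
    tile(data, shape, &repeats)
}
```

Written this way, the single-dimension variant is a thin wrapper over the all-dimensions one, which is one possible way to keep both behaviours after the rename.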

antimora · 2024-04-18T04:46:53Z

Resolving this ticket will resolve #1560 as well.

laggui · 2024-04-30T15:43:40Z

Current state of required ops based on the latest PRs:

op_type	Burn	Import
Add	✅	✅
Cast	✅	✅
Concat	✅	✅
Constant	✅	✅
ConstantOfShape	❌	❌
Conv	✅	✅
ConvTranspose	✅	✅
Cos	✅	✅
Div	✅	✅
Equal	✅	✅
Erf	✅	✅
Expand	✅	❌
Floor	❌	❌
Gather	✅	✅
Gemm	❌	❌
LayerNormalization	✅	✔️
MatMul	✅	✔️
Mul	✅	✅
Not	✅	✔️
OneHot	✅	❌
Pow	✅	✅
Reciprocal	✅	✅
ReduceMax	✅	✔️
ReduceMean	✅	✔️
Relu	✅	✅
Reshape	✅	✅
Resize	✅	❌
Shape	✅	✔️
Sin	✅	✔️
Slice	✅	❌
Softmax	✅	✅
Sqrt	✅	✅
Sub	✅	✅
Tile	✅	❌
Transpose	✅	✅
Unsqueeze	✅	✅
Where	✅	✔️

antimora added the onnx and feature (The feature request) labels Mar 27, 2024

antimora mentioned this issue Mar 27, 2024

Update SUPPORTED-ONNX-OPS.md #1547

Merged

antimora mentioned this issue Apr 3, 2024

ONNX conversion: Only tensor input is valid Argument #1560

Open

antimora assigned laggui Apr 12, 2024

laggui added this to Burn 🔥 Apr 19, 2024

laggui moved this to In Progress in Burn 🔥 Apr 19, 2024

laggui mentioned this issue Apr 22, 2024

Add layer norm onnx op support #1680

Merged

nathanielsimard unassigned laggui Apr 30, 2024

nathanielsimard moved this from In Progress to Todo in Burn 🔥 Apr 30, 2024

nathanielsimard moved this from Todo to In Progress in Burn 🔥 Apr 30, 2024

antimora mentioned this issue Apr 30, 2024

Help Wanted: Implementing ONNX Ops #1714

Open

39 tasks

antimora added the blocked label (Should not be tackled right now) May 1, 2024
