examples : add sample SAM inference #74
* Add loading of decoder layers in Model
* Multiply by hypernet_layer_cnt for ctx_size on model load
* Add decoder layers to py conversion script
* Fix wrong and reversed tensor sizes for decoder
* Add decoder transformer implementation
* Add decoder hypernet and iou prediction mlps
* Add transpose convolution operation and unit test
* Finish mask decoder and write the decoder output in the model state
* Output masks to png after removing padding and upsampling to original size
  - Also filter based on the iou threshold
  - Additionally filtering based on the stability score and crop boxes should be done
* Add stb image write in order to output masks from SAM
* Add transpose convolution 2d name and symbol to ggml ops static arrays
* Comment out debug print in transpose convolution test to fix compilation

ggml-ci
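The commit list above mentions filtering masks by the IoU threshold and, as a follow-up, by the stability score. Below is a minimal sketch of how those two filters could be combined. It is not the code from this PR: `MaskCandidate`, `stability_score`, and `filter_masks` are hypothetical names, and the stability score is assumed to follow the usual SAM definition (IoU between the mask binarized at `threshold + offset` and at `threshold - offset`).

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical container for one predicted mask (not the struct used in the PR).
struct MaskCandidate {
    std::vector<float> logits; // raw per-pixel mask logits
    float iou_pred;            // IoU predicted by the decoder's IoU head
};

// Stability score: IoU between the mask binarized at (threshold + offset)
// and the mask binarized at (threshold - offset). The high-threshold mask is
// a subset of the low-threshold one, so the IoU reduces to |high| / |low|.
float stability_score(const std::vector<float> & logits, float threshold, float offset) {
    assert(offset >= 0.0f);
    std::size_t n_high = 0;
    std::size_t n_low  = 0;
    for (float v : logits) {
        if (v > threshold + offset) n_high++;
        if (v > threshold - offset) n_low++;
    }
    return n_low == 0 ? 0.0f : (float) n_high / (float) n_low;
}

// Returns the indices of the masks that pass both filters.
std::vector<std::size_t> filter_masks(const std::vector<MaskCandidate> & masks,
                                      float iou_threshold,
                                      float stability_threshold,
                                      float mask_threshold,
                                      float offset) {
    std::vector<std::size_t> kept;
    for (std::size_t i = 0; i < masks.size(); ++i) {
        // Drop masks whose predicted IoU is too low.
        if (masks[i].iou_pred < iou_threshold) continue;
        // Drop masks whose shape changes too much when the cutoff is perturbed.
        if (stability_score(masks[i].logits, mask_threshold, offset) < stability_threshold) continue;
        kept.push_back(i);
    }
    return kept;
}
```

All thresholds are supplied by the caller, so the sketch makes no claim about the values this example uses. Crop-box filtering, also listed as pending, is left out.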
Add README.md with instructions for obtaining and converting the model and we can merge it. We can continue optimizations from
Additionally I think we should:
Added all those next steps to the README, and I am going to start working on them.
@ggerganov I just stumbled upon your SAM code and this comment:
@cmp-nct Ah good to know - thanks for looking into this
Initial version: #418 (comment)
PTH tensors for ViT-B