ENH: Enable Global Similarity Comparison Between Text and Image Inputs #606

ozan-oktay · 2022-09-14T18:45:04Z

#604
Proposed changes:

Image inference engine returns global image embeddings if user needs to compute image and sentence level alignment.
Similarly, VLP engine can use this method to return a single cosine similarity score as in CLIP method.

hi-ml-multimodal/src/health_multimodal/image/inference_engine.py

fepegar

Thanks, @ozan-oktay. It would be best to make the methods slightly more modular: one function to load, one function to transform, one function to compute embeddings. This will maximise flexibility and simplify testing. I'm happy to work on that if you like.

ozan-oktay · 2022-09-15T10:54:57Z

Thanks, @ozan-oktay. It would be best to make the methods slightly more modular: one function to load, one function to transform, one function to compute embeddings. This will maximise flexibility and simplify testing. I'm happy to work on that if you like.

@fepegar The PR is not introducing any new loading or transformation functions -- It's using the existing class methods to return a global embedding that is similar to patch embedding computation. It also adds a separate method in VLP inference engine to return a similarity score. Please feel free to create a separate issue for the idea you have in mind, and if there is something particularly applicable within this context, please let me know. Thanks.

hi-ml-multimodal/src/health_multimodal/image/inference_engine.py

hi-ml-multimodal/src/health_multimodal/vlp/inference_engine.py

hi-ml-multimodal/test_multimodal/vlp/test_vlp_inference_engine.py

for more information, see https://pre-commit.ci

…icrosoft/hi-ml into ozoktay/multimodal_global_similarities

add global similarity comparison between text and image inputs

1107120

ozan-oktay assigned Shruthi42 and fepegar Sep 14, 2022

rename the file to address pytest issue

7743472

Shruthi42 approved these changes Sep 15, 2022

View reviewed changes

hi-ml-multimodal/src/health_multimodal/image/inference_engine.py Outdated Show resolved Hide resolved

fepegar reviewed Sep 15, 2022

View reviewed changes

ozan-oktay closed this Sep 15, 2022

ozan-oktay requested a review from ant0nsc September 15, 2022 10:55

ozan-oktay reopened this Sep 15, 2022

update method naming -- PR comment

ccbcdf8

fepegar requested changes Sep 15, 2022

View reviewed changes

Ozan Oktay and others added 5 commits September 15, 2022 06:00

add support for multi prompt similarities

e4d2281

update tests

5ef2aa7

[pre-commit.ci] auto fixes from pre-commit.com hooks

f4de457

for more information, see https://pre-commit.ci

update function naming

00e258c

Merge branch 'ozoktay/multimodal_global_similarities' of github.com:m…

d539655

…icrosoft/hi-ml into ozoktay/multimodal_global_similarities

ozan-oktay removed the request for review from ant0nsc September 15, 2022 13:23

Improve method docstring

4f91ee4

fepegar approved these changes Sep 15, 2022

View reviewed changes

Ozan Oktay added 4 commits September 15, 2022 11:31

fix the issue with normalising embeddings twice

fa01961

Merge branch 'ozoktay/multimodal_global_similarities' of github.com:m…

fd5c10f

…icrosoft/hi-ml into ozoktay/multimodal_global_similarities

add initial version of zero-shot classification

13d9b90

finalise the zero-shot classification test

c62590d

fepegar enabled auto-merge (squash) September 16, 2022 10:53

fepegar mentioned this pull request Sep 16, 2022

"Expected" tests are not running #609

Closed

Ozan Oktay added 2 commits September 16, 2022 04:28

update Readme - mentioning zero shot classification

63e68a4

update the example info -- no commercial or clinical use

eb554de

fepegar merged commit 9ccb647 into main Sep 16, 2022

fepegar deleted the ozoktay/multimodal_global_similarities branch September 16, 2022 11:48

ozan-oktay mentioned this pull request Sep 16, 2022

Add support to compute embeddings and similarities #604

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: Enable Global Similarity Comparison Between Text and Image Inputs #606

ENH: Enable Global Similarity Comparison Between Text and Image Inputs #606

ozan-oktay commented Sep 14, 2022

fepegar left a comment

ozan-oktay commented Sep 15, 2022 •

edited

Loading

ENH: Enable Global Similarity Comparison Between Text and Image Inputs #606

ENH: Enable Global Similarity Comparison Between Text and Image Inputs #606

Conversation

ozan-oktay commented Sep 14, 2022

fepegar left a comment

Choose a reason for hiding this comment

ozan-oktay commented Sep 15, 2022 • edited Loading

ozan-oktay commented Sep 15, 2022 •

edited

Loading