Visualization of cost map #40
Comments
Hello, I have just started learning computer vision. I tried writing my own cost volume visualization code, but the results are not very good. Have you implemented a visualization of the cost volume, or do you have any other ideas? Thank you.
Hi, I referred to the visualization method for image-text similarity in CLIP-surgery. Since cost_map is the cosine similarity between images and text, I think overlaying cost_map on the original image gives the visualization result. By the way, would you consider releasing your code so we can look at it together and see how it's done?
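For reference, overlaying a similarity map on an image can be sketched as a simple alpha blend. This is only an illustration, not the code used in this repository or in CLIP-surgery: the function names, the blue-to-red colour ramp, and the `alpha` parameter are all assumptions, and the heat map is assumed to already be scaled to [0, 1] and resized to the image resolution. It is written without dependencies so it runs anywhere.

```python
def heat_to_rgb(h):
    """Map a heat value in [0, 1] to RGB on a blue (cold) to red (hot) ramp.

    The ramp is an arbitrary illustrative choice, not a standard colormap.
    """
    h = min(max(h, 0.0), 1.0)  # clamp out-of-range values
    return (255.0 * h, 0.0, 255.0 * (1.0 - h))


def overlay_heatmap(image, heat, alpha=0.5):
    """Alpha-blend a heat map onto an image.

    image: H x W nested list of (r, g, b) tuples with values in 0..255
    heat:  H x W nested list of floats in [0, 1], same size as the image
    alpha: weight of the heat colour (0 = image only, 1 = heat only)
    """
    if len(image) != len(heat) or any(
        len(img_row) != len(heat_row) for img_row, heat_row in zip(image, heat)
    ):
        raise ValueError("image and heat map must have the same height and width")

    blended = []
    for img_row, heat_row in zip(image, heat):
        out_row = []
        for pixel, h in zip(img_row, heat_row):
            color = heat_to_rgb(h)
            out_row.append(
                tuple(round((1.0 - alpha) * p + alpha * c) for p, c in zip(pixel, color))
            )
        blended.append(out_row)
    return blended
```

With real images you would more likely do the same blend on arrays and use a proper colormap; the arithmetic per pixel is the same.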
Hello, thank you very much for your reply. My method is very simple: I crop the image into local patches, pass the patches and the text prompt through CLIP's image and text encoders, and compute the cosine similarity between each local patch and the text prompt to generate the heat map. However, this is easily affected by color words in the prompt; e.g. a woman's hair also becomes hotter when the text prompt contains 'Black'. Thank you for the pointer, I'm going to study CLIP_Surgery's implementation next! Thanks again for your reply, have a nice day!
[Image: original image] [Image: heat map]
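The patch-wise approach described above boils down to one cosine similarity per crop. A minimal sketch, assuming the patch and text embeddings have already been produced by CLIP's image and text encoders (the encoders are not called here, and the function names are hypothetical):

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # convention chosen here for zero vectors
    return dot / (norm_a * norm_b)


def similarity_grid(patch_embeddings, text_embedding):
    """Score every patch against one text prompt.

    patch_embeddings: rows x cols nested list of embedding vectors,
                      one per image crop, laid out like the crop grid
    text_embedding:   embedding vector of the text prompt
    Returns a rows x cols grid of cosine similarities in [-1, 1].
    """
    return [
        [cosine_similarity(patch, text_embedding) for patch in row]
        for row in patch_embeddings
    ]
```

The resulting grid has one value per crop, so it is much coarser than the image and still needs normalizing and resizing before it can be overlaid.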
Hi, thanks for your released code. Regarding your questions:
Thank you for your reply and answer. I am still in the process of learning. Have a nice day🌹🌹
For the aggregated cost volume, we show the output of our model, which therefore has a higher resolution of 96x96. We simply apply bilinear upsampling to overlay it on the image.
I don't have the code at the moment, but the visualized figures are min-max normalized, with some scaling for visual clarity, since the scale of the model output does not necessarily match that of the initial cost volume. This should probably be enough to reproduce the figure, but please let me know if you need more details.
Originally posted by @hsshin98 in #6 (comment)
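A rough sketch of the two steps described in the quoted comment, min-max normalization and bilinear upsampling, is given below. It is not the authors' code: the `scale` parameter is a hypothetical stand-in for the unspecified "scaling for visual clarity", and the upsampling uses the corner-aligned convention, which is an assumption (the original may use a different one).

```python
def min_max_normalize(grid, scale=1.0):
    """Rescale a 2D grid to [0, 1] using its own min and max.

    scale is a hypothetical extra gain applied after normalization;
    results are clipped at 1.0.
    """
    flat = [v for row in grid for v in row]
    lo, hi = min(flat), max(flat)
    if hi == lo:
        return [[0.0 for _ in row] for row in grid]  # constant map
    return [[min(1.0, scale * (v - lo) / (hi - lo)) for v in row] for row in grid]


def bilinear_upsample(grid, out_h, out_w):
    """Resize a 2D grid with bilinear interpolation (corners aligned)."""
    in_h, in_w = len(grid), len(grid[0])
    out = []
    for i in range(out_h):
        # Position of output row i in input coordinates.
        y = i * (in_h - 1) / (out_h - 1) if out_h > 1 else 0.0
        y0 = int(y)
        y1 = min(y0 + 1, in_h - 1)
        wy = y - y0
        row = []
        for j in range(out_w):
            x = j * (in_w - 1) / (out_w - 1) if out_w > 1 else 0.0
            x0 = int(x)
            x1 = min(x0 + 1, in_w - 1)
            wx = x - x0
            # Interpolate along x on the two neighbouring rows, then along y.
            top = (1.0 - wx) * grid[y0][x0] + wx * grid[y0][x1]
            bottom = (1.0 - wx) * grid[y1][x0] + wx * grid[y1][x1]
            row.append((1.0 - wy) * top + wy * bottom)
        out.append(row)
    return out
```

For a 96x96 model output, `bilinear_upsample(min_max_normalize(cost), H, W)` would give a map at image resolution H x W with values in [0, 1]. In a PyTorch pipeline, `torch.nn.functional.interpolate` with `mode="bilinear"` performs the resizing step.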
Do you have the code to visualize the cost map now? How can I reproduce it?