
About Visualization Tools #2

Open
KaKa-101 opened this issue Dec 19, 2024 · 4 comments

Comments

@KaKa-101

Thanks for your great work, which is significant for this area.
Could you please provide the visualization tool for Figure 3? I am really interested in the visual results of the retained image patches and want to try it on my own.
Thanks a lot.

@ChimpOnCloud
Collaborator

lol you mean fig.2? Currently we just picked random samples and tracked their attention maps to get these data. You can simply get the attention distribution in llava_llama.py, and get the [CLS] attention distribution of the visual encoder in clip_encoder.py. Later we will release the complete visualization tool.
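For anyone who wants to try this before the official tool lands, here is a minimal sketch of the [CLS]-attention extraction step described above. It is not the repo's actual code: the function names are hypothetical, and it assumes you already have one layer's post-softmax attention weights from the visual encoder (e.g. obtained with `output_attentions=True`), shaped `(num_heads, seq_len, seq_len)` with token 0 being [CLS].

```python
import numpy as np

def cls_patch_attention(attn):
    """[CLS]->patch attention distribution from one attention layer.

    attn: (num_heads, seq_len, seq_len) post-softmax weights, where
    token 0 is assumed to be [CLS] and the rest are image patches.
    Returns a (seq_len - 1,) distribution averaged over heads.
    """
    cls_row = attn[:, 0, 1:]      # attention paid by [CLS] to each patch
    dist = cls_row.mean(axis=0)   # average over heads
    return dist / dist.sum()      # renormalise after dropping [CLS]->[CLS]

def top_patches(dist, keep_ratio=0.25):
    """Indices of the patches with the highest [CLS] attention scores."""
    k = max(1, int(len(dist) * keep_ratio))
    return np.argsort(dist)[::-1][:k]
```

With a CLIP ViT-L/14 at 336px input you would get `seq_len = 577` (1 [CLS] + 576 patches), so `top_patches(dist, 0.25)` returns the 144 most-attended patch indices.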

@KaKa-101
Author

KaKa-101 commented Dec 23, 2024

Sorry, I actually meant fig.4

@ChimpOnCloud
Collaborator

ChimpOnCloud commented Dec 23, 2024

For fig.4, currently we just filtered the patches with the top [CLS] attention scores, and manually marked each object with a different color so that readers of the paper can see the effectiveness of pruning with [CLS] attention. We are considering introducing automatic tools like SAM to mark the different objects in the near future.
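The filtering step above (keep the top-scoring patches, darken the rest so the retained regions stand out) can be sketched as follows. This is an illustrative reimplementation, not the authors' code; it assumes a 24x24 patch grid with 14px patches (a ViT-L/14 at 336px), and the object coloring in the figure was done by hand, so only the keep/prune overlay is shown here.

```python
import numpy as np

def prune_mask(dist, grid=24, keep_ratio=0.25):
    """Boolean (grid, grid) mask of retained patches.

    dist: per-patch [CLS] attention scores, length grid * grid.
    """
    k = max(1, int(dist.size * keep_ratio))
    keep = np.argsort(dist)[::-1][:k]
    mask = np.zeros(dist.size, dtype=bool)
    mask[keep] = True
    return mask.reshape(grid, grid)

def overlay(image, mask, patch=14, dim=0.3):
    """Dim pruned patches of an (H, W, 3) uint8 image for visualization."""
    pixel_mask = mask.repeat(patch, axis=0).repeat(patch, axis=1)
    out = image.astype(float)
    out[~pixel_mask] *= dim        # darken pruned regions
    return out.astype(np.uint8)
```

Saving the result of `overlay(...)` (e.g. with PIL) gives an image where only the retained patches keep their original brightness, which is roughly what the figure shows.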

@KaKa-101
Author

Thanks for your kind reply.
Looking forward to the release of your complete visualization tools~
