How to do Instance Retrieval like the demo? #7

mau-io · 2023-04-18T05:32:28Z

Issue Title: How to do Instance Retrieval like the demo?

Issue Description:

Hello, I have recently come across the demo on instance retrieval in this repository, and I'm very interested in implementing a similar feature in my own project. However, I'm having some difficulties understanding the exact steps and required components to achieve this.

Can you please provide some guidance or documentation on how to replicate the instance retrieval functionality as demonstrated in the demo? Specifically, I'd like to know about the following:

Which part of the codebase is responsible for instance retrieval?
Any examples or tutorials that could help me better understand the implementation?

I appreciate any help or direction you can provide. Thank you!

MarcSzafraniec · 2023-04-18T12:03:17Z

Hi! To answer your question: the part for instance retrieval is not public, so it is not in this code base.

However, the steps to reproduce it are pretty simple: if you have a dataset that you want to retrieve from (the "base dataset") and a "query" image, what you need to do is:

First, embed every image in the base dataset with the forward method of one of our models. That will give you a tensor for each image.
Then, convert these tensors into a NumPy array and add them to a Faiss index.
Finally, embed the query image with the same model and search for it in the Faiss index.

It should not be very complicated once you know how to use Faiss! And if the base dataset is relatively small, it should be pretty fast.

patricklabatut · 2023-04-18T20:49:24Z

the part for instance retrieval is not public

The k-NN evaluation code does provide functionality close to that though (with a k-NN classifier).

To complement @MarcSzafraniec's answer with a possibly higher-level description of the retrieval process:

First, map all the images (both query images and database images) to feature vectors representing these images. At the most basic level, this is done by just applying the model (see this function in the evaluation code).
Then, for each query image and its feature vector, determine the closest database images by searching for the closest database feature vectors (closest according to some similarity measure, e.g. cosine similarity). This can be done in vanilla PyTorch by computing the cosine similarity, i.e. the dot product between normalized query and database vectors, and extracting the top-k database vectors.
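The second step can be sketched in plain PyTorch as follows. This is only an illustration under stated assumptions: random tensors stand in for real features, the feature size of 384 is arbitrary, and `top_k_cosine` is a hypothetical helper name, not a function from this repository.

```python
import torch
import torch.nn.functional as F


def top_k_cosine(query_features: torch.Tensor, database_features: torch.Tensor, k: int = 5):
    """Return (scores, indices) of the k most similar database vectors per query."""
    q = F.normalize(query_features, dim=-1)
    db = F.normalize(database_features, dim=-1)
    # Dot product of unit vectors is the cosine similarity.
    similarity = q @ db.T  # shape: (num_queries, num_database)
    return similarity.topk(k, dim=-1)


# Stand-ins for features produced by the model.
torch.manual_seed(0)
database_features = torch.randn(100, 384)
# Two queries that are slightly perturbed copies of database rows 3 and 17.
query_features = database_features[[3, 17]] + 0.01 * torch.randn(2, 384)

scores, indices = top_k_cosine(query_features, database_features, k=5)
print(indices[:, 0])  # best match per query
```

This computes the full query-by-database similarity matrix, which is fine for small databases but grows linearly in memory with the database size.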

For some use cases (e.g. the demo), the above could (and should) be done partially offline by first indexing all the database feature vectors to facilitate subsequent queries, and then querying this index on each request. As noted above, the Faiss library can be used to implement such indexing and search logic.

sherylwang · 2023-04-20T12:22:13Z

It seems there are three output features: "x_norm_clstoken", "x_norm_patchtokens", and "x_prenorm". Which one should be used for retrieval?

woctezuma · 2023-04-20T15:38:35Z

For image retrieval, you can directly use:

dinov2/dinov2/eval/utils.py

Line 122 in fc49f49

features_rank = model(samples).float()

This would probably be the equivalent of the CLS token, so x_norm_clstoken which you mentioned.
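To make the choice concrete, here is a small sketch that follows the suggestion above and picks the CLS token as the retrieval embedding. The dictionary is a stand-in built from random tensors, not a real model output, and its shapes (one vector per image for the CLS token, one vector per patch for the patch tokens) are assumptions for illustration. `retrieval_embedding` is a hypothetical helper name.

```python
import torch
import torch.nn.functional as F

# Stand-in for the three outputs mentioned in the question (assumed shapes).
batch, num_patches, dim = 2, 256, 384
outputs = {
    "x_norm_clstoken": torch.randn(batch, dim),                  # one global vector per image
    "x_norm_patchtokens": torch.randn(batch, num_patches, dim),  # one vector per patch
    "x_prenorm": torch.randn(batch, 1 + num_patches, dim),       # all tokens before the final norm
}


def retrieval_embedding(outputs: dict) -> torch.Tensor:
    """Use the normalized CLS token as a single global descriptor per image."""
    return F.normalize(outputs["x_norm_clstoken"], dim=-1)


embedding = retrieval_embedding(outputs)
print(embedding.shape)
```

A single vector per image is what a nearest-neighbor index expects, which is why the CLS token is the natural candidate here. Patch tokens would need some pooling step first.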

See:

abdelkareemkobo · 2023-05-09T16:39:50Z

This may help: small demo

patricklabatut · 2023-05-11T21:22:06Z

Closing and keeping track as part of #95.

patricklabatut added the documentation Improvements or additions to documentation label Apr 18, 2023

patricklabatut self-assigned this Apr 18, 2023

woctezuma mentioned this issue Apr 24, 2023

How to evaluate on image retrieval datasets using nearest neighbors? #50

Open

patricklabatut mentioned this issue May 11, 2023

[request] Image retrieval documentation / example #95

Open

patricklabatut closed this as completed May 11, 2023
