When I run the object detection task, I find that different input prompts, such as "building" and "house", give very different detection results. Do you have any suggestions for choosing prompts? Do I need to know your open-vocabulary glossary?
Hello, I think it depends on your needs and on the model's actual performance in your scenario. You could try different prompts on a set of images and measure the performance of each one to decide which prompt to use.
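One way to make that comparison concrete is sketched below. It is only a rough illustration, not part of this repository: `detect` is a hypothetical callable that you would write around your own model, taking an image and a prompt and returning boxes as `(x1, y1, x2, y2)`. The dataset layout, the IoU threshold of 0.5, and the use of F1 as the score are all assumptions you may want to change.

```python
from typing import Any, Callable, Dict, List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def match_counts(pred: Sequence[Box], gt: Sequence[Box],
                 iou_thr: float = 0.5) -> Tuple[int, int, int]:
    """Greedy one-to-one matching; returns (tp, fp, fn)."""
    unmatched = list(gt)
    tp = 0
    for p in pred:
        best_idx, best_iou = -1, iou_thr
        for i, g in enumerate(unmatched):
            v = iou(p, g)
            if v >= best_iou:
                best_idx, best_iou = i, v
        if best_idx >= 0:
            unmatched.pop(best_idx)  # each ground-truth box matches once
            tp += 1
    return tp, len(pred) - tp, len(unmatched)


def score_prompts(detect: Callable[[Any, str], List[Box]],
                  dataset: Sequence[Tuple[Any, List[Box]]],
                  prompts: Sequence[str],
                  iou_thr: float = 0.5) -> Dict[str, float]:
    """Return the F1 score of each prompt over the whole dataset.

    `detect` is a hypothetical wrapper around your model (an assumption).
    `dataset` is a sequence of (image, ground_truth_boxes) pairs.
    """
    scores: Dict[str, float] = {}
    for prompt in prompts:
        tp = fp = fn = 0
        for image, gt in dataset:
            t, f, n = match_counts(detect(image, prompt), gt, iou_thr)
            tp, fp, fn = tp + t, fp + f, fn + n
        denom = 2 * tp + fp + fn
        scores[prompt] = 2 * tp / denom if denom else 0.0
    return scores
```

With a small labelled set, calling `score_prompts(detect, dataset, ["building", "house"])` gives one score per prompt, and the prompt with the highest score is a reasonable default for that scenario.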