Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

code #6

Open
lclszsdnr opened this issue Nov 28, 2024 · 1 comment
Open

code #6

lclszsdnr opened this issue Nov 28, 2024 · 1 comment

Comments

@lclszsdnr
Copy link

您好,请问我没有找到训练文件train_clip_vg.py。此外,我想请教一下您的这篇优异工作本质上也是利用语言信息影响视觉特征的提取对吗,类似于qrnet、VG-LAW等工作

@linhuixiao
Copy link
Owner

@lclszsdnr I'm sorry for not having had time to tidy up the code due to a series of work delays. At present, I have released all the models and code associated with this paper. Regarding your second question, yes, this paper is essentially a further exploration of language-guided visual grounding models such as TransVG++, QRNet, and VG-LAW etc..

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants