Hi there,
thank you for your awesome work!
I wonder whether you have tried OpenFlamingo on NLVR^2. The input in NLVR^2 is always a pair of images along with a statement, and the output is either true or false.
An example: [image omitted]
I am not sure whether the in-context setting in OpenFlamingo supports such paired-image input as a demonstration. Should I use a pair of <image> tokens to indicate the two images in one example?
Do you have any comments?
Best.
The way I would format the input is `<image><|endofchunk|><image>The left image contains twice the number of dogs as the right image, and at least two dogs in total are standing.` One limitation of doing so is that you would get less 'signal' from the first image, because the text can only attend to the image immediately preceding it. One thing you could explore is combining the two images and passing them in as a single image.
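To make the two options concrete, here is a rough sketch of both. The helper names (`paired_prompt`, `combined_prompt`, `combine_side_by_side`) are made up for illustration and are not part of OpenFlamingo; the side-by-side layout is just one possible way to merge the pair, and it assumes Pillow is installed.

```python
IMAGE_TOKEN = "<image>"
CHUNK_END = "<|endofchunk|>"


def paired_prompt(statement: str) -> str:
    """Two image tokens separated by an end-of-chunk token, as suggested above."""
    return f"{IMAGE_TOKEN}{CHUNK_END}{IMAGE_TOKEN}{statement}"


def combined_prompt(statement: str) -> str:
    """One image token, for when both pictures are merged into a single image."""
    return f"{IMAGE_TOKEN}{statement}"


def combine_side_by_side(left, right, background=(255, 255, 255)):
    """Paste two PIL images next to each other on one canvas (hypothetical helper)."""
    from PIL import Image  # Pillow, imported lazily; assumed to be installed

    left, right = left.convert("RGB"), right.convert("RGB")
    height = max(left.height, right.height)
    canvas = Image.new("RGB", (left.width + right.width, height), background)
    # Centre each image vertically so differently sized inputs line up.
    canvas.paste(left, (0, (height - left.height) // 2))
    canvas.paste(right, (left.width, (height - right.height) // 2))
    return canvas


statement = (
    "The left image contains twice the number of dogs as the right image, "
    "and at least two dogs in total are standing."
)
print(paired_prompt(statement))
print(combined_prompt(statement))
```

With the paired prompt you would pass both images to the model in order; with the combined prompt you would pass only the output of `combine_side_by_side`. How the true/false answer is appended for in-context demonstrations is left open here, since the thread does not specify it.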