the pretrained model weights #4

runzeer · 2022-05-07T02:19:04Z

Have any pretrained model weights been released?

sharifza · 2022-05-07T18:54:12Z

No. But maybe it's possible to make it work with the pretrained OPT weights recently released by Facebook and the pretrained CLIP weights.

lucidrains · 2022-05-08T00:30:28Z

> No. But maybe it's possible to make it work with the pretrained OPT weights recently released by Facebook and the pretrained CLIP weights.

great idea :) will still need to train the cross attention blocks (perceiver + gated), but should be doable
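A minimal sketch of that split, assuming the frozen OPT and CLIP weights and the new cross attention blocks can be told apart by parameter name. The name fragments below (`perceiver_resampler`, `gated_cross_attn`) and the example parameter names are hypothetical and would need to match whatever the actual model uses:

```python
# Hypothetical name fragments for the newly added blocks (assumption, not the repo's real names).
TRAINABLE_MARKERS = ("perceiver_resampler", "gated_cross_attn")


def is_trainable(param_name: str) -> bool:
    """Return True if the parameter belongs to one of the newly added blocks."""
    return any(marker in param_name for marker in TRAINABLE_MARKERS)


def split_params(param_names):
    """Partition parameter names into (trainable, frozen) lists, preserving order."""
    trainable = [name for name in param_names if is_trainable(name)]
    frozen = [name for name in param_names if not is_trainable(name)]
    return trainable, frozen


# Made-up parameter names, for illustration only.
example_names = [
    "vision_encoder.layers.0.attn.weight",     # pretrained CLIP-style encoder -> frozen
    "perceiver_resampler.latents",             # new block -> trainable
    "lm.layers.0.mlp.weight",                  # pretrained OPT-style LM -> frozen
    "lm.layers.0.gated_cross_attn.attn_gate",  # new block -> trainable
]
trainable, frozen = split_params(example_names)
```

In PyTorch the same test could be applied while iterating over `model.named_parameters()`, setting `requires_grad = False` on everything in the frozen group so that only the new blocks receive gradient updates.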

TheodoreGalanos · 2022-05-08T05:48:36Z

It would really be great to somehow allow loading pretrained weights for one (some?) of the available pretrained models.

Do you think it's really difficult (and unintuitive) to somehow allow pulling weights from arbitrary pretrained models (to an extent, something like GPTJ, GPTNeo(X), GPT2, etc.)?

LITDataScience · 2022-05-28T06:29:26Z

Are we planning to release any pretrained model weights in the future?

edmondja · 2022-09-10T12:59:51Z

It's actually not a good idea, since CLIP is not task agnostic: as explained in the CLIP paper, tasks with non-natural images such as EuroSAT perform poorly. So you would basically be making Flamingo look like a model that is one year older.
"Flamingo achieves state-of-the-art performance across a wide range of benchmarks without training on commonly used and curated datasets such as VQAv2, COCO or ImageNet. Instead, Flamingo is trained solely on task-agnostic web scraped data."

sharifza · 2022-09-10T15:51:54Z

> It's actually not a good idea, since CLIP is not task agnostic: as explained in the CLIP paper, tasks with non-natural images such as EuroSAT perform poorly. So you would basically be making Flamingo look like a model that is one year older. "Flamingo achieves state-of-the-art performance across a wide range of benchmarks without training on commonly used and curated datasets such as VQAv2, COCO or ImageNet. Instead, Flamingo is trained solely on task-agnostic web scraped data."

Flamingo's encoder backbone is trained with an approach similar to CLIP's, using contrastive text-image training (see Section 3 of the paper). The data for CLIP is also scraped from the web, I think in a very similar way to Flamingo's. Therefore, neither Flamingo's training process nor the data it uses is necessarily more task agnostic than CLIP's.

If you want to have a properly task-agnostic backbone, it's probably better to use a backbone trained with a self-supervised approach similar to BYOL. Nevertheless, the type of data is important. For example, you mentioned X-ray in the other thread, which could be quite different from typical data scraped from the web.

edmondja · 2022-09-10T16:13:23Z

I tried this, but I was counting on Flamingo to help me do few-shot learning, because my dataset is not really a dataset as one would imagine it, but rather a few examples of x-ray images.

sharifza · 2022-09-10T18:40:12Z

You might be able to use Flamingo for your use case by prompting (I'm not sure if it works, but it's worth a try). By prompting I mean that you give it one x-ray image, describe what's important in the image, then give it a second image and ask your query.
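A rough sketch of how such an interleaved prompt could be assembled. The `<image>` placeholder, the `<EOC>` separator, the file names and the `build_few_shot_prompt` helper are all assumptions made for illustration; the tokens and input format a given implementation expects may differ:

```python
IMAGE_TOKEN = "<image>"  # assumed placeholder marking where an image is attended to
END_OF_CHUNK = "<EOC>"   # assumed separator closing each described example


def build_few_shot_prompt(support, query_image, question):
    """Build an interleaved image/text prompt.

    support: list of (image_path, description) pairs used as in-context examples.
    Returns (images, text): images in order of appearance, and the text with one
    image placeholder per image.
    """
    images = []
    parts = []
    for image_path, description in support:
        images.append(image_path)
        parts.append(f"{IMAGE_TOKEN}{description}{END_OF_CHUNK}")
    # The query image comes last, followed by the question left open for the model.
    images.append(query_image)
    parts.append(f"{IMAGE_TOKEN}{question}")
    return images, "".join(parts)


# Hypothetical file names and wording, for illustration only.
images, text = build_few_shot_prompt(
    support=[("example_xray.png", "Description of what is important in this image.")],
    query_image="query_xray.png",
    question="Question: what is important in this image? Answer:",
)
```

The images would still need to go through the vision encoder and the text through the tokenizer before being passed to the model; this only shows the ordering of examples and query.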

I think this takes us off topic here; if you want, we can discuss this in another thread.

Ellyuca · 2023-03-20T13:20:34Z

Any news on the model weights?
Thanks.

runzeer closed this as completed May 7, 2022

runzeer reopened this May 7, 2022

runzeer changed the title ~~result for the VizWiz VQA result~~ the pretrained model weights May 7, 2022

edmondja mentioned this issue Sep 10, 2022

Web scraped data pretraining dhansmair/flamingo-mini#1

Closed
