
Rethink usage pattern for pretrained models #200

Open
nateraw opened this issue Sep 10, 2020 · 5 comments
Labels: enhancement, help wanted

Comments

@nateraw (Contributor) commented Sep 10, 2020

🚀 Feature

Switch to using SomeModel.from_pretrained('pretrained-model-name') for pretrained models.

Motivation

Seems we are following torchvision's pattern of having a 'pretrained' argument in the init of our models to initialize a pretrained model. In my opinion, this is extremely confusing, as it makes the other init args + kwargs ambiguous or useless.
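
For contrast, the current torchvision-style call looks roughly like this (illustrative only; the exact bolts signature may differ):

# Roughly the pattern being criticized (illustrative, not the exact bolts API):
# it is unclear whether input_height configures a fresh model or must match
# the hosted checkpoint.
vae = VAE(input_height=32, pretrained=True)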

Pitch

Add a .from_pretrained classmethod to models and initialize an instance of the class from it. Pretrained models should incorporate any hparams needed to fill out the init, I guess.

from pl_bolts.models import VAE

model = VAE.from_pretrained('imagenet2012')
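
A minimal sketch of how the classmethod could work, assuming a hypothetical PRETRAINED_URLS registry with a placeholder URL; the real hosting and implementation may differ:

import os
import torch
import pytorch_lightning as pl

# Hypothetical registry; the real checkpoints would be hosted on the PL side.
PRETRAINED_URLS = {
    'imagenet2012': 'https://example.com/checkpoints/vae-imagenet2012.ckpt',
}

class VAE(pl.LightningModule):
    ...

    @classmethod
    def from_pretrained(cls, identifier):
        if identifier not in PRETRAINED_URLS:
            raise KeyError(f'unknown pretrained identifier: {identifier}')
        url = PRETRAINED_URLS[identifier]
        ckpt = os.path.join(torch.hub.get_dir(), os.path.basename(url))
        if not os.path.exists(ckpt):
            os.makedirs(torch.hub.get_dir(), exist_ok=True)
            torch.hub.download_url_to_file(url, ckpt)
        # load_from_checkpoint restores the hparams stored in the checkpoint,
        # so the pretrained model fills out its own init args.
        return cls.load_from_checkpoint(ckpt)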

Alternatives

Additional context

nateraw added the enhancement and help wanted labels Sep 10, 2020
nateraw self-assigned this Sep 11, 2020
@williamFalcon (Contributor)

Yeah, agree... although this is basically just the same as load_from_checkpoint, no? Sounds like we're looking for checkpoint nicknames instead?

Doesn't it read better as:

VAE.pretrained_on('xyz')

@nateraw (Contributor, Author) commented Sep 11, 2020

Right, I think the distinction here is that load_from_checkpoint is for checkpoints you have saved locally, but this function would be for pretrained models that we are hosting (i.e. these guys).

So, yes! We are looking for something that can point to a nickname/identifier for a pretrained model.


I think 'pretrained_on' is a limiting name: a model could be pretrained on the same dataset twice with different settings, and then it would be ambiguous which one to load under that name. That's why I suggest something a little more open, such as from_pretrained(identifier).

This is just my opinion... I could be convinced otherwise haha 😄. Let's have others weigh in to reach a consensus.

CC: @PyTorchLightning/core-contributors

@williamFalcon (Contributor)

Oh, I see. It's an id, not a dataset. Yeah, that works.

For instance, we can have many backbones with different datasets as well:

CPC.from_pretrained('resnet18-imagenet')
CPC.from_pretrained('resnet50-imagenet')
CPC.from_pretrained('resnet18-stl10')

@Borda (Member) commented Sep 11, 2020

Yes, they are trained on a defined dataset; in this case the dataset name just serves as a look-up-table key to a specific checkpoint path on the PL side...
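
As a hedged sketch with hypothetical paths, such a look-up table for the CPC identifiers above might be as simple as:

# Hypothetical look-up table; the real paths/URLs would live on the PL side.
CPC_CHECKPOINTS = {
    'resnet18-imagenet': 'checkpoints/cpc/resnet18-imagenet.ckpt',
    'resnet50-imagenet': 'checkpoints/cpc/resnet50-imagenet.ckpt',
    'resnet18-stl10': 'checkpoints/cpc/resnet18-stl10.ckpt',
}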

@ananyahjha93 (Contributor) commented Sep 12, 2020

@williamFalcon @Borda @nateraw I included this pattern in the latest AE and VAE commits to bolts. A few points that I realized:

  1. We could shift from_pretrained() into Lightning itself as a method to override.
  2. from_pretrained() needs to be an instance method, not a static method. In most cases, you will initialize the LightningModule with specific params according to the weights being loaded:

vae = VAE(input_height=32, first_conv=True)
vae = vae.from_pretrained('cifar10-resnet18')

In this example, stl10 weights have a different configuration for the encoder of the VAE. But the internal method loads with a strict=False flag, so users can still load stl10 weights into the cifar10 encoder configuration (see the sketch after this comment).

  3. Having this pattern allows us to test the correct loading of weights through the from_pretrained() function. @williamFalcon cases like the corrupt ImageNet weights for CPC will be caught automatically.

I have added all of this + tests for the AE and VAE classes I have updated for bolts.
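
For illustration, a minimal sketch of the instance-method variant with strict=False, reusing the hypothetical PRETRAINED_URLS registry from the earlier sketch (not the exact bolts implementation):

import torch
import pytorch_lightning as pl

class VAE(pl.LightningModule):
    ...

    def from_pretrained(self, identifier):
        # PRETRAINED_URLS is the hypothetical registry from the earlier sketch.
        url = PRETRAINED_URLS[identifier]
        checkpoint = torch.hub.load_state_dict_from_url(url, map_location='cpu')
        # strict=False skips missing/unexpected keys, so e.g. stl10 weights can
        # be loaded into a module built with the cifar10 encoder configuration.
        self.load_state_dict(checkpoint['state_dict'], strict=False)
        return self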
