
Adding Image Dataloaders and Flax Resnet18 model #1

Merged · 7 commits into main from dataloading · Jun 13, 2022

Conversation

@shreyaspadhy (Owner) commented Jun 10, 2022

Adding the first combined snippets of code to this repo! This PR contains:

  1. Image dataloaders from learning-invariances, with definitions of the default transformations.
  2. A big METADATA dict in jaxutils.data.image that holds all the necessary mean, std, size, and num_datapoints information for all datasets (see the sketch after this list).
  3. A Flax model definition of ResNet18, plus a conversion script that converts PyTorch resnet18 models from bayesian-lottery-tickets to Flax.
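
For illustration, a minimal sketch of the shape such a METADATA dict might take. The key names, the Imagenet entry, and the CIFAR statistics below are assumptions for this example; the actual definition lives in jaxutils.data.image.

METADATA = {
    # Assumed key names; the exact schema is defined in jaxutils.data.image.
    'image_shape': {
        'CIFAR10': (32, 32, 3),
        'CIFAR100': (32, 32, 3),
        'Imagenet': (224, 224, 3),
    },
    'num_train': {
        'CIFAR10': 50_000,
        'CIFAR100': 50_000,
    },
    'num_test': {
        'CIFAR10': 10_000,
        'CIFAR100': 10_000,
    },
    'mean': {
        # Commonly used per-channel CIFAR-10 statistics, shown as an example.
        'CIFAR10': (0.4914, 0.4822, 0.4465),
    },
    'std': {
        'CIFAR10': (0.2470, 0.2435, 0.2616),
    },
}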

@shreyaspadhy shreyaspadhy removed the request for review from JamesAllingham June 13, 2022 11:05
@shreyaspadhy shreyaspadhy changed the title from "Adding Image Dataloaders" to "Adding Image Dataloaders and Flax Resnet18 model" Jun 13, 2022
@JamesAllingham (Collaborator) left a comment

Mostly LGTM! Just a few minor suggestions/questions.

Also, I assume you didn't mean to upload the __pycache__ folder?

And I think img_resnets.py can just be renamed to resnets.py, since a) there are no other kinds of resnets (yet?), and b) by default I think it is assumed that resnets will be the convolutional kind. But that is also minor.

data/image.py Outdated
@@ -0,0 +1,227 @@
"""Image dataset functionality, borrowed from https://github.com/JamesAllingham/learning-invariances/blob/main/src/data/image.py."""
@JamesAllingham (Collaborator):

It isn't really necessary to put this here! As I mentioned, this is code that has been adapted by me and Javi over the course of our PhDs, so this repo isn't really the origin.

@shreyaspadhy (Owner, Author):

got it!

'CIFAR10': 10_000,
'CIFAR100': 10_000,
},
'mean': {
@JamesAllingham (Collaborator):

Nice addition!

}


TRAIN_TRANSFORMATIONS = {
@JamesAllingham (Collaborator):

This is also good!

data/image.py Outdated
if flatten_img:
common_transforms += [Flatten()]

# Important when fitting linear model and sample-then-optimise
@JamesAllingham (Collaborator):

Is this just to say that you need augmentations for this project, or is there something more to this comment? :)

@shreyaspadhy (Owner, Author):

I added some more detail. Basically, we must not use augmented train data when fitting the linear model or when sampling from the posterior. This was actually a bug I'd chased down: if we augment, the sampling problem is misspecified and we never converge.

if perform_augmentations:
train_augmentations = TRAIN_TRANSFORMATIONS[dataset_name]
else:
train_augmentations = TEST_TRANSFORMATIONS[dataset_name]
@JamesAllingham (Collaborator):

For the non-ImageNet cases this makes sense, since the test augmentations are empty, but for ImageNet does it make sense to apply transformations when the user of the function has set perform_augmentations=False?

@shreyaspadhy (Owner, Author):

The reason we need Resize(256) and CenterCrop(224) for ImageNet is that, by default, ImageNet test images come in varying sizes rather than a uniform one. So we still need some deterministic preprocessing to ensure every image ends up at 224x224x3.
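
For context, a minimal sketch of the kind of deterministic test-time preprocessing being described here, using torchvision; the normalisation statistics are the commonly used ImageNet values and are an assumption, not copied from this PR.

from torchvision import transforms

# Deterministic preprocessing so every ImageNet test image ends up at 224x224x3.
imagenet_test_transform = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
    # Assumed statistics (standard ImageNet values), not taken from this PR.
    transforms.Normalize(mean=(0.485, 0.456, 0.406),
                         std=(0.229, 0.224, 0.225)),
])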

data/image.py Outdated
random_seed: the `int` random seed for splitting the val data and
applying random affine transformations. (Default: 42)
perform_augmentations: a `bool` indicating whether to apply random
affine transformations to the training data. (Default: `True`)
@JamesAllingham (Collaborator):

This docstring isn't exactly accurate, since the augmentations are not limited to affine transformations?

@shreyaspadhy (Owner, Author):

Good point, affine is wrong there. Made the change.

y = self.conv(self.filters, (3, 3), padding=((1, 1), (1, 1)))(y)
# ^ Not using Flax default padding since it doesn't match PyTorch

# For pretrained bayesian-lottery-tickets models, don't init with 0 here.
@JamesAllingham (Collaborator):

Maybe this should be a flag that can be set by the user (if it is specific to BLT models)?

@shreyaspadhy (Owner, Author):

Will do this when I add the torchhub model conversion fns that you had.
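
For illustration, a rough sketch of the kind of user-settable flag being suggested. The excerpt does not identify which parameter the zero-init applies to, so this assumes it is the scale of the block's final normalisation layer; the block below is a simplified residual block, not the PR's ResNet18 code.

import flax.linen as nn
from flax.linen.initializers import ones, zeros


class ResidualBlock(nn.Module):
    """Simplified residual block illustrating a user-settable zero-init flag."""
    filters: int
    # Hypothetical flag: pretrained bayesian-lottery-tickets checkpoints
    # would construct the model with zero_init_last_scale=False.
    zero_init_last_scale: bool = True

    @nn.compact
    def __call__(self, x, train: bool = True):
        y = nn.Conv(self.filters, (3, 3), padding=((1, 1), (1, 1)), use_bias=False)(x)
        y = nn.BatchNorm(use_running_average=not train)(y)
        y = nn.relu(y)
        y = nn.Conv(self.filters, (3, 3), padding=((1, 1), (1, 1)), use_bias=False)(y)
        scale_init = zeros if self.zero_init_last_scale else ones
        y = nn.BatchNorm(use_running_average=not train, scale_init=scale_init)(y)
        # Identity shortcut only; projection/stride handling is omitted for brevity.
        return nn.relu(x + y)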

models/lenets.py Outdated
return x


class LeNetSmall(nn.Module):
@JamesAllingham (Collaborator):

Since this is very similar to LeNet, with only the sizes changing, I think this is a good place to use inheritance to reduce code duplication. Specifically, I'd change LeNet to also have a separate setup and __call__, and then, when defining LeNetSmall, inherit from LeNet and simply override the setup.

@shreyaspadhy (Owner, Author):

Sounds good! One question about inheritance: since I'm redefining setup for LeNetSmall, I'll have to redefine self.dense here too, even though it is the same. Is there a way to overwrite only a subset of class attributes after inheriting within Flax?

@JamesAllingham (Collaborator):

Hmmm, maybe you can call super().setup() in your inherited class's setup function and then redefine only the attributes which need to change. However, I think this is something I've tried before without luck. You'll probably have to bite the bullet and accept some code duplication, I'm afraid.

@JamesAllingham (Collaborator):

To be clear, you can definitely call super().setup() and then add new attributes; however, I think an error is thrown when you try to overwrite attributes which have been defined in the parent's setup.

@JamesAllingham (Collaborator) commented Jun 13, 2022:

OK, I've confirmed that redefining attributes when calling super().setup() inside the child class's setup() doesn't work. E.g., trying this:

import flax.linen as nn

class MyNet(nn.Module):

    def setup(self):
        self.fc2 = nn.Dense(10)
        self.fc1 = nn.Dense(10)

    def __call__(self, x):
        return self.fc2(self.fc1(x))

class MyNet2(MyNet):

    def setup(self):
        super().setup()

        # Attempt to override the parent's fc2 with a different size.
        self.fc2 = nn.Dense(333)

    def __call__(self, x):
        return self.fc2(self.fc1(x))

will result in this error when trying to call init:

ValueError: Duplicate use of scope name: "fc2"

However, if you really want to, you can avoid the code duplication like this:

class MyNet(nn.Module):

    def _partial_setup(self):
        self.fc2 = nn.Dense(10)

    def setup(self):
        self._partial_setup()
        self.fc1 = nn.Dense(10)

    def __call__(self, x):
        return self.fc2(self.fc1(x))


class MyNet2(MyNet):

    def _partial_setup(self):
        # Only the attribute that changes is redefined in the child.
        self.fc2 = nn.Dense(333)

    def __call__(self, x):
        return self.fc2(self.fc1(x))

But I think in this case it is better to have the duplicated code, since it is only 1 line.

@shreyaspadhy (Owner, Author):

Thanks for trying these out, this is super useful to know!

models/lenets.py Outdated
self.conv2 = conv3_block(32, stride=2)
self.conv3 = conv3_block(32, stride=2)

@nn.compact
@JamesAllingham (Collaborator):

This decorator should be removed if you have a setup method.

@shreyaspadhy (Owner, Author):

Is there a way in Flax to mix and match? For example, if I define self.conv{i} in the setup, can I still compactly define an nn.Dense inside __call__ without mentioning it in the setup? I'm curious about the syntax here.

@JamesAllingham (Collaborator):

Currently, it actually doesn't break anything to mix and match setup and compact; however, as far as I understand, this is not advertised functionality and there are plans to remove it in future: google/flax#2018.

Why do you actually want to mix and match, though? It seems to me that everything could be in setup here.
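
For reference, a minimal sketch of keeping everything in setup rather than mixing in @nn.compact; the module name, layer sizes, and shapes here are illustrative assumptions, not taken from models/lenets.py.

import flax.linen as nn


class SmallConvNet(nn.Module):
    """Illustrative module with every submodule declared in setup."""

    def setup(self):
        # No @nn.compact on __call__, since all layers are declared here.
        self.conv1 = nn.Conv(32, (3, 3), strides=(2, 2))
        self.conv2 = nn.Conv(32, (3, 3), strides=(2, 2))
        self.conv3 = nn.Conv(32, (3, 3), strides=(2, 2))
        self.dense = nn.Dense(10)

    def __call__(self, x):
        for conv in (self.conv1, self.conv2, self.conv3):
            x = nn.relu(conv(x))
        x = x.reshape((x.shape[0], -1))  # flatten before the classifier head
        return self.dense(x)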

@shreyaspadhy (Owner, Author):

Ah, good to know! I wanted to be cheeky and define the Dense layer inside __call__, but I'll avoid that since it's not the preferred behaviour.

@shreyaspadhy shreyaspadhy merged commit 23fdd4e into main Jun 13, 2022
@shreyaspadhy shreyaspadhy deleted the dataloading branch October 10, 2022 19:23