Fashion MNIST using IMPROVE unified interface #108
base: develop
Conversation
# Get the data directory, batch size and other hyperparameters from params
## IMPROVE
batch_size = params["batch_size"]
learning_rate = params["learning_rate"]
Why are these parameters for inference?
Sorry, but it is unclear why you must load the data for inference in batches. Is this in any way faster than a simple

for v in file:
    label = infer(v)
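For reference, a runnable version of that loop might look like the sketch below; image_paths, transform, and model are hypothetical stand-ins for objects the script would already define:

import torch
from PIL import Image

model.eval()
with torch.no_grad():  # no gradients needed for inference
    for path in image_paths:  # hypothetical list of image files
        x = transform(Image.open(path)).unsqueeze(0)  # add a batch dimension of 1
        label = model(x).argmax(dim=1).item()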
There should be a better way of loading the model. We want the model from the input directory. No optimizer is needed. If this is a problem, I suggest writing a load_model_weights function as a wrapper.
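A minimal sketch of such a wrapper, assuming the weights were saved with torch.save (the "model_state_dict" checkpoint key is an assumption):

import torch

def load_model_weights(model, weights_path, device="cpu"):
    # Load saved weights into an existing model; no optimizer required.
    state = torch.load(weights_path, map_location=device)
    if isinstance(state, dict) and "model_state_dict" in state:
        state = state["model_state_dict"]  # unwrap a full checkpoint, if present
    model.load_state_dict(state)
    model.eval()
    return model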
# NOTE: using false now for data loading
testset = torchvision.datasets.FashionMNIST(root=dataset_dir, train=False, download=False, transform=transform)
testloader = torch.utils.data.DataLoader(testset, batch_size=batch_size, shuffle=True)
Loading test data for inference? I would expect to load the model weights.
See line 72: how are you going to get images to test?
outputs = model(images)
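A minimal sketch of the batched prediction loop under discussion, reusing model, device, and testloader from the hunks in this PR:

import torch

model.eval()
predictions = []
with torch.no_grad():
    for images, _ in testloader:  # labels are ignored: label prediction only
        outputs = model(images.to(device))
        predictions.extend(outputs.argmax(dim=1).cpu().tolist())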
I don't understand, or is this a problem of naming conventions? We are doing label prediction in this script, not testing. Do you have a specific use case in mind?
@rajeeja Any thoughts?
model = Net().to(device)

# Define optimizer
optimizer = optim.SGD(model.parameters(), lr=learning_rate, momentum=momentum)
Same here
Same as above :) We can get the model weights in some other fashion; do you know how else to get the model weights to perform inference? Where the optimizer or learning rate is not needed, it is a minor thing and can be ignored, IMO. The overall logic is to get the model weights from the training step and run inference with them.
I agree: get the model weights and infer on any input data in the input data directory. The model weights should be located there as well. The outputs of training are model weights and learning metrics.
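A sketch of that flow for the inference script; the "input_dir" params key and the weights filename are assumptions, not the actual IMPROVE parameter names:

import os
import torch

weights_path = os.path.join(params["input_dir"], "model_weights.pt")  # hypothetical key and filename
model = Net().to(device)
model.load_state_dict(torch.load(weights_path, map_location=device))
model.eval()  # no optimizer or learning rate needed for inference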
## IMPROVE
# Note some of these are similar to the previous section and may be adjusted as per model requirements
model_infer_params = [
Which parameter is for loading model weights?
I'm loading it from the ckpt files, so there is no specific parameter.
This can be done by using a specific directory to save the model weights and loading from there, something along the lines of test_ml_data_dir.
## IMPROVE
def run(params):
##
# # Define transformations for data preprocessing
If this is for data preprocessing, please move it to the preprocessing script.
Training uses the 60k images, but there are 10k images we need to infer on; those we have to get here. This is just loading, not the entire preprocessing.
loss = running_loss / len(trainloader)
ckpt.ckpt_epoch(epoch, loss)
print('Training finished.')
I don't see where you export the final weights.
ckpt does that for you; if you run it once, you will see it in the directory. See ckpt_epoch at line 94 and the documentation here: https://candle-lib.readthedocs.io/en/latest/api_ckpt_pytorch_utils/_autosummary/candle.ckpt_pytorch_utils.CandleCkptPyTorch.html
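Based on that documentation, the checkpointing flow is roughly the sketch below; the set_model/restart calls follow the linked CANDLE docs, and the exact signatures and keys should be treated as assumptions rather than confirmed by this diff:

import candle

ckpt = candle.CandleCkptPyTorch(params)
ckpt.set_model({"model": model, "optimizer": optimizer})
initial_epoch = 0
J = ckpt.restart(model)  # resume from an existing checkpoint, if any
if J is not None:
    initial_epoch = J["epoch"]

for epoch in range(initial_epoch, num_epochs):
    # ... training loop ...
    ckpt.ckpt_epoch(epoch, loss)  # writes checkpoints under the configured save directory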
The final weights should be in the top-level output directory.
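One way to satisfy that, as a sketch (the "output_dir" params key and the filename are assumptions): after the training loop, export the final state_dict explicitly, independent of the per-epoch checkpoints:

import os
import torch

out_path = os.path.join(params["output_dir"], "model_weights.pt")  # hypothetical key and filename
torch.save(model.state_dict(), out_path)
print(f"Final weights written to {out_path}")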
@rajeeja @wilke The code structure doesn't follow the IMPROVE structure, so it's hard to go through the code and fix everything. https://jdacs4c-improve.github.io/docs/content/unified_interface.html
That link is not helpful. The data is different: this model doesn't use genetics or drug data, so the structure won't be exactly the same. The .py files are very simple and easy to understand. Also, the notebook is very standard Fashion-MNIST.
Demo example for new community models.
This is a first draft that uses IMPROVE to initialize parameters and CANDLE for checkpointing.