Central function to canonicalize state dicts #40

RunDevelopment · 2023-11-22T14:17:13Z

I wanted to add support for another arch today and noticed the pretrained models are checkpoints saved as .pth files. Since they are .pth files, our code for simplifying .ckpt files does not run, and the loaded state dict is a mess.

So I combined the code for cleaning up .ckpt files and the code for unwrapping nested dicts into one function: canonicalize_state_dict. The job of this function to bring all state dicts into a common form.

Open question: Should this function be public? The load functions of individual archs expect a canonicalized state dict, so users must go through an ArchRegistry if they load .pth (or similar) files themselves. Passing model.state_dict() into a load function will continue to work though.

joeyballentine · 2023-11-22T14:37:16Z

Should this function be public?

Sure I guess. I don't see why it shouldn't be

RunDevelopment · 2023-11-22T14:58:49Z

Another question: Should ModelLoader.load_state_dict_from_file return a canonicalized state dict? Yes, no, should there be a parameter canonicalized: bool, what should the default for that parameter be?

I would like the following code to always work:

state = ModelLoader().load_state_dict_from_file(file)
model = SomeArch.load(state)

So I think we should make the function load_state_dict_from_file(self, path: str | Path, canonicalized: bool = True). What do you think?

joeyballentine · 2023-11-22T15:29:32Z

If you wanna load a state dict without doing anything special, you can just use torch.load(). Our stuff should always return usable state dicts, otherwise what's the point.

At least, that's my opinion

RunDevelopment · 2023-11-22T15:31:17Z

We do handle different file formats, but I agree with your argument. Then let's just say that load_state_dict_from_file always returns a canonicalized state dict. We can add a parameter to control this behavior later if needed.

RunDevelopment added 2 commits November 22, 2023 15:05

Central function to canonicalize state dicts

5e50fec

fixes

5294317

Make public

ffcaf26

Return a canonicalized state dict

5725272

joeyballentine approved these changes Nov 22, 2023

View reviewed changes

joeyballentine merged commit 49f4494 into main Nov 22, 2023
7 checks passed

joeyballentine deleted the canonicalize_state_dict branch November 22, 2023 15:41

RunDevelopment mentioned this pull request Mar 25, 2024

Error-tolerant loading for .pt files #215

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Central function to canonicalize state dicts #40

Central function to canonicalize state dicts #40

RunDevelopment commented Nov 22, 2023

joeyballentine commented Nov 22, 2023

RunDevelopment commented Nov 22, 2023

joeyballentine commented Nov 22, 2023

RunDevelopment commented Nov 22, 2023 •

edited

Loading

Central function to canonicalize state dicts #40

Central function to canonicalize state dicts #40

Conversation

RunDevelopment commented Nov 22, 2023

joeyballentine commented Nov 22, 2023

RunDevelopment commented Nov 22, 2023

joeyballentine commented Nov 22, 2023

RunDevelopment commented Nov 22, 2023 • edited Loading

RunDevelopment commented Nov 22, 2023 •

edited

Loading