8029 update load old weights function for diffusion_model_unet.py #8031

Merged

Conversation

yiheng-wang-nv
Contributor

Fixes #8029.

Description

A few sentences describing the changes proposed in this pull request.

Types of changes

  • [x] Non-breaking change (fix or new feature that would not break existing functionality).
  • [ ] Breaking change (fix or new feature that would cause existing functionality to change).
  • [ ] New tests added to cover the changes.
  • [ ] Integration tests passed locally by running ./runtests.sh -f -u --net --coverage.
  • [ ] Quick tests passed locally by running ./runtests.sh --quick --unittests --disttests.
  • [ ] In-line docstrings updated.
  • [ ] Documentation updated, tested make html command in the docs/ folder.

Signed-off-by: Yiheng Wang <vennw@nvidia.com>
@yiheng-wang-nv
Contributor Author

Tested with this branch.
Code:

import torch
from generative.networks.nets import DiffusionModelUNet
from monai.networks.nets.diffusion_model_unet import DiffusionModelUNet as MonaiDiffusionModelUNet

# Configuration shared by the old and new networks.
input_params = {
    "spatial_dims": 2,
    "in_channels": 1,
    "out_channels": 1,
    "num_channels": [32, 64, 128, 256],
    "attention_levels": [False, True, True, True],
    "num_head_channels": [0, 32, 32, 32],
    "num_res_blocks": 2,
}

# Build the old network and load its saved weights.
old_network = DiffusionModelUNet(**input_params)
old_network.load_state_dict(torch.load("model.pt"))

# The MONAI network takes `channels` in place of `num_channels`;
# the two extra flags are set to False for this test.
new_params = input_params.copy()
new_params.pop("num_channels")
new_params["channels"] = input_params["num_channels"]
new_params["include_fc"] = False
new_params["use_combined_linear"] = False

# Build the new network and transfer the old weights into it.
new_network = MonaiDiffusionModelUNet(**new_params)
new_network.load_old_state_dict(old_network.state_dict())

# Random input and timestep, placed on the GPU.
test_input = torch.randn(1, 1, 64, 64).cuda()
timesteps = torch.randn(1).cuda()

old_network = old_network.cuda()
new_network = new_network.cuda()

# Evaluation mode, so both networks run deterministically.
old_network.eval()
new_network.eval()

with torch.no_grad():
    old_output = old_network(test_input, timesteps)
    new_output = new_network(test_input, timesteps)

# Largest element-wise difference between the two outputs.
print((old_output - new_output).abs().max())

Output:
[Screenshot of the output, 2024-08-20]
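
The parameter changes made in the test above can be gathered into one small helper. This is a minimal sketch and not part of MONAI: the function name `migrate_unet_params` is hypothetical, and the `include_fc=False` and `use_combined_linear=False` defaults are simply the values used in the test, applied only when the caller has not set them.

```python
from typing import Any, Dict


def migrate_unet_params(old_params: Dict[str, Any]) -> Dict[str, Any]:
    """Hypothetical helper: turn old-style constructor arguments into new-style ones."""
    new_params = dict(old_params)  # shallow copy, the input dict is left untouched
    # The new network expects `channels` where the old one used `num_channels`.
    if "num_channels" in new_params:
        new_params["channels"] = new_params.pop("num_channels")
    # Assumption: mirror the flags used in the test unless already provided.
    new_params.setdefault("include_fc", False)
    new_params.setdefault("use_combined_linear", False)
    return new_params


old_params = {
    "spatial_dims": 2,
    "in_channels": 1,
    "out_channels": 1,
    "num_channels": [32, 64, 128, 256],
    "attention_levels": [False, True, True, True],
    "num_head_channels": [0, 32, 32, 32],
    "num_res_blocks": 2,
}
new_params = migrate_unet_params(old_params)
print(sorted(new_params))
```

The returned dictionary could then be passed to the new network's constructor in place of the manual `pop` and assignment steps.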

@KumoLiu
Contributor

KumoLiu commented Aug 20, 2024

Hi @yiheng-wang-nv, thanks for the update. Could you please test https://github.com/Project-MONAI/MONAI/blob/dev/tests/test_diffusion_model_unet.py#L581? It looks like different branches lead to different blocks, and this test case does not seem to import correctly in this PR.

Signed-off-by: Yiheng Wang <vennw@nvidia.com>
…g-nv/MONAI into 8029-fix-difussion-load-error
@yiheng-wang-nv
Contributor Author

Thanks @KumoLiu. I updated the code to support both cases.
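
Loading weights saved under an older layout generally comes down to copying the entries whose names still match and renaming the rest. The sketch below illustrates that idea on plain dictionaries. It is not the code merged in this PR: the function `remap_state_dict`, the `renames` table, and every key name are hypothetical, and floats stand in for tensors.

```python
def remap_state_dict(old_state, new_keys, renames):
    """Hypothetical sketch: fill `new_keys` from `old_state`, renaming by suffix.

    `renames` maps a new-style key suffix to the old-style suffix it replaces.
    Returns (remapped, unused_old_keys, missing_new_keys).
    """
    remaining = dict(old_state)  # old entries not yet consumed
    remapped = {}
    for key in new_keys:
        if key in remaining:
            # Same name in both layouts: copy it over directly.
            remapped[key] = remaining.pop(key)
            continue
        for new_suffix, old_suffix in renames.items():
            if key.endswith(new_suffix):
                old_key = key[: -len(new_suffix)] + old_suffix
                if old_key in remaining:
                    remapped[key] = remaining.pop(old_key)
                    break
    missing = [k for k in new_keys if k not in remapped]
    return remapped, sorted(remaining), missing


# Illustrative names only; they are not taken from the PR.
renames = {
    ".attn.to_q.weight": ".to_q.weight",
    ".attn.out_proj.weight": ".proj_attn.weight",
}
old_state = {
    "conv_in.weight": 1.0,
    "block.0.to_q.weight": 2.0,
    "block.0.proj_attn.weight": 3.0,
    "legacy.extra": 4.0,
}
new_keys = [
    "conv_in.weight",
    "block.0.attn.to_q.weight",
    "block.0.attn.out_proj.weight",
    "block.0.attn.new_bias",
]
remapped, unused, missing = remap_state_dict(old_state, new_keys, renames)
print(remapped, unused, missing)
```

Returning the unused and missing keys makes it easy to check that a transfer covered everything, which is the kind of mismatch a test case with a different block layout would expose.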

Contributor

@KumoLiu KumoLiu left a comment

Thanks for the quick update, LGTM.

Co-authored-by: YunLiu <55491388+KumoLiu@users.noreply.github.com>
Signed-off-by: Yiheng Wang <68361391+yiheng-wang-nv@users.noreply.github.com>
@KumoLiu
Contributor

KumoLiu commented Aug 20, 2024

Hi @ericspod, could you please also help review this one? Thanks!

@KumoLiu
Contributor

KumoLiu commented Aug 20, 2024

/build

Member

@ericspod ericspod left a comment

This looks fine to me as it's passing your tests.

@ericspod ericspod merged commit cea80a6 into Project-MONAI:dev Aug 20, 2024
28 checks passed
rcremese pushed a commit to rcremese/MONAI that referenced this pull request Sep 2, 2024
…oject-MONAI#8031)

Fixes Project-MONAI#8029.

### Description

A few sentences describing the changes proposed in this pull request.

### Types of changes
- [x] Non-breaking change (fix or new feature that would not break
existing functionality).
- [ ] Breaking change (fix or new feature that would cause existing
functionality to change).
- [ ] New tests added to cover the changes.
- [ ] Integration tests passed locally by running `./runtests.sh -f -u
--net --coverage`.
- [ ] Quick tests passed locally by running `./runtests.sh --quick
--unittests --disttests`.
- [ ] In-line docstrings updated.
- [ ] Documentation updated, tested `make html` command in the `docs/`
folder.

---------

Signed-off-by: Yiheng Wang <vennw@nvidia.com>
Signed-off-by: Yiheng Wang <68361391+yiheng-wang-nv@users.noreply.github.com>
Co-authored-by: YunLiu <55491388+KumoLiu@users.noreply.github.com>
Development

Successfully merging this pull request may close these issues.

load_old_state_dict of DiffusionModelUNet is wrong
3 participants