Del ort_model._modules to forward its accessing to torch_model._modules #14563

guyang3532 · 2023-02-03T14:38:40Z

General Description

A missing '_modules' attribute in ORTModule causes load_state_dict to fail for a wrapped ORTModule.
The unit test 'test_load_state_dict_for_wrapped_ortmodule' did not catch this problem because it did not copy the state_dict,
so the two state_dicts shared the same memory.
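
To illustrate how shared memory can hide a failed load, here is a minimal torch-free sketch. `TinyModel`, `broken_load_state_dict`, and `perturb` are hypothetical names made up for this example and are not part of ORTModule or the actual unit test; a plain list stands in for a tensor whose storage is shared with the returned state_dict.

```python
import copy


class TinyModel:
    """Hypothetical stand-in for a module; a list plays the role of a tensor."""

    def __init__(self, weight):
        self.weight = list(weight)

    def state_dict(self):
        # Returns a reference to the live storage, not a snapshot.
        return {"weight": self.weight}

    def broken_load_state_dict(self, state_dict):
        # Deliberately a no-op: models a load that silently restores nothing.
        pass


def perturb(model):
    # Change the weights in place, as a training step would.
    for i in range(len(model.weight)):
        model.weight[i] += 1.0


model = TinyModel([1.0, 2.0])
shared = model.state_dict()                   # aliases model.weight
snapshot = copy.deepcopy(model.state_dict())  # independent copy

perturb(model)
model.broken_load_state_dict(shared)

# Comparing against the aliased dict passes even though nothing was loaded.
shared_check_passes = model.state_dict() == shared
# Comparing against the copied dict exposes the failed load.
snapshot_check_passes = model.state_dict() == snapshot
```

Under these assumptions, only the comparison against the deep-copied state_dict can tell whether the load actually restored the saved values.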

Motivation and Context

Reference: #7847

guyang3532 · 2023-02-03T16:22:45Z

I think a better solution would be to forward access of ORTModule._modules to TorchModule._modules to keep them consistent, rather than just copying it. But I have not figured out a good implementation. Do you have any suggestions? @baijumeswani @pengwa
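
One possible way to forward the access, sketched without torch: delete the wrapper's own `_modules` attribute so that normal lookup fails and `__getattr__` delegates to the wrapped object. `Inner` and `Wrapper` are hypothetical classes for illustration only; this is not necessarily how ORTModule implements it.

```python
class Inner:
    """Hypothetical wrapped object that owns the real _modules dict."""

    def __init__(self):
        self._modules = {"fc": "inner-fc"}


class Wrapper:
    """Hypothetical wrapper that forwards _modules instead of copying it."""

    def __init__(self, inner):
        # Stands in for the empty dict a base-class __init__ would create.
        self._modules = {}
        self._inner = inner
        # Remove the wrapper's own copy so lookups fall through to __getattr__.
        del self._modules

    def __getattr__(self, name):
        # Only called when normal attribute lookup fails.
        if name == "_inner":
            # Avoid infinite recursion if _inner has not been set yet.
            raise AttributeError(name)
        return getattr(self._inner, name)


inner = Inner()
wrapper = Wrapper(inner)

# Later changes to the inner dict are visible through the wrapper,
# which a one-time copy would not guarantee.
inner._modules["extra"] = "inner-extra"
```

Because the wrapper holds no `_modules` of its own, there is a single dict and the two views cannot drift apart.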

orttraining/orttraining/python/training/ortmodule/ortmodule.py

baijumeswani · 2023-02-03T17:25:08Z

ORTModule.load_state_dict already forwards the call to the underlying torch model. Does that not work?

guyang3532 · 2023-02-07T09:36:56Z

> ORTModule.load_state_dict already forwards the call to the underlying torch model. Does that not work?

As you described in #7847, it does not work, because load_state_dict does not recursively call load_state_dict on its children; instead, it defines its own inner function load (inside load_state_dict) that does this task.
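
A simplified, torch-free model of that behavior is sketched below. `TinyModule` and `Wrapper` are hypothetical classes, and the nested `load` only imitates the traversal pattern described above (walking `_modules` directly); it is not the PyTorch source.

```python
class TinyModule:
    """Hypothetical stand-in for nn.Module with only the state this demo needs."""

    def __init__(self):
        self._parameters = {}
        self._modules = {}

    def load_state_dict(self, state_dict):
        loaded = []

        def load(module, prefix=""):
            # Load this module's own parameters...
            for name in module._parameters:
                key = prefix + name
                if key in state_dict:
                    module._parameters[name] = state_dict[key]
                    loaded.append(key)
            # ...then recurse through _modules directly. A child's own
            # load_state_dict override is never invoked on this path.
            for child_name, child in module._modules.items():
                load(child, prefix + child_name + ".")

        load(self)
        return loaded


class Wrapper(TinyModule):
    """Hypothetical wrapper that overrides load_state_dict but has empty _modules."""

    def __init__(self, inner):
        super().__init__()
        self.inner = inner
        self.override_called = False

    def load_state_dict(self, state_dict):
        self.override_called = True
        return self.inner.load_state_dict(state_dict)


fc = TinyModule()
fc._parameters["weight"] = 0.0
inner = TinyModule()
inner._modules["fc"] = fc
wrapper = Wrapper(inner)
outer = TinyModule()
outer._modules["net"] = wrapper

state = {"net.fc.weight": 1.0}

# The wrapper's _modules is empty, so the traversal stops at the wrapper.
loaded_before = outer.load_state_dict(state)

# Once the wrapper exposes the inner _modules, the traversal reaches fc.
wrapper._modules = inner._modules
loaded_after = outer.load_state_dict(state)
```

In this model the wrapper's override is bypassed whenever the wrapper is a child of another module, so the inner parameters are reached only if the wrapper's `_modules` reflects the wrapped module's children.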

guyang3532 changed the title ~~Forward access of ort_model._modules to torch_model._modules~~ [draft]Forward access of ort_model._modules to torch_model._modules Feb 3, 2023

guyang3532 force-pushed the fix6 branch from c83148b to 5775009 Compare February 3, 2023 16:02

guyang3532 changed the title ~~[draft]Forward access of ort_model._modules to torch_model._modules~~ [draft]set ort_model._modules to torch_model._modules Feb 3, 2023

baijumeswani added the training issues related to ONNX Runtime training; typically submitted using template label Feb 3, 2023

baijumeswani reviewed Feb 3, 2023

guyang3532 force-pushed the fix6 branch 2 times, most recently from 74388e9 to 0d46c2a Compare February 7, 2023 09:29

guyang3532 changed the title ~~[draft]set ort_model._modules to torch_model._modules~~ Del ort_model._modules to foward it to torch_model._modules Feb 7, 2023

guyang3532 changed the title ~~Del ort_model._modules to foward it to torch_model._modules~~ Del ort_model._modules to foward its accessing to torch_model._modules Feb 7, 2023

baijumeswani previously approved these changes Feb 7, 2023

guyang3532 dismissed baijumeswani’s stale review via 7998742 February 11, 2023 15:13

guyang3532 force-pushed the fix6 branch from 0d46c2a to 7998742 Compare February 11, 2023 15:13

baijumeswani previously approved these changes Feb 11, 2023

guyang3532 dismissed baijumeswani’s stale review via caae194 February 23, 2023 05:41

guyang3532 force-pushed the fix6 branch from 7998742 to caae194 Compare February 23, 2023 05:41

Forward access of ort_model._modules to torch_model._modules

d15ded5

guyang3532 force-pushed the fix6 branch from caae194 to d15ded5 Compare February 27, 2023 09:08

baijumeswani approved these changes Feb 27, 2023

guyang3532 merged commit c49f250 into microsoft:main Mar 3, 2023

mszhanyi pushed a commit that referenced this pull request Mar 9, 2023

Del ort_model._modules to foward its accessing to torch_model._modules (#14563)

7c0d55b

Missing '_modules' attribute in ORTModule will cause load_state_dict for wrapped_ortmodule fail. reference:#7847
