[ONNX] Add training mode support for BatchNormalization op #3597

vivekkhandelwal1 · 2024-08-06T13:13:28Z

This commit extends the OnnxToTorch lowering for BatchNormalization op for supporting the case when training=True.

Signed-Off By: Vivek Khandelwal vivekkhandelwal1424@gmail.com

vivekkhandelwal1 · 2024-08-06T13:15:41Z

Fixes nod-ai/SHARK-ModelDev#285 (comment)

lib/Conversion/TorchOnnxToTorch/DefaultDomainAtoF.cpp

zjgarvey

This does not pass e2e testing for the following onnx node tests due to a significant numerics mismatch:

"test_batchnorm_epsilon_training_mode"
"test_batchnorm_example_training_mode"

Please try to debug the numerics mismatching when you get the chance. You can either use this branch of torch-mlir in your iree-build and run the iree_tests, or you can use the alt_e2eshark and run python run.py --torchtolinalg -t test_batch.

zjgarvey · 2024-08-06T17:39:48Z

From a glance, it looks like the numerics for Y and running_var are off, but running_mean seems to match pretty well.

This commit extends the OnnxToTorch lowering for BatchNormalization op for supporting the case when training=True. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>

vivekkhandelwal1 · 2024-08-09T15:30:08Z

This does not pass e2e testing for the following onnx node tests due to a significant numerics mismatch:

"test_batchnorm_epsilon_training_mode" "test_batchnorm_example_training_mode"

Please try to debug the numerics mismatching when you get the chance. You can either use this branch of torch-mlir in your iree-build and run the iree_tests, or you can use the alt_e2eshark and run python run.py --torchtolinalg -t test_batch.

Hi @zjgarvey, the accuracy issue is fixed. After a lot of debugging, it turned out that the unbiased has to be set to False, instead of True.

vivekkhandelwal1 · 2024-08-12T12:22:11Z

@zjgarvey Can you please review this PR, today?

zjgarvey

As long as this is passing numerics, the implementation is clear and well-commented on. Thanks, Vivek.

vivekkhandelwal1 requested review from renxida, AmosLewis and zjgarvey August 6, 2024 13:13

vivekkhandelwal1 mentioned this pull request Aug 6, 2024

BatchNormalization nod-ai/SHARK-ModelDev#285

Closed

zjgarvey reviewed Aug 6, 2024

View reviewed changes

lib/Conversion/TorchOnnxToTorch/DefaultDomainAtoF.cpp Show resolved Hide resolved

zjgarvey requested changes Aug 6, 2024

View reviewed changes

[ONNX] Add training mode support for BatchNormalization op

c7ef086

This commit extends the OnnxToTorch lowering for BatchNormalization op for supporting the case when training=True. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>

vivekkhandelwal1 force-pushed the onnx-batchnorm branch from 3d1a486 to c030a0b Compare August 9, 2024 15:29

Set unbiased to False as Onnx need

6ed8a18

vivekkhandelwal1 force-pushed the onnx-batchnorm branch from c030a0b to 6ed8a18 Compare August 9, 2024 16:05

vivekkhandelwal1 requested a review from zjgarvey August 9, 2024 16:05

AmosLewis mentioned this pull request Aug 10, 2024

[tracking] HF CNN fp32 model tests nod-ai/SHARK-ModelDev#801

Open

30 tasks

zjgarvey approved these changes Aug 13, 2024

View reviewed changes

vivekkhandelwal1 merged commit 4a0bed0 into llvm:main Aug 14, 2024
3 checks passed

jinchen62 mentioned this pull request Aug 29, 2024

[Tracker] All the issue related with e2e shark test suite nod-ai/SHARK-ModelDev#812

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ONNX] Add training mode support for BatchNormalization op #3597

[ONNX] Add training mode support for BatchNormalization op #3597

vivekkhandelwal1 commented Aug 6, 2024

vivekkhandelwal1 commented Aug 6, 2024 •

edited

Loading

zjgarvey left a comment

zjgarvey commented Aug 6, 2024

vivekkhandelwal1 commented Aug 9, 2024

vivekkhandelwal1 commented Aug 12, 2024

zjgarvey left a comment

[ONNX] Add training mode support for BatchNormalization op #3597

[ONNX] Add training mode support for BatchNormalization op #3597

Conversation

vivekkhandelwal1 commented Aug 6, 2024

vivekkhandelwal1 commented Aug 6, 2024 • edited Loading

zjgarvey left a comment

Choose a reason for hiding this comment

zjgarvey commented Aug 6, 2024

vivekkhandelwal1 commented Aug 9, 2024

vivekkhandelwal1 commented Aug 12, 2024

zjgarvey left a comment

Choose a reason for hiding this comment

vivekkhandelwal1 commented Aug 6, 2024 •

edited

Loading