Change any op implementation to be more similar to CPU #280

skotapati · 2023-02-02T00:45:46Z

Issue with any op implementation was responsible for segfaults seen in Inverse: #270. Changed how we handle empty input to match what is done on cpu side

aten/src/ATen/native/mps/operations/ReduceOps.mm

Co-authored-by: Ramin Azarmehr <razarmehr@apple.com>

kulinseth · 2023-02-04T01:29:13Z

test/test_mps.py

        'sgn': [torch.bool],
-        'linalg.inv': [torch.float32],
-        'linalg.inv_ex': [torch.float32],
+        # 'linalg.inv': [torch.float32],


Suggested change

# 'linalg.inv': [torch.float32],

kulinseth · 2023-02-04T01:29:29Z

test/test_mps.py

-        'linalg.inv': [torch.float32],
-        'linalg.inv_ex': [torch.float32],
+        # 'linalg.inv': [torch.float32],
+        # 'linalg.inv_ex': [torch.float32],


Suggested change

# 'linalg.inv_ex': [torch.float32],

* Fix addmm calculation: - ignore input when beta is 0, so that nan and inf will not be propagated. * - add is_beta_non_zero variable

- This should fix the hard crashes reported in BLOCKLIST_OP_GRAD such as sigmoid, tanh, masked_fill, linear, prelu, etc. - The prelu now fails with correctness issues so I kept it in the blocklist

- This should fix cross_entropy_backward() too - Take grad_output into account when computing nll_loss_backward() - Clean up

- Take grad_output into account when computing smooth_l1_backward() - Add numel()==0 check to prevent crashes - Clean up and formatting

- This patch moves several Grad tests related to FP16 precision from block list to FP16_LOW_PRECISION_LIST which should produce correct output.

- fix gelu_backward_out_mps key - uniform format - add caculation for tanh approximate backward pass - unblock grad test from blocklist - fix lintrunner errors

* Use DISTRIBUTED=1 for MPS CI runners * Disable openmp

Co-authored-by: sidk <sidk@sidks-mbp.lan>

…OUR, as zero to negative integer powers are undefined (#288)

- Add divisor to cached string key to avoid conflicts - Fallback to CPU when divisor is passed for int64 input types - Remove MaxPool2D from UNIMPLEMENTED_OP list (was put there by mistake) - Add test case when both ceil_mode and count_include_pad are True (previously failed).

Change any op implementation to be more similar to CPU

187f3fc

skotapati requested review from DenisVieriu97, Ronian526, razarmehr, shuhand0 and ssaladis February 2, 2023 00:45

skotapati requested a review from kulinseth as a code owner February 2, 2023 00:45

Added case to handle input tensor of size 1

59598d2

razarmehr reviewed Feb 2, 2023

View reviewed changes

aten/src/ATen/native/mps/operations/ReduceOps.mm Outdated Show resolved Hide resolved

sidk and others added 2 commits February 2, 2023 09:53

Added inverse fix to test if any change worked

fbe7e3e

Update aten/src/ATen/native/mps/operations/ReduceOps.mm

281bfd7

Co-authored-by: Ramin Azarmehr <razarmehr@apple.com>

kulinseth reviewed Feb 4, 2023

View reviewed changes

Ronian526 and others added 17 commits February 6, 2023 17:00

Fix addmm calculation: (#274)

eeff994

* Fix addmm calculation: - ignore input when beta is 0, so that nan and inf will not be propagated. * - add is_beta_non_zero variable

norm fp16: bump tol & unit tests from cuda (#240)

088b0c3

Fix crashes in several backward ops (#279)

fa23e50

- This should fix the hard crashes reported in BLOCKLIST_OP_GRAD such as sigmoid, tanh, masked_fill, linear, prelu, etc. - The prelu now fails with correctness issues so I kept it in the blocklist

Fix the crash in sgn_out_mps() with bool type (#282)

57c1d49

[MPS] Casting int64 to int32 for reduction ops. (#275)

f84387a

Fix grid sampler macos version check (#285)

0666754

Fix correctness issues with nll_loss_backward() (#289)

aaa6ce9

- This should fix cross_entropy_backward() too - Take grad_output into account when computing nll_loss_backward() - Clean up

Fix correctness issues with smooth_l1_loss() (#290)

ddf3469

- Take grad_output into account when computing smooth_l1_backward() - Add numel()==0 check to prevent crashes - Clean up and formatting

Fix FP16 precision issues for Grad tests (#281)

866a7d1

- This patch moves several Grad tests related to FP16 precision from block list to FP16_LOW_PRECISION_LIST which should produce correct output.

Fix gelu backward (#284)

c7ded9b

- fix gelu_backward_out_mps key - uniform format - add caculation for tanh approximate backward pass - unblock grad test from blocklist - fix lintrunner errors

Unblock nn.functional.layer_norm grad test. (#287)

ae9e8b5

Use DISTRIBUTED=1 for MPS CI runners (#292)

0638d0a

* Use DISTRIBUTED=1 for MPS CI runners * Disable openmp

Remove cumulative_trapezoid from blocklist (#291)

a11ad35

Co-authored-by: sidk <sidk@sidks-mbp.lan>

- move rpow torch.int16, torch.int32, torch.int64 to UNDEFINED_BEHAVI…

d403cff

…OUR, as zero to negative integer powers are undefined (#288)

test_to_non_contiguous fails (#283)

73b2168

Fix the crash in sgn_out_mps() with bool type (#282)

9ce751f

sidk and others added 2 commits February 6, 2023 17:01

Added inverse fix to test if any change worked

4d8a2b6

Inverse issue solved by MPSGraph change

2247466

skotapati closed this Feb 7, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Change any op implementation to be more similar to CPU #280

Change any op implementation to be more similar to CPU #280

Uh oh!

skotapati commented Feb 2, 2023

Uh oh!

Uh oh!

kulinseth Feb 4, 2023

Uh oh!

kulinseth Feb 4, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Change any op implementation to be more similar to CPU #280

Change any op implementation to be more similar to CPU #280

Uh oh!

Conversation

skotapati commented Feb 2, 2023

Uh oh!

Uh oh!

kulinseth Feb 4, 2023

Choose a reason for hiding this comment

Uh oh!

kulinseth Feb 4, 2023

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants