Develop upstream sync 241105 #2744

mmakevic-amd · 2024-11-05T14:41:38Z

No description provided.

PiperOrigin-RevId: 691456562

PiperOrigin-RevId: 691468873

This can be reverted to getCurrentVersion once frameworks release with the fix in: openxla/xla@2f99455 Until then, a plugin that is newer than its framework will error on serialization, this feature was added in 1.7.X, so pinning to 1.7.0 should be safe. PiperOrigin-RevId: 691471627

PiperOrigin-RevId: 691480719

…-with-null-data-for-simple-dynamic-buffer PiperOrigin-RevId: 691481168

…ling test cases. Only test cases breaking on CPU are related to: - pure callbacks - export - shard alike Note that `layout_test` is broken on TPU, leaving a comment saying to enable it. Also fixed `shard_map_test` test that was broken when running Shardy on one TPU, and `aot_test` which was breaking due to calling a different C++ StableHLO compilation function. PiperOrigin-RevId: 691496997

…to stay consistent with hermetic CUDA PiperOrigin-RevId: 691506371

PiperOrigin-RevId: 691516394

…low/compiler/xla/service/spmd/shardy/mhlo_round_trip/export_shardings.cc. PiperOrigin-RevId: 691525161

PiperOrigin-RevId: 691528885

PiperOrigin-RevId: 691539371

PiperOrigin-RevId: 691565219

2. Set tasks to error (and don't disconnect tasks) during a failed shutdown to avoid silent reconnects) PiperOrigin-RevId: 691586482

Updates LLVM usage to match [4ba623f24479](llvm/llvm-project@4ba623f24479) PiperOrigin-RevId: 691589403

…til.h/cc PiperOrigin-RevId: 691591073

PiperOrigin-RevId: 691602092

PiperOrigin-RevId: 691648381

And fix version check for Dispatch API PiperOrigin-RevId: 691663743

PiperOrigin-RevId: 691667184

PiperOrigin-RevId: 691667192

PiperOrigin-RevId: 691675947

PiperOrigin-RevId: 691681901

PiperOrigin-RevId: 691692633

PiperOrigin-RevId: 691718921

PiperOrigin-RevId: 691720760

PiperOrigin-RevId: 691720775

PiperOrigin-RevId: 691739227

…introduced in `c3d5769` Imported from GitHub PR openxla/xla#18860 Copybara import of the project: -- 1b61efe8270e67140199bbbb70665955fbaa6656 by Harsha HS <Harsha.HavanurShamsundara@amd.com>: [ROCm] Remove IsEmpty check for execution_order introduced in c3d5769 Merging this change closes tensorflow#18860 PiperOrigin-RevId: 691744865

These functions are not declared in any header file, hence they should have internal linkage. PiperOrigin-RevId: 691746918

PiperOrigin-RevId: 691750937

…legacy emitters. The default lowering in the MLIR repo is not stable for small imag(arg). PiperOrigin-RevId: 693262812

PiperOrigin-RevId: 693264535

… efficently PiperOrigin-RevId: 693265606

The test failures are due to the fact that the names of kernels from CUDA are not deterministic. PiperOrigin-RevId: 693272080

…il out if it fails. Add a large number of tests extracted from triton_fusion_emitter_device_legacy_test. PiperOrigin-RevId: 693278160

PiperOrigin-RevId: 693279863

PiperOrigin-RevId: 693283395

…ir/tensorflow/transforms/executor_island_coarsening.cc PiperOrigin-RevId: 693288112

PiperOrigin-RevId: 693289401

Imported from GitHub PR openxla/xla#18948 Copybara import of the project: -- 80e717c39e8a120cca974dca9f473d817d3a3457 by Ilia Sergachev <isergachev@nvidia.com>: [GPU][NFC] Improve error messages. Merging this change closes tensorflow#18948 PiperOrigin-RevId: 693291127

…/compiler/mlir/tosa/transforms/convert_tfl_uint8.cc. PiperOrigin-RevId: 693300221

…s much as possible. This is particularly useful in FSDP/HSDP where gradient propagation can be done fully in the i+1th iteration. It takes the responsibility of the user to set the `xla_gpu_all_reduce_combine_threshold_bytes` by themselves. PiperOrigin-RevId: 693304915

…sync-241105

mmakevic-amd · 2024-11-06T12:31:26Z

retest gpu-pycpp please

This reverts commit 1c5d7d4.

changm and others added 30 commits October 30, 2024 10:01

Internal CI/CD change

563dd9c

PiperOrigin-RevId: 691456562

Add LoadSerializedExecutable to TfrtCpuClient.

79dd13c

PiperOrigin-RevId: 691468873

tf2xla: Verify the MLIR we're about to convert to a string.

6e93dbb

PiperOrigin-RevId: 691480719

Merge pull request tensorflow#77167 from cdesouza-chromium:fix-memcpy…

0775507

…-with-null-data-for-simple-dynamic-buffer PiperOrigin-RevId: 691481168

Create implicit_cuda_headers_dependency alias in non-hermetic CUDA …

6f7945f

…to stay consistent with hermetic CUDA PiperOrigin-RevId: 691506371

Rename APIs consistently with new LiteRT naming convention

dea7fc1

PiperOrigin-RevId: 691516394

Cleanup. Update include and remove unused code in third_party/tensorf…

7ed33d9

…low/compiler/xla/service/spmd/shardy/mhlo_round_trip/export_shardings.cc. PiperOrigin-RevId: 691525161

[XLA:GPU] Unify the use of $direction in collective select folder test

7891c98

PiperOrigin-RevId: 691528885

Reverts 563dd9c

5c97317

PiperOrigin-RevId: 691539371

Crop landing images so there is less whitespace in XLA README.

5e7abae

PiperOrigin-RevId: 691565219

1. Don't propagate errors from recoverable tasks.

c6c9c59

2. Set tasks to error (and don't disconnect tasks) during a failed shutdown to avoid silent reconnects) PiperOrigin-RevId: 691586482

Integrate LLVM at llvm/llvm-project@4ba623f24479

d3720fe

Updates LLVM usage to match [4ba623f24479](llvm/llvm-project@4ba623f24479) PiperOrigin-RevId: 691589403

Delete unused serialize functions. There were replaced by byte_code_u…

03f5749

…til.h/cc PiperOrigin-RevId: 691591073

Add IFRT se gpu client and tests accordingly.

4c20ab4

PiperOrigin-RevId: 691602092

Set custom_option_alignment when exporting tflite.

ba07927

PiperOrigin-RevId: 691648381

Add version check for Compiler Plugin API

ccdf2a0

And fix version check for Dispatch API PiperOrigin-RevId: 691663743

Automated Code Change

735f149

PiperOrigin-RevId: 691667184

Automated Code Change

6c1fb94

PiperOrigin-RevId: 691667192

Automated Code Change

c9806b2

PiperOrigin-RevId: 691675947

Automated Code Change

8af0d89

PiperOrigin-RevId: 691681901

Automated Code Change

be5dbd9

PiperOrigin-RevId: 691692633

Remove unused variable from base_ops_test.h

5f6e3c4

PiperOrigin-RevId: 691718921

Update GraphDef version to 2032.

33cb370

PiperOrigin-RevId: 691720760

compat: Update forward compatibility horizon to 2024-10-31

fb1f654

PiperOrigin-RevId: 691720775

Automated Code Change

b460b37

PiperOrigin-RevId: 691739227

Wrap private functions in anonymous namespace in gpu_command_buffer.cc

1b88a6b

These functions are not declared in any header file, hence they should have internal linkage. PiperOrigin-RevId: 691746918

Reverts 9d5bb83

1492c2c

PiperOrigin-RevId: 691750937

pifon2a and others added 14 commits November 5, 2024 02:43

[XLA:GPU][Emitters] Port the complex.expm1 approximation used in the …

e0ccb4b

…legacy emitters. The default lowering in the MLIR repo is not stable for small imag(arg). PiperOrigin-RevId: 693262812

Automated Code Change

6026ad4

PiperOrigin-RevId: 693264535

Update XNNPack doc to reflect that XNNPack can handle dynamic tensors…

f9d6ddd

… efficently PiperOrigin-RevId: 693265606

[XLA:GPU] Fix test failures on Hopper for CUDA 12.6.2

c4bf15b

The test failures are due to the fact that the names of kernels from CUDA are not deterministic. PiperOrigin-RevId: 693272080

[XLA:GPU] Nest gemm fusions: hoist bitcasts up and down, but don't ba…

2798123

…il out if it fails. Add a large number of tests extracted from triton_fusion_emitter_device_legacy_test. PiperOrigin-RevId: 693278160

Make filegroups in third_party/triton public

23f69eb

PiperOrigin-RevId: 693279863

Automated Code Change

d4d59e6

PiperOrigin-RevId: 693283395

Fix a dangling llvm::function_ref reference in tensorflow/compiler/ml…

09b8b49

…ir/tensorflow/transforms/executor_island_coarsening.cc PiperOrigin-RevId: 693288112

[XLA:GPU] Update test names to reflect what is now being tested.

2f66af0

PiperOrigin-RevId: 693289401

Fix a dangling llvm::function_ref reference in third_party/tensorflow…

7b5dab1

…/compiler/mlir/tosa/transforms/convert_tfl_uint8.cc. PiperOrigin-RevId: 693300221

Merge remote-tracking branch 'upstream/master' into develop-upstream-…

68c43a9

…sync-241105

Fix merge conflicts

2baacd6

mmakevic-amd added 11 commits November 7, 2024 09:58

Disable svd tests

1c5d7d4

Skip newly added Triton subtest in dot_algorithms_test

4d45349

Keep track of loaded kernels in rocm executor

d7f29c0

Revert "Disable svd tests"

4ca780c

This reverts commit 1c5d7d4.

Fix grid dimension issue (openxla/xla#19162)

3ac1ead

Fix block dim limit in gpu_kernel_tiling_test

ed3e40f

Temporarily disable gpu_input_fusible_slice_test

5788ee9

Disable gpu_too_many_blocks_test

7550346

Disable complex_unary_op_test

04c3e86

Disable failing tensorflow tests

e2acb52

Temporarily disable falky check in rocm_stream_check

1d38891

mmakevic-amd requested review from i-chaochen and hsharsha November 28, 2024 10:48

hsharsha approved these changes Nov 29, 2024

View reviewed changes

mmakevic-amd merged commit 66ca76e into develop-upstream Dec 2, 2024
3 of 5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Develop upstream sync 241105 #2744

Develop upstream sync 241105 #2744

mmakevic-amd commented Nov 5, 2024

mmakevic-amd commented Nov 6, 2024

Develop upstream sync 241105 #2744

Develop upstream sync 241105 #2744

Conversation

mmakevic-amd commented Nov 5, 2024

mmakevic-amd commented Nov 6, 2024