Enable triton sparse gemm only for CUDA #27

hsharsha · 2024-07-05T10:02:15Z

No description provided.

i-chaochen · 2024-07-05T10:53:06Z

xla/service/elemental_ir_emitter.cc

  const HloDotInstruction* dot = Cast<HloDotInstruction>(hlo);
  if (dot->sparse_operands()) {
    return Unimplemented("Sparse dot is supported by Triton emitter only.");
  }
+#endif


IIUC, sparse dot requires AddGemmFusionAutotuningPasses, which in turn requires not only Triton autotuning but also the cuDNN fusion front end.

This is to unblock JAX so that it can use the sparse dot operation. https://github.com/ROCm/frameworks-internal/issues/8118

hsharsha · 2024-07-08T17:00:36Z

Converting to draft as we also need to address failing tests.

Ruturaj4 · 2024-07-05T13:27:48Z

xla/service/elemental_ir_emitter.cc

@@ -2921,10 +2921,12 @@ absl::StatusOr<llvm::Value*> ElementalIrEmitter::EmitElementalDot(
        "Algorithm not supported by the ElementalIrEmitter: %s",
        PrecisionConfig::Algorithm_Name(hlo->precision_config().algorithm())));
  }
+#ifdef GOOGLE_CUDA


@hsharsha @i-chaochen
I believe it would be better to use something like:

#ifndef TENSORFLOW_USE_ROCM

and also to add the following to the //xla/service:elemental_ir_emitter build target:

local_defines = if_cuda_is_configured(["GOOGLE_CUDA=1"]) + if_rocm_is_configured(["TENSORFLOW_USE_ROCM=1"])
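
To illustrate the difference between the two guards, here is a minimal, self-contained sketch. `CheckSparseDot` is a hypothetical stand-in for the sparse-operand check in `EmitElementalDot`, not the actual XLA code, and it returns a plain string instead of an `absl::Status`. With `#ifndef TENSORFLOW_USE_ROCM`, the check stays active in every build where that macro is not defined (including builds where neither macro is set), whereas `#ifdef GOOGLE_CUDA` would silently drop the check whenever `GOOGLE_CUDA` is not passed to the target.

```cpp
#include <string>

// Hypothetical stand-in for the sparse-operand check; returns an empty
// string when no error is raised.
std::string CheckSparseDot(bool has_sparse_operands) {
#ifndef TENSORFLOW_USE_ROCM
  // Compiled in unless the build defines TENSORFLOW_USE_ROCM.
  if (has_sparse_operands) {
    return "Sparse dot is supported by Triton emitter only.";
  }
#endif
  (void)has_sparse_operands;  // Avoids an unused-parameter warning on ROCm.
  return "";
}
```

Either guard only behaves as intended if the macro actually reaches the compiler for this target, which is presumably why the `local_defines` addition is suggested alongside it.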

Enable triton sparse gemm only for CUDA

deee85c

hsharsha requested a review from i-chaochen July 5, 2024 10:02

github-actions bot added the kokoro:force-run label Jul 5, 2024

hsharsha requested a review from Ruturaj4 July 5, 2024 10:02

i-chaochen reviewed Jul 5, 2024

i-chaochen approved these changes Jul 8, 2024

hsharsha marked this pull request as draft July 8, 2024 17:00

Ruturaj4 reviewed Jul 12, 2024
