add sycl KI for InferenceBuilder by using syclomatic by baodii · Pull Request #21 · delock/DeepSpeedSYCLSupport

baodii · 2023-10-18T06:43:18Z

As with hipify, only JIT mode is supported.
Only inference kernels are supported for now.
Delete the other OpBuilders in op_builder/xpu/__init__.py and accelerator/xpu_accelerator.py.
Add rule.YAML, pre_process.sh and post_process.sh to adjust the source code and the generated code.
The original CUDA code is not changed; it is copied to the build folder.
SYCL code is generated in the third-party folder.
As with hipify, SYCL code is generated but not compiled when DeepSpeed is installed.

enable jit_load for sycl kernels
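
The description says the original CUDA sources stay untouched and only a copy in the build folder is processed. A minimal sketch of that staging step follows, assuming a hypothetical helper `stage_cuda_sources` and caller-supplied directory names; neither is taken from the PR.

```python
import shutil
from pathlib import Path


def stage_cuda_sources(src_root: Path, build_root: Path) -> list[Path]:
    """Copy the source tree into build_root and return the copied files.

    Hypothetical helper: the originals under src_root are only read,
    so any later pre-processing touches the staged copy alone.
    """
    dest = build_root / src_root.name
    # dirs_exist_ok lets repeated builds refresh the staged copy in place.
    shutil.copytree(src_root, dest, dirs_exist_ok=True)
    return sorted(p for p in dest.rglob("*") if p.is_file())
```

Any script that rewrites files would then be pointed at the returned paths rather than at the repository sources.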

baodii · 2023-10-18T06:46:27Z

@delock @CaoZhongZ @rogerxfeng8 please review

delock · 2023-10-18T06:50:16Z

op_builder/xpu/transformer_inference.py

+            return False
+
+        cuda_okay = True
+        if not self.is_rocm_pytorch() and not self.is_sycl_enabled() and torch.cuda.is_available():


Why are there changes for CUDA and ROCm?

Already deleted.

delock · 2023-10-18T06:52:14Z

op_builder/xpu/builder.py

 from deepspeed.ops.op_builder.builder import OpBuilder, TORCH_MAJOR, TORCH_MINOR

-
 class SYCLOpBuilder(OpBuilder):


Shouldn't there be two kinds of builder, SYCLOpBuilder and SYCLAutoOpBuilder, with SYCLAutoOpBuilder handling the ops migrated with SYCLomatic?

Added SYCLAutoOpBuilder below.
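
One way to read the two-builder split discussed in this thread is sketched here. The `OpBuilder` base is a stand-in so the example runs on its own, and the `cuda_sources` and `migrate` hooks are hypothetical names, not the PR's API; the `.cu` to `.cpp` mapping is a placeholder for the real translation step.

```python
class OpBuilder:
    """Stand-in for deepspeed's OpBuilder so the sketch is self-contained."""

    def __init__(self, name: str):
        self.name = name

    def sources(self) -> list[str]:
        raise NotImplementedError


class SYCLOpBuilder(OpBuilder):
    """Builder for ops whose SYCL sources are written by hand."""


class SYCLAutoOpBuilder(SYCLOpBuilder):
    """Builder for ops whose SYCL sources are migrated from CUDA."""

    def cuda_sources(self) -> list[str]:
        raise NotImplementedError

    def migrate(self, cuda_file: str) -> str:
        # Placeholder: only the extension changes in this sketch.
        return cuda_file.rsplit(".", 1)[0] + ".cpp"

    def sources(self) -> list[str]:
        # The build consumes the migrated files, never the CUDA originals.
        return [self.migrate(f) for f in self.cuda_sources()]
```

With this shape, a hand-written op subclasses `SYCLOpBuilder` and lists its files directly, while a migrated op subclasses `SYCLAutoOpBuilder` and lists only its CUDA inputs.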

* move scripts path to op_builder/xpu

delock · 2023-10-20T02:43:33Z

accelerator/xpu_accelerator.py

-            return FusedAdamBuilder
+            from deepspeed.ops.op_builder.xpu import CPUAdagradBuilder, CPUAdamBuilder, FusedAdamBuilder, AsyncIOBuilder, InferenceBuilder
+
+        if class_name == "InferenceBuilder":


This change seems to turn off CPUAdagradBuilder, CPUAdamBuilder, FusedAdamBuilder, AsyncIOBuilder. Is this intended?
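
The concern here is that a chain of `if class_name == ...` branches can silently drop builders. A table-driven lookup makes the enabled set explicit; the sketch below uses empty stand-in classes and a hypothetical `_BUILDERS` registry rather than the real imports from `deepspeed.ops.op_builder.xpu`.

```python
class CPUAdamBuilder:
    """Stand-in class for illustration only."""


class FusedAdamBuilder:
    """Stand-in class for illustration only."""


class InferenceBuilder:
    """Stand-in class for illustration only."""


# Hypothetical registry: every builder that should stay enabled is listed once.
_BUILDERS = {
    cls.__name__: cls
    for cls in (CPUAdamBuilder, FusedAdamBuilder, InferenceBuilder)
}


def get_op_builder(class_name: str):
    """Return the builder class for class_name, or None if it is not enabled."""
    return _BUILDERS.get(class_name)
```

Removing a builder then shows up as a one-line change to the registry, which is easier to spot in review.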

delock · 2023-10-20T02:45:10Z

.gitignore


 # Build + installation data
 build/
+third-party/


Is third-party/ used to store generated sycl kernels? What will the directory structure be under third-party?

delock · 2023-10-20T02:49:37Z

op_builder/xpu/post_process.sh

+find ./deepspeed/third-party/ -type f -exec sed -i "s/at::kCUDA/at::kXPU/g" {} +
+
+# fix pt_binding.cpp torch::from_blob 4 inputs pattern
+patch ./deepspeed/third-party/csrc/transformer/inference/csrc/pt_binding.cpp << 'DIFF___'


What error is this intended to fix? Why can't it be fixed in the source code directory? Is it temporary, or will it persist? What do we need to do when pt_binding.cpp changes?
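
For reference, the `find ... -exec sed -i "s/at::kCUDA/at::kXPU/g"` line in the diff above is a plain text substitution over every file in a tree. A rough Python equivalent is sketched below, assuming all files are text in the default encoding; `replace_in_tree` is a hypothetical name.

```python
from pathlib import Path


def replace_in_tree(root: Path, old: str, new: str) -> int:
    """Replace old with new in every file under root; return files changed."""
    changed = 0
    for path in root.rglob("*"):
        if not path.is_file():
            continue
        text = path.read_text()
        if old in text:
            # Rewrite only files that actually contain the pattern.
            path.write_text(text.replace(old, new))
            changed += 1
    return changed
```

Unlike the multi-line `patch` heredoc, a substitution of this kind does not depend on surrounding context lines, so it keeps working when unrelated parts of the file change.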

delock · 2023-10-20T02:53:13Z

op_builder/xpu/builder.py

+        return dpcpp_ext
+
+    def sycl_extension(self):
+        if self.is_sycl_enabled():


This function is very long. Can we extract two smaller functions, one for includes and one for sources?
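
The shape of the requested refactor might look like the sketch below: one helper per concern, with the long function reduced to combining them. The discovery rules (headers by `.h`, sources by suffix) and all function names are assumptions for illustration, not the logic in the PR.

```python
from pathlib import Path


def collect_include_dirs(root: Path) -> list[str]:
    """Directories under root that contain at least one header file."""
    return sorted({str(p.parent) for p in root.rglob("*.h")})


def collect_sources(root: Path, suffixes=(".cpp", ".cu")) -> list[str]:
    """Files under root whose suffix marks them as compilable sources."""
    return sorted(
        str(p) for p in root.rglob("*") if p.is_file() and p.suffix in suffixes
    )


def extension_inputs(root: Path) -> dict:
    """Combine both helpers into the inputs an extension build would need."""
    return {
        "include_dirs": collect_include_dirs(root),
        "sources": collect_sources(root),
    }
```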

delock · 2023-10-20T02:53:36Z

op_builder/xpu/builder.py

+                    trans_cmd = c2s_cmd + cuda_inc_flag + extra_args + in_root + out_root + cuda_source
+                    print("**** processing ", f'{trans_cmd}')
+                    p = subprocess.Popen(f'{trans_cmd}', stdout=subprocess.PIPE, shell=True)
+                    # processes_running.append(p)


Please remove code that is not needed.
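
As a side note on the diff above, the command is built by string concatenation and run with `shell=True`. A hedged alternative is to pass an argument list to `subprocess.run`, which avoids shell quoting issues and surfaces a non-zero exit code. In this sketch the parameters mirror the variable names in the diff but are treated as opaque lists of strings; `run_translate` is a hypothetical name and no real tool flags are assumed.

```python
import subprocess


def run_translate(tool, inc_flags, extra_args, in_root, out_root, source):
    """Run one translation command and return its captured stdout.

    Every argument except source is a list of strings; source is one path.
    """
    cmd = [*tool, *inc_flags, *extra_args, *in_root, *out_root, source]
    # No shell is involved, and check=True raises if the tool fails.
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return result.stdout
```

Because the call blocks until the tool exits, there is no process list to track, which also removes the need for the commented-out `processes_running` line.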

delock · 2023-10-20T02:55:07Z

op_builder/xpu/pre_process.sh

+find ./build/csrc -type f -exec sed -i "s/torch::from_blob/at::from_blob/g" {} +
+
+# fix inference_context.h to make it could be migrate
+patch ./build/csrc/transformer/inference/includes/inference_context.h << 'DIFF___'


Why can't the source code be changed directly?

Added the changes to the source code.

delock · 2023-10-20T02:56:05Z

@baodii comments added. Please also fix the format errors by turning on pre-commit in your environment.

baodii · 2023-10-23T10:28:22Z

Some license checks in the csrc/xpu folder do not pass.

baodii · 2023-10-24T03:40:05Z

@delock I have fixed these issues. Please review.

baodii added 2 commits October 17, 2023 23:01

add syclomatic code into upstream

3c9bb8f

enable jit_load for sycl kernels

find Python.h using general code

5f15b64

delock reviewed Oct 18, 2023

baodii added 4 commits October 18, 2023 00:22

* add SYCLAutoOpBuilder to support InferenceOpBuilder

48dc17c

* move scripts path to op_builder/xpu

only change cuda files extension

4f49b05

delete unused code in inferenceBuilder

a69f3b5

change third-party relative path to enable python install

65f4729

delock reviewed Oct 20, 2023

baodii added 3 commits October 23, 2023 01:40

extract smaller functions from sycl_extension

a357faf

change from_blob in source code to avoid big part post processing

1c27e28

run pre-commit

a2234a2

baodii added 3 commits October 23, 2023 18:29

Merge branch 'gma/xpu_upstream' into baodi/syclomatic

ca3ed6a

add BF16 support

99fa8d3

add license to csrc/xpu code

9224aed

delock merged commit c9ef1ef into delock:gma/xpu_upstream Oct 30, 2023
