
[ROCM] Support TF_ROCM_CLANG for builds with clang host compiler #2192

Merged
merged 4 commits into develop-upstream on Dec 9, 2023

Conversation

draganmladjenovic

Remove TF_NEED_CLANG workarounds

@i-chaochen

retest Ubuntu-CPU please

@jayfurmanek

We'll also need to update the symlink of rocm.bazelrc to point to gpu.bazelrc (instead of gpu_gcc.bazelrc)

@draganmladjenovic
Author

Retest Ubuntu-GPU-single please
Retest Ubuntu-CPU please

@i-chaochen
Copy link

I wonder what the situation is with this PR. Are we able to switch to clang now?

@draganmladjenovic
Author

Retest Ubuntu-CPU please.

@draganmladjenovic draganmladjenovic force-pushed the clang_build branch 3 times, most recently from 065bd06 to 2a4f1f7 Compare October 28, 2023 08:05
@draganmladjenovic
Author

Retest Ubuntu-GPU-multi please.

@draganmladjenovic draganmladjenovic force-pushed the clang_build branch 4 times, most recently from 6300e9a to cbdf070 Compare October 28, 2023 11:17
@draganmladjenovic
Author

Retest Ubuntu-CPU please.
Retest Ubuntu-GPU-single please.

@draganmladjenovic
Author

@jayfurmanek I think this one is ready. I slightly dislike having to change the root .bazelrc in the last commit. An alternative is to create /userlocal/ in Dockerfile.rocm and point bazel in run_gpu_single/multi.sh to use --config=/userlocal/gpu.bazelrc.

@draganmladjenovic
Author

@i-chaochen Where can I update the docker image for ThirdParty-XLA ci?

@i-chaochen

ThirdParty-XLA CI is using the same tensorflow docker image (rocm/tensorflow-build:latest-python3.9-rocm5.7.0). May I ask what you mean by updating it, and why you want to update it?

@draganmladjenovic
Author

> ThirdParty-XLA CI is using the same tensorflow docker image (rocm/tensorflow-build:latest-python3.9-rocm5.7.0). May I ask what you mean by updating it, and why you want to update it?

It is missing clang-16, so the PR fails, and cannot be merged.

@draganmladjenovic draganmladjenovic force-pushed the clang_build branch 2 times, most recently from be01b2d to 98885a3 Compare November 13, 2023 23:48
@draganmladjenovic
Author

> ThirdParty-XLA CI is using the same tensorflow docker image (rocm/tensorflow-build:latest-python3.9-rocm5.7.0). May I ask what you mean by updating it, and why you want to update it?
>
> It is missing clang-16, so the PR fails, and cannot be merged.

My bad. This CI is not blocking the merge. @jayfurmanek The only thing still missing is the review. Should I add more reviewers?

build:rocm --action_env=CLANG_COMPILER_PATH="/usr/lib/llvm-16/bin/clang"
# Disable unused-result on rocm builds.
build:rocm --copt="-Wno-error=unused-result"


Since we're switching to use gpu.bazelrc, could you also add xla_cpp_filters from gpu_gcc.bazelrc? Thanks!

@@ -15,6 +15,21 @@ build --action_env=CACHEBUSTER=565341047
# Build options for GPU Linux
build --config=release_gpu_linux


This line pulls in the cuda config from the .bazelrc.
We'll need a "release_rocm_linux" config there, I think.
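A "release_rocm_linux" config along the lines suggested might look roughly like this (a hypothetical sketch: the config name comes from the comment above, but the flag set is an illustrative assumption, not from this PR):

```
# Hypothetical release_rocm_linux config; the contents below are
# illustrative, not taken from this PR.
build:release_rocm_linux --config=rocm
build:release_rocm_linux --repo_env=TF_NEED_ROCM=1
build:release_rocm_linux --action_env=TF_ROCM_CLANG="1"
```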

build:rocm --define=tensorflow_mkldnn_contraction_kernel=0
build:rocm --repo_env TF_NEED_ROCM=1
build:rocm --action_env=TF_ROCM_CLANG="1"
build:rocm --repo_env=BAZEL_COMPILER="/usr/lib/llvm-16/bin/clang"


Can we update to clang-17 to match what's upstream now?


That will need to be changed in the container as well (merged separately)

Author


> Can we update to clang-17 to match what's upstream now?

Thanks. Missed that. Will update.
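For anyone reproducing this locally, the environment for a clang host-compiler ROCm build might be set up as follows (a sketch: the variable names mirror the bazelrc snippets in this PR, and the llvm-17 path is an assumption based on the clang-17 discussion above):

```shell
# Illustrative environment for a clang host-compiler ROCm build.
# The llvm-17 path is an assumption (per the clang-17 review comment)
# and may differ between docker images.
export TF_NEED_ROCM=1
export TF_ROCM_CLANG=1
export CLANG_COMPILER_PATH=/usr/lib/llvm-17/bin/clang
```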

@@ -15,6 +15,21 @@ build --action_env=CACHEBUSTER=565341047
# Build options for GPU Linux
build --config=release_gpu_linux

# ROCM: Set up compilation ROCM version and paths
build:rocm --linkopt="-fuse-ld=gold"


I think we want to use lld for the linker

Author


I think I had some issues getting lld on all docker images. I will have to check that.
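If lld does turn out to be available in all the docker images, the reviewer's suggestion would be a one-line change to the linkopt shown above (a sketch, assuming lld is installed in the build image):

```
# Swap the gold linker for lld (assumes lld is present in the image).
build:rocm --linkopt="-fuse-ld=lld"
```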

@draganmladjenovic draganmladjenovic force-pushed the clang_build branch 5 times, most recently from c549c5f to 43db4b8 Compare December 7, 2023 20:43
Test docker doesn't have /userlocal/gpu.bazelrc, so flip the switch
in the root .bazelrc.
@jayfurmanek

retest gpu-pycpp please

@jayfurmanek

retest gpu-non-pip-multi please

@jayfurmanek

retest Ubuntu-GPU-multi please

@draganmladjenovic
Author

Retest Ubuntu-GPU-single please.


@jayfurmanek jayfurmanek left a comment


Looks good!
Please tag the repo before and after merging:
pre-clang-merge
post-clang-merge

@draganmladjenovic draganmladjenovic merged commit 300e4cb into develop-upstream Dec 9, 2023
9 of 10 checks passed
3 participants