AMDGPU: Move attributor into optimization pipeline #83131

Merged: 15 commits, Jul 14, 2024

@arsenm arsenm commented Feb 27, 2024

Removing the attributor from the codegen pipeline induces a lot of test churn, because llc no longer optimizes out implicit arguments to kernels.

The test changes are mostly mechanical, but some updates are more creative. I preferred to take the changes as-is in tests where the ABI isn't relevant. Where the ABI is more relevant, or the optimize-out logic was too ingrained in the test, I pre-run the optimization. Some tests manually add attributes to disable inputs.
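
For readers unfamiliar with pass-builder extension points, here is a minimal, self-contained sketch of the registration pattern this change relies on. It is a toy model, not LLVM code: `ToyPassBuilder`, `ToyPassManager`, `OptLevel`, `registerToyTargetCallbacks`, and `toyPipelineFor` are hypothetical stand-ins, and passes are represented as plain strings. The sketch assumes the registered callbacks are invoked at every optimization level, which is why the callback itself checks for O0.

```cpp
#include <functional>
#include <string>
#include <utility>
#include <vector>

// Toy stand-ins for illustration only; these are NOT the LLVM classes.
enum class OptLevel { O0, O1, O2, O3 };

struct ToyPassManager {
  std::vector<std::string> Passes;
  void addPass(std::string Name) { Passes.push_back(std::move(Name)); }
};

class ToyPassBuilder {
  using Callback = std::function<void(ToyPassManager &, OptLevel)>;
  std::vector<Callback> OptimizerLastCallbacks;

public:
  // Targets register hooks that run at the end of the optimization pipeline.
  void registerOptimizerLastEPCallback(Callback CB) {
    OptimizerLastCallbacks.push_back(std::move(CB));
  }

  // Build the middle-end pipeline, then let every registered hook append
  // its passes. Hooks are called at all levels (an assumption of this toy).
  ToyPassManager buildOptimizationPipeline(OptLevel Level) const {
    ToyPassManager PM;
    if (Level != OptLevel::O0)
      PM.addPass("generic-optimizations");
    for (const Callback &CB : OptimizerLastCallbacks)
      CB(PM, Level);
    return PM;
  }
};

// Mirrors the shape of the callback added in this patch: the attributor is
// appended only when optimizing.
void registerToyTargetCallbacks(ToyPassBuilder &PB) {
  PB.registerOptimizerLastEPCallback([](ToyPassManager &PM, OptLevel Level) {
    if (Level != OptLevel::O0)
      PM.addPass("amdgpu-attributor");
  });
}

// Convenience helper: the list of pass names scheduled for a given level.
std::vector<std::string> toyPipelineFor(OptLevel Level) {
  ToyPassBuilder PB;
  registerToyTargetCallbacks(PB);
  return PB.buildOptimizationPipeline(Level).Passes;
}
```

In this model, `toyPipelineFor(OptLevel::O0)` returns an empty list, while `toyPipelineFor(OptLevel::O2)` ends with `"amdgpu-attributor"`. A codegen-only run never goes through this pipeline, which is the source of the test churn described above.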


llvmbot commented Feb 27, 2024

@llvm/pr-subscribers-llvm-globalisel
@llvm/pr-subscribers-llvm-transforms

@llvm/pr-subscribers-backend-amdgpu

Author: Matt Arsenault (arsenm)

Patch is 17.97 MiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/83131.diff

534 Files Affected:

  • (modified) llvm/lib/Target/AMDGPU/AMDGPUTargetMachine.cpp (+8-5)
  • (modified) llvm/lib/Target/AMDGPU/SIFrameLowering.cpp (+6)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/addsubu64.ll (+8-8)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/atomicrmw_udec_wrap.ll (+278-259)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/atomicrmw_uinc_wrap.ll (+297-278)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/bool-legalization.ll (+4-4)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/call-outgoing-stack-args.ll (+10-10)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/cvt_f32_ubyte.ll (+38-38)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/divergent-control-flow.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/dynamic-alloca-uniform.ll (+15-15)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/extractelement.ll (+138-138)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/flat-scratch-init.ll (+5-3)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/flat-scratch.ll (+104-88)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/fp-atomics-gfx940.ll (+16-16)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/fp64-atomics-gfx90a.ll (+258-258)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/frem.ll (+56-56)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/implicit-kernarg-backend-usage-global-isel.ll (+20-20)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/inline-asm-mismatched-size.ll (+3)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/insertelement-stack-lower.ll (+3-3)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/insertelement.large.ll (+5-3)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/irtranslator-amdgpu_kernel-system-sgprs.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/irtranslator-amdgpu_kernel.ll (+236-236)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/irtranslator-fence.ll (+120)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/irtranslator-sibling-call.ll (+121-148)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/lds-global-value.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/lds-zero-initializer.ll (+2-2)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.div.scale.ll (+295-233)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.end.cf.i32.ll (+5-5)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.end.cf.i64.ll (+2-2)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.global.atomic.csub.ll (+12-12)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.global.atomic.fadd.ll (+4-4)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.if.break.i32.ll (+9-9)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.if.break.i64.ll (+2-2)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.intersect_ray.ll (+58-55)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.is.private.ll (+12-11)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.is.shared.ll (+12-11)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.kernarg.segment.ptr.ll (+4-3)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.mfma.gfx90a.ll (+17-17)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.mov.dpp.ll (+12-12)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.queue.ptr.ll (+13-2)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.sbfe.ll (+49-49)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.set.inactive.ll (+12-12)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.trig.preop.ll (+16-16)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.ubfe.ll (+63-63)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.update.dpp.ll (+8-7)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.workgroup.id.ll (+5-4)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.workitem.id.ll (+11-9)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/localizer.ll (+2-2)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/memory-legalizer-atomic-fence.ll (+42-40)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/mul-known-bits.i64.ll (+66-48)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/mul.ll (+12-12)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/non-entry-alloca.ll (+23-23)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/sdivrem.ll (+369-369)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/shl-ext-reduce.ll (+18-17)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/store-local.128.ll (+86-86)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/store-local.96.ll (+86-86)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/udivrem.ll (+242-242)
  • (modified) llvm/test/CodeGen/AMDGPU/GlobalISel/widen-i8-i16-scalar-loads.ll (+30-30)
  • (modified) llvm/test/CodeGen/AMDGPU/add.ll (+168-159)
  • (modified) llvm/test/CodeGen/AMDGPU/add.v2i16.ll (+122-102)
  • (modified) llvm/test/CodeGen/AMDGPU/addrspacecast.ll (+4-2)
  • (modified) llvm/test/CodeGen/AMDGPU/agpr-copy-no-free-registers.ll (+65-64)
  • (modified) llvm/test/CodeGen/AMDGPU/agpr-register-count.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/always-uniform.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/amd.endpgm.ll (+17-17)
  • (modified) llvm/test/CodeGen/AMDGPU/amdgpu-codegenprepare-fold-binop-select.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/amdgpu-codegenprepare-idiv.ll (+1511-1504)
  • (modified) llvm/test/CodeGen/AMDGPU/amdgpu-mul24-knownbits.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/amdgpu-simplify-libcall-sincos.ll (+24-24)
  • (modified) llvm/test/CodeGen/AMDGPU/amdgpu.private-memory.ll (+3-3)
  • (modified) llvm/test/CodeGen/AMDGPU/amdgpu.work-item-intrinsics.deprecated.ll (+18-18)
  • (modified) llvm/test/CodeGen/AMDGPU/amdpal-elf.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/anyext.ll (+19-19)
  • (modified) llvm/test/CodeGen/AMDGPU/atomic_optimizations_buffer.ll (+808-702)
  • (modified) llvm/test/CodeGen/AMDGPU/atomic_optimizations_global_pointer.ll (+701-592)
  • (modified) llvm/test/CodeGen/AMDGPU/atomic_optimizations_local_pointer.ll (+1162-1063)
  • (modified) llvm/test/CodeGen/AMDGPU/atomic_optimizations_raw_buffer.ll (+652-554)
  • (modified) llvm/test/CodeGen/AMDGPU/atomic_optimizations_struct_buffer.ll (+722-626)
  • (modified) llvm/test/CodeGen/AMDGPU/atomics_cond_sub.ll (+24-24)
  • (modified) llvm/test/CodeGen/AMDGPU/attr-amdgpu-waves-per-eu.ll (+2-2)
  • (modified) llvm/test/CodeGen/AMDGPU/attributor-noopt.ll (+3-3)
  • (modified) llvm/test/CodeGen/AMDGPU/bf16.ll (+280-280)
  • (modified) llvm/test/CodeGen/AMDGPU/bfe-combine.ll (+18-18)
  • (modified) llvm/test/CodeGen/AMDGPU/bfe-patterns.ll (+28-28)
  • (modified) llvm/test/CodeGen/AMDGPU/bfi_int.ll (+115-121)
  • (modified) llvm/test/CodeGen/AMDGPU/bfi_nested.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/bfm.ll (+8-8)
  • (modified) llvm/test/CodeGen/AMDGPU/bitreverse.ll (+89-73)
  • (modified) llvm/test/CodeGen/AMDGPU/br_cc.f16.ll (+32-32)
  • (modified) llvm/test/CodeGen/AMDGPU/branch-relax-spill.ll (+2-2)
  • (modified) llvm/test/CodeGen/AMDGPU/branch-relaxation.ll (+46-46)
  • (modified) llvm/test/CodeGen/AMDGPU/bswap.ll (+21-21)
  • (modified) llvm/test/CodeGen/AMDGPU/buffer-rsrc-ptr-ops.ll (+14-14)
  • (modified) llvm/test/CodeGen/AMDGPU/build_vector.ll (+2-2)
  • (modified) llvm/test/CodeGen/AMDGPU/call-constexpr.ll (+3-2)
  • (modified) llvm/test/CodeGen/AMDGPU/call-graph-register-usage.ll (+2-2)
  • (modified) llvm/test/CodeGen/AMDGPU/call-reqd-group-size.ll (+18-18)
  • (modified) llvm/test/CodeGen/AMDGPU/callee-special-input-sgprs-fixed-abi.ll (+4-2)
  • (modified) llvm/test/CodeGen/AMDGPU/callee-special-input-vgprs-packed.ll (+4-2)
  • (modified) llvm/test/CodeGen/AMDGPU/callee-special-input-vgprs.ll (+3-1)
  • (modified) llvm/test/CodeGen/AMDGPU/calling-conventions.ll (+135-77)
  • (modified) llvm/test/CodeGen/AMDGPU/carryout-selection.ll (+394-382)
  • (modified) llvm/test/CodeGen/AMDGPU/cc-update.ll (+9-9)
  • (modified) llvm/test/CodeGen/AMDGPU/cf-loop-on-constant.ll (+7-7)
  • (modified) llvm/test/CodeGen/AMDGPU/cgp-addressing-modes-gfx1030.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/cgp-addressing-modes-gfx908.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/cgp-bitfield-extract.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/chain-hi-to-lo.ll (+13-13)
  • (modified) llvm/test/CodeGen/AMDGPU/clamp-modifier.ll (+133-101)
  • (modified) llvm/test/CodeGen/AMDGPU/clamp.ll (+455-272)
  • (modified) llvm/test/CodeGen/AMDGPU/cluster_stores.ll (+6-6)
  • (modified) llvm/test/CodeGen/AMDGPU/coalesce-vgpr-alignment.ll (+4-2)
  • (modified) llvm/test/CodeGen/AMDGPU/code-object-v3.ll (+2-2)
  • (modified) llvm/test/CodeGen/AMDGPU/codegen-internal-only-func.ll (+3-23)
  • (modified) llvm/test/CodeGen/AMDGPU/collapse-endcf.ll (+7-7)
  • (modified) llvm/test/CodeGen/AMDGPU/combine-cond-add-sub.ll (+113-113)
  • (modified) llvm/test/CodeGen/AMDGPU/combine-reg-or-const.ll (+2-2)
  • (modified) llvm/test/CodeGen/AMDGPU/combine-vload-extract.ll (+2-2)
  • (modified) llvm/test/CodeGen/AMDGPU/copy-illegal-type.ll (+74-76)
  • (modified) llvm/test/CodeGen/AMDGPU/copy-to-reg-scc-clobber.ll (+16-16)
  • (modified) llvm/test/CodeGen/AMDGPU/copy_to_scc.ll (+3-3)
  • (modified) llvm/test/CodeGen/AMDGPU/ctlz.ll (+144-126)
  • (modified) llvm/test/CodeGen/AMDGPU/ctlz_zero_undef.ll (+122-122)
  • (modified) llvm/test/CodeGen/AMDGPU/ctpop16.ll (+44-44)
  • (modified) llvm/test/CodeGen/AMDGPU/ctpop64.ll (+62-62)
  • (modified) llvm/test/CodeGen/AMDGPU/cttz.ll (+84-84)
  • (modified) llvm/test/CodeGen/AMDGPU/cttz_zero_undef.ll (+83-83)
  • (modified) llvm/test/CodeGen/AMDGPU/cvt_f32_ubyte.ll (+141-110)
  • (modified) llvm/test/CodeGen/AMDGPU/dag-divergence-atomic.ll (+102-99)
  • (modified) llvm/test/CodeGen/AMDGPU/dagcomb-extract-vec-elt-different-sizes.ll (+3-3)
  • (modified) llvm/test/CodeGen/AMDGPU/dagcombine-setcc-select.ll (+4-4)
  • (modified) llvm/test/CodeGen/AMDGPU/divergence-driven-buildvector.ll (+76-76)
  • (modified) llvm/test/CodeGen/AMDGPU/divergence-driven-sext-inreg.ll (+4-4)
  • (modified) llvm/test/CodeGen/AMDGPU/divergence-driven-trunc-to-i1.ll (+6-6)
  • (modified) llvm/test/CodeGen/AMDGPU/ds-alignment.ll (+45-45)
  • (modified) llvm/test/CodeGen/AMDGPU/ds-combine-large-stride.ll (+13-13)
  • (modified) llvm/test/CodeGen/AMDGPU/ds-combine-with-dependence.ll (+6-4)
  • (modified) llvm/test/CodeGen/AMDGPU/ds-sub-offset.ll (+49-36)
  • (modified) llvm/test/CodeGen/AMDGPU/ds_read2.ll (+117-126)
  • (modified) llvm/test/CodeGen/AMDGPU/ds_write2.ll (+75-75)
  • (modified) llvm/test/CodeGen/AMDGPU/early-inline.ll (+1)
  • (modified) llvm/test/CodeGen/AMDGPU/elf-notes.ll (+3-1)
  • (modified) llvm/test/CodeGen/AMDGPU/exec-mask-opt-cannot-create-empty-or-backward-segment.ll (+5-5)
  • (modified) llvm/test/CodeGen/AMDGPU/expand-scalar-carry-out-select-user.ll (+4-4)
  • (modified) llvm/test/CodeGen/AMDGPU/extract_vector_dynelt.ll (+2-2)
  • (modified) llvm/test/CodeGen/AMDGPU/extract_vector_elt-f16.ll (+112-101)
  • (modified) llvm/test/CodeGen/AMDGPU/extract_vector_elt-i16.ll (+4-4)
  • (modified) llvm/test/CodeGen/AMDGPU/extract_vector_elt-i8.ll (+6-6)
  • (modified) llvm/test/CodeGen/AMDGPU/extractelt-to-trunc.ll (+10-10)
  • (modified) llvm/test/CodeGen/AMDGPU/fabs.f16.ll (+88-79)
  • (modified) llvm/test/CodeGen/AMDGPU/fadd.f16.ll (+78-54)
  • (modified) llvm/test/CodeGen/AMDGPU/fast-unaligned-load-store.global.ll (+18-18)
  • (modified) llvm/test/CodeGen/AMDGPU/fcanonicalize.f16.ll (+224-216)
  • (modified) llvm/test/CodeGen/AMDGPU/fcanonicalize.ll (+274-238)
  • (modified) llvm/test/CodeGen/AMDGPU/fcmp.f16.ll (+466-466)
  • (modified) llvm/test/CodeGen/AMDGPU/fcopysign.f16.ll (+311-290)
  • (modified) llvm/test/CodeGen/AMDGPU/fcopysign.f32.ll (+169-168)
  • (modified) llvm/test/CodeGen/AMDGPU/fcopysign.f64.ll (+218-219)
  • (modified) llvm/test/CodeGen/AMDGPU/fdiv.f16.ll (+155-127)
  • (modified) llvm/test/CodeGen/AMDGPU/fdiv.ll (+148-142)
  • (modified) llvm/test/CodeGen/AMDGPU/fdiv32-to-rcp-folding.ll (+46-46)
  • (modified) llvm/test/CodeGen/AMDGPU/flat-scratch-init.ll (+26-26)
  • (modified) llvm/test/CodeGen/AMDGPU/flat-scratch-svs.ll (+242-182)
  • (modified) llvm/test/CodeGen/AMDGPU/flat-scratch.ll (+371-343)
  • (modified) llvm/test/CodeGen/AMDGPU/flat_atomics.ll (+1613-1613)
  • (modified) llvm/test/CodeGen/AMDGPU/flat_atomics_i32_system.ll (+129-129)
  • (modified) llvm/test/CodeGen/AMDGPU/flat_atomics_i64.ll (+552-552)
  • (modified) llvm/test/CodeGen/AMDGPU/flat_atomics_i64_system.ll (+49-49)
  • (modified) llvm/test/CodeGen/AMDGPU/fma-combine.ll (+411-377)
  • (modified) llvm/test/CodeGen/AMDGPU/fma.ll (+4-4)
  • (modified) llvm/test/CodeGen/AMDGPU/fmax3.ll (+16-16)
  • (modified) llvm/test/CodeGen/AMDGPU/fmax_legacy.f64.ll (+8-8)
  • (modified) llvm/test/CodeGen/AMDGPU/fmaximum.ll (+4-4)
  • (modified) llvm/test/CodeGen/AMDGPU/fmed3.ll (+470-348)
  • (modified) llvm/test/CodeGen/AMDGPU/fmin3.ll (+24-24)
  • (modified) llvm/test/CodeGen/AMDGPU/fmin_legacy.f64.ll (+16-16)
  • (modified) llvm/test/CodeGen/AMDGPU/fminimum.ll (+4-4)
  • (modified) llvm/test/CodeGen/AMDGPU/fmul-2-combine-multi-use.ll (+162-162)
  • (modified) llvm/test/CodeGen/AMDGPU/fmul.f16.ll (+122-122)
  • (modified) llvm/test/CodeGen/AMDGPU/fmuladd.f16.ll (+220-136)
  • (modified) llvm/test/CodeGen/AMDGPU/fnearbyint.ll (+62-61)
  • (modified) llvm/test/CodeGen/AMDGPU/fneg-combines.new.ll (+36-36)
  • (modified) llvm/test/CodeGen/AMDGPU/fneg-fabs.f16.ll (+89-89)
  • (modified) llvm/test/CodeGen/AMDGPU/fneg-fabs.f64.ll (+3-1)
  • (modified) llvm/test/CodeGen/AMDGPU/fneg-fabs.ll (+3-3)
  • (modified) llvm/test/CodeGen/AMDGPU/fneg-modifier-casting.ll (+14-14)
  • (modified) llvm/test/CodeGen/AMDGPU/fneg.f16.ll (+66-62)
  • (modified) llvm/test/CodeGen/AMDGPU/fneg.ll (+3-3)
  • (modified) llvm/test/CodeGen/AMDGPU/force-store-sc0-sc1.ll (+10-10)
  • (modified) llvm/test/CodeGen/AMDGPU/fp-atomics-gfx1200.ll (+10-10)
  • (modified) llvm/test/CodeGen/AMDGPU/fp-atomics-gfx940.ll (+16-16)
  • (modified) llvm/test/CodeGen/AMDGPU/fp-classify.ll (+181-181)
  • (modified) llvm/test/CodeGen/AMDGPU/fp-min-max-buffer-atomics.ll (+113-120)
  • (modified) llvm/test/CodeGen/AMDGPU/fp-min-max-buffer-ptr-atomics.ll (+113-120)
  • (modified) llvm/test/CodeGen/AMDGPU/fp16_to_fp32.ll (+3-3)
  • (modified) llvm/test/CodeGen/AMDGPU/fp16_to_fp64.ll (+3-3)
  • (modified) llvm/test/CodeGen/AMDGPU/fp32_to_fp16.ll (+3-3)
  • (modified) llvm/test/CodeGen/AMDGPU/fp64-atomics-gfx90a.ll (+282-282)
  • (modified) llvm/test/CodeGen/AMDGPU/fp64-min-max-buffer-atomics.ll (+64-64)
  • (modified) llvm/test/CodeGen/AMDGPU/fp64-min-max-buffer-ptr-atomics.ll (+64-64)
  • (modified) llvm/test/CodeGen/AMDGPU/fp_to_sint.ll (+59-59)
  • (modified) llvm/test/CodeGen/AMDGPU/fp_to_uint.ll (+52-52)
  • (modified) llvm/test/CodeGen/AMDGPU/fpext.f16.ll (+60-68)
  • (modified) llvm/test/CodeGen/AMDGPU/fptosi.f16.ll (+25-25)
  • (modified) llvm/test/CodeGen/AMDGPU/fptoui.f16.ll (+28-27)
  • (modified) llvm/test/CodeGen/AMDGPU/fptrunc.f16.ll (+80-80)
  • (modified) llvm/test/CodeGen/AMDGPU/fptrunc.ll (+96-94)
  • (modified) llvm/test/CodeGen/AMDGPU/frem.ll (+224-224)
  • (modified) llvm/test/CodeGen/AMDGPU/fshl.ll (+171-173)
  • (modified) llvm/test/CodeGen/AMDGPU/fshr.ll (+103-105)
  • (modified) llvm/test/CodeGen/AMDGPU/fsqrt.f32.ll (+92-89)
  • (modified) llvm/test/CodeGen/AMDGPU/fsub.f16.ll (+78-78)
  • (modified) llvm/test/CodeGen/AMDGPU/function-args-inreg.ll (+685-664)
  • (modified) llvm/test/CodeGen/AMDGPU/fused-bitlogic.ll (+12-12)
  • (modified) llvm/test/CodeGen/AMDGPU/gds-allocation.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/gep-const-address-space.ll (+4-4)
  • (modified) llvm/test/CodeGen/AMDGPU/gfx11-user-sgpr-init16-bug.ll (+17-13)
  • (modified) llvm/test/CodeGen/AMDGPU/global-atomics-fp-wrong-subtarget.ll (+8-8)
  • (modified) llvm/test/CodeGen/AMDGPU/global-atomics-fp.ll (+354-354)
  • (modified) llvm/test/CodeGen/AMDGPU/global-constant.ll (+2-2)
  • (modified) llvm/test/CodeGen/AMDGPU/global-i16-load-store.ll (+12-12)
  • (modified) llvm/test/CodeGen/AMDGPU/global-load-saddr-to-vaddr.ll (+2-2)
  • (modified) llvm/test/CodeGen/AMDGPU/global_atomics.ll (+1319-1319)
  • (modified) llvm/test/CodeGen/AMDGPU/global_atomics_i32_system.ll (+122-122)
  • (modified) llvm/test/CodeGen/AMDGPU/global_atomics_i64.ll (+812-812)
  • (modified) llvm/test/CodeGen/AMDGPU/global_atomics_i64_system.ll (+53-53)
  • (modified) llvm/test/CodeGen/AMDGPU/global_atomics_scan_fadd.ll (+485-485)
  • (modified) llvm/test/CodeGen/AMDGPU/global_atomics_scan_fmax.ll (+126-126)
  • (modified) llvm/test/CodeGen/AMDGPU/global_atomics_scan_fmin.ll (+126-126)
  • (modified) llvm/test/CodeGen/AMDGPU/global_atomics_scan_fsub.ll (+495-495)
  • (modified) llvm/test/CodeGen/AMDGPU/global_smrd.ll (+2-2)
  • (modified) llvm/test/CodeGen/AMDGPU/half.ll (+183-183)
  • (modified) llvm/test/CodeGen/AMDGPU/hsa-metadata-agpr-register-count.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/hsa-metadata-heap-v5.ll (+3-2)
  • (modified) llvm/test/CodeGen/AMDGPU/hsa-metadata-hostcall-v4.ll (+3-2)
  • (modified) llvm/test/CodeGen/AMDGPU/hsa-metadata-hostcall-v5.ll (+3-2)
  • (modified) llvm/test/CodeGen/AMDGPU/hsa-metadata-kernel-code-props.ll (+34-24)
  • (modified) llvm/test/CodeGen/AMDGPU/hsa-metadata-multigrid-sync-arg-v5.ll (+3-2)
  • (modified) llvm/test/CodeGen/AMDGPU/hsa-metadata-queue-ptr-v5.ll (+9-7)
  • (modified) llvm/test/CodeGen/AMDGPU/hsa-metadata-queueptr-v5.ll (+3-2)
  • (modified) llvm/test/CodeGen/AMDGPU/hsa-metadata-resource-usage-function-ordering.ll (+3-2)
  • (modified) llvm/test/CodeGen/AMDGPU/hsa.ll (+4-2)
  • (modified) llvm/test/CodeGen/AMDGPU/idiv-licm.ll (+230-226)
  • (modified) llvm/test/CodeGen/AMDGPU/idot2.ll (+347-328)
  • (modified) llvm/test/CodeGen/AMDGPU/idot4s.ll (+394-340)
  • (modified) llvm/test/CodeGen/AMDGPU/idot4u.ll (+736-638)
  • (modified) llvm/test/CodeGen/AMDGPU/idot8s.ll (+367-355)
  • (modified) llvm/test/CodeGen/AMDGPU/idot8u.ll (+466-459)
  • (modified) llvm/test/CodeGen/AMDGPU/imm.ll (+266-234)
  • (modified) llvm/test/CodeGen/AMDGPU/imm16.ll (+272-258)
  • (modified) llvm/test/CodeGen/AMDGPU/immv216.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/implicit-kernarg-backend-usage.ll (+20-20)
  • (modified) llvm/test/CodeGen/AMDGPU/implicitarg-attributes.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/indirect-call-known-callees.ll (+43-39)
  • (modified) llvm/test/CodeGen/AMDGPU/infinite-loop.ll (+6-10)
  • (modified) llvm/test/CodeGen/AMDGPU/inline-asm.i128.ll (+12-12)
  • (modified) llvm/test/CodeGen/AMDGPU/inline-attr.ll (+10-7)
  • (modified) llvm/test/CodeGen/AMDGPU/inlineasm-packed.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/insert_vector_dynelt.ll (+368-366)
  • (modified) llvm/test/CodeGen/AMDGPU/insert_vector_elt.ll (+426-426)
  • (modified) llvm/test/CodeGen/AMDGPU/insert_vector_elt.v2i16.ll (+344-283)
  • (modified) llvm/test/CodeGen/AMDGPU/ipra.ll (+4-4)
  • (modified) llvm/test/CodeGen/AMDGPU/kernarg-size.ll (+5-2)
  • (modified) llvm/test/CodeGen/AMDGPU/kernel-args.ll (+468-465)
  • (modified) llvm/test/CodeGen/AMDGPU/kernel-argument-dag-lowering.ll (+50-48)
  • (modified) llvm/test/CodeGen/AMDGPU/kill-infinite-loop.ll (+3-3)
  • (modified) llvm/test/CodeGen/AMDGPU/large-alloca-compute.ll (+7-7)
  • (modified) llvm/test/CodeGen/AMDGPU/lds-atomic-fmin-fmax.ll ()
  • (modified) llvm/test/CodeGen/AMDGPU/lds-frame-extern.ll ()
  • (modified) llvm/test/CodeGen/AMDGPU/lds-global-non-entry-func.ll ()
  • (modified) llvm/test/CodeGen/AMDGPU/lds-zero-initializer.ll (+2-2)
  • (modified) llvm/test/CodeGen/AMDGPU/llc-pipeline.ll (-12)
  • (modified) llvm/test/CodeGen/AMDGPU/llvm.amdgcn.atomic.cond.sub.ll (+12-12)
  • (modified) llvm/test/CodeGen/AMDGPU/llvm.amdgcn.ballot.i64.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/llvm.amdgcn.cvt.pk.i16.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/llvm.amdgcn.cvt.pk.u16.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/llvm.amdgcn.cvt.pknorm.i16.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/llvm.amdgcn.cvt.pknorm.u16.ll (+1-1)
  • (modified) llvm/test/CodeGen/AMDGPU/llvm.amdgcn.cvt.pkrtz.ll (+103-89)
  • (modified) llvm/test/CodeGen/AMDGPU/llvm.amdgcn.dispatch.id.ll (+36-1)
  • (modified) llvm/test/CodeGen/AMDGPU/llvm.amdgcn.ds.gws.barrier-fastregalloc.ll (+2)
  • (modified) llvm/test/CodeGen/AMDGPU/llvm.amdgcn.exp.row.ll (+7-2)
  • (modified) llvm/test/CodeGen/AMDGPU/llvm.amdgcn.fcmp.w32.ll (+556-556)
  • (modified) llvm/test/CodeGen/AMDGPU/llvm.amdgcn.fcmp.w64.ll (+534-534)
  • (modified) llvm/test/CodeGen/AMDGPU/llvm.amdgcn.fdot2.bf16.bf16.ll (+14-13)
  • (modified) llvm/test/CodeGen/AMDGPU/llvm.amdgcn.fdot2.f16.f16.ll (+3-3)
diff --git a/llvm/lib/Target/AMDGPU/AMDGPUTargetMachine.cpp b/llvm/lib/Target/AMDGPU/AMDGPUTargetMachine.cpp
index 0d830df1f1f1df..3373942a7782b6 100644
--- a/llvm/lib/Target/AMDGPU/AMDGPUTargetMachine.cpp
+++ b/llvm/lib/Target/AMDGPU/AMDGPUTargetMachine.cpp
@@ -781,6 +781,14 @@ void AMDGPUTargetMachine::registerPassBuilderCallbacks(
 
         PM.addPass(createCGSCCToFunctionPassAdaptor(std::move(FPM)));
       });
+
+  // FIXME: Why is AMDGPUAttributor not in CGSCC?
+  PB.registerOptimizerLastEPCallback(
+      [this](ModulePassManager &MPM, OptimizationLevel Level) {
+        if (Level != OptimizationLevel::O0) {
+          MPM.addPass(AMDGPUAttributorPass(*this));
+        }
+      });
 }
 
 int64_t AMDGPUTargetMachine::getNullPointerValue(unsigned AddrSpace) {
@@ -1043,11 +1051,6 @@ void AMDGPUPassConfig::addIRPasses() {
     addPass(createAMDGPULowerModuleLDSLegacyPass(&TM));
   }
 
-  // AMDGPUAttributor infers lack of llvm.amdgcn.lds.kernel.id calls, so run
-  // after their introduction
-  if (TM.getOptLevel() > CodeGenOptLevel::None)
-    addPass(createAMDGPUAttributorLegacyPass());
-
   if (TM.getOptLevel() > CodeGenOptLevel::None)
     addPass(createInferAddressSpacesPass());
 
diff --git a/llvm/lib/Target/AMDGPU/SIFrameLowering.cpp b/llvm/lib/Target/AMDGPU/SIFrameLowering.cpp
index 4f106bf0dfb114..b22b6609c68673 100644
--- a/llvm/lib/Target/AMDGPU/SIFrameLowering.cpp
+++ b/llvm/lib/Target/AMDGPU/SIFrameLowering.cpp
@@ -678,6 +678,12 @@ void SIFrameLowering::emitEntryFunctionPrologue(MachineFunction &MF,
         break;
       }
     }
+
+    // FIXME: We can spill incoming arguments and restore at the end of the
+    // prolog.
+    if (!ScratchWaveOffsetReg)
+      report_fatal_error(
+          "could not find temporary scratch offset register in prolog");
   } else {
     ScratchWaveOffsetReg = PreloadedScratchWaveOffsetReg;
   }
diff --git a/llvm/test/CodeGen/AMDGPU/GlobalISel/addsubu64.ll b/llvm/test/CodeGen/AMDGPU/GlobalISel/addsubu64.ll
index a38b6e3263882c..359c1e53de99e3 100644
--- a/llvm/test/CodeGen/AMDGPU/GlobalISel/addsubu64.ll
+++ b/llvm/test/CodeGen/AMDGPU/GlobalISel/addsubu64.ll
@@ -6,8 +6,8 @@ define amdgpu_kernel void @s_add_u64(ptr addrspace(1) %out, i64 %a, i64 %b) {
 ; GFX11-LABEL: s_add_u64:
 ; GFX11:       ; %bb.0: ; %entry
 ; GFX11-NEXT:    s_clause 0x1
-; GFX11-NEXT:    s_load_b128 s[4:7], s[0:1], 0x24
-; GFX11-NEXT:    s_load_b64 s[0:1], s[0:1], 0x34
+; GFX11-NEXT:    s_load_b128 s[4:7], s[2:3], 0x24
+; GFX11-NEXT:    s_load_b64 s[0:1], s[2:3], 0x34
 ; GFX11-NEXT:    v_mov_b32_e32 v2, 0
 ; GFX11-NEXT:    s_waitcnt lgkmcnt(0)
 ; GFX11-NEXT:    s_add_u32 s0, s6, s0
@@ -22,8 +22,8 @@ define amdgpu_kernel void @s_add_u64(ptr addrspace(1) %out, i64 %a, i64 %b) {
 ; GFX12-LABEL: s_add_u64:
 ; GFX12:       ; %bb.0: ; %entry
 ; GFX12-NEXT:    s_clause 0x1
-; GFX12-NEXT:    s_load_b128 s[4:7], s[0:1], 0x24
-; GFX12-NEXT:    s_load_b64 s[0:1], s[0:1], 0x34
+; GFX12-NEXT:    s_load_b128 s[4:7], s[2:3], 0x24
+; GFX12-NEXT:    s_load_b64 s[0:1], s[2:3], 0x34
 ; GFX12-NEXT:    v_mov_b32_e32 v2, 0
 ; GFX12-NEXT:    s_wait_kmcnt 0x0
 ; GFX12-NEXT:    s_add_nc_u64 s[0:1], s[6:7], s[0:1]
@@ -58,8 +58,8 @@ define amdgpu_kernel void @s_sub_u64(ptr addrspace(1) %out, i64 %a, i64 %b) {
 ; GFX11-LABEL: s_sub_u64:
 ; GFX11:       ; %bb.0: ; %entry
 ; GFX11-NEXT:    s_clause 0x1
-; GFX11-NEXT:    s_load_b128 s[4:7], s[0:1], 0x24
-; GFX11-NEXT:    s_load_b64 s[0:1], s[0:1], 0x34
+; GFX11-NEXT:    s_load_b128 s[4:7], s[2:3], 0x24
+; GFX11-NEXT:    s_load_b64 s[0:1], s[2:3], 0x34
 ; GFX11-NEXT:    v_mov_b32_e32 v2, 0
 ; GFX11-NEXT:    s_waitcnt lgkmcnt(0)
 ; GFX11-NEXT:    s_sub_u32 s0, s6, s0
@@ -74,8 +74,8 @@ define amdgpu_kernel void @s_sub_u64(ptr addrspace(1) %out, i64 %a, i64 %b) {
 ; GFX12-LABEL: s_sub_u64:
 ; GFX12:       ; %bb.0: ; %entry
 ; GFX12-NEXT:    s_clause 0x1
-; GFX12-NEXT:    s_load_b128 s[4:7], s[0:1], 0x24
-; GFX12-NEXT:    s_load_b64 s[0:1], s[0:1], 0x34
+; GFX12-NEXT:    s_load_b128 s[4:7], s[2:3], 0x24
+; GFX12-NEXT:    s_load_b64 s[0:1], s[2:3], 0x34
 ; GFX12-NEXT:    v_mov_b32_e32 v2, 0
 ; GFX12-NEXT:    s_wait_kmcnt 0x0
 ; GFX12-NEXT:    s_sub_nc_u64 s[0:1], s[6:7], s[0:1]
diff --git a/llvm/test/CodeGen/AMDGPU/GlobalISel/atomicrmw_udec_wrap.ll b/llvm/test/CodeGen/AMDGPU/GlobalISel/atomicrmw_udec_wrap.ll
index b04bc04ab22691..705bcbddf227a6 100644
--- a/llvm/test/CodeGen/AMDGPU/GlobalISel/atomicrmw_udec_wrap.ll
+++ b/llvm/test/CodeGen/AMDGPU/GlobalISel/atomicrmw_udec_wrap.ll
@@ -16,8 +16,8 @@ declare i32 @llvm.amdgcn.workitem.id.x() #0
 define amdgpu_kernel void @lds_atomic_dec_ret_i32(ptr addrspace(1) %out, ptr addrspace(3) %ptr) #1 {
 ; CI-LABEL: lds_atomic_dec_ret_i32:
 ; CI:       ; %bb.0:
-; CI-NEXT:    s_load_dword s2, s[4:5], 0x2
-; CI-NEXT:    s_load_dwordx2 s[0:1], s[4:5], 0x0
+; CI-NEXT:    s_load_dword s2, s[6:7], 0x2
+; CI-NEXT:    s_load_dwordx2 s[0:1], s[6:7], 0x0
 ; CI-NEXT:    v_mov_b32_e32 v0, 42
 ; CI-NEXT:    s_mov_b32 m0, -1
 ; CI-NEXT:    s_waitcnt lgkmcnt(0)
@@ -31,8 +31,8 @@ define amdgpu_kernel void @lds_atomic_dec_ret_i32(ptr addrspace(1) %out, ptr add
 ;
 ; VI-LABEL: lds_atomic_dec_ret_i32:
 ; VI:       ; %bb.0:
-; VI-NEXT:    s_load_dword s2, s[4:5], 0x8
-; VI-NEXT:    s_load_dwordx2 s[0:1], s[4:5], 0x0
+; VI-NEXT:    s_load_dword s2, s[6:7], 0x8
+; VI-NEXT:    s_load_dwordx2 s[0:1], s[6:7], 0x0
 ; VI-NEXT:    v_mov_b32_e32 v0, 42
 ; VI-NEXT:    s_mov_b32 m0, -1
 ; VI-NEXT:    s_waitcnt lgkmcnt(0)
@@ -46,8 +46,8 @@ define amdgpu_kernel void @lds_atomic_dec_ret_i32(ptr addrspace(1) %out, ptr add
 ;
 ; GFX9-LABEL: lds_atomic_dec_ret_i32:
 ; GFX9:       ; %bb.0:
-; GFX9-NEXT:    s_load_dword s2, s[4:5], 0x8
-; GFX9-NEXT:    s_load_dwordx2 s[0:1], s[4:5], 0x0
+; GFX9-NEXT:    s_load_dword s2, s[6:7], 0x8
+; GFX9-NEXT:    s_load_dwordx2 s[0:1], s[6:7], 0x0
 ; GFX9-NEXT:    v_mov_b32_e32 v1, 42
 ; GFX9-NEXT:    s_waitcnt lgkmcnt(0)
 ; GFX9-NEXT:    v_mov_b32_e32 v0, s2
@@ -59,11 +59,11 @@ define amdgpu_kernel void @lds_atomic_dec_ret_i32(ptr addrspace(1) %out, ptr add
 ;
 ; GFX10-LABEL: lds_atomic_dec_ret_i32:
 ; GFX10:       ; %bb.0:
-; GFX10-NEXT:    s_load_dword s0, s[4:5], 0x8
+; GFX10-NEXT:    s_load_dword s0, s[6:7], 0x8
 ; GFX10-NEXT:    v_mov_b32_e32 v1, 42
 ; GFX10-NEXT:    s_waitcnt lgkmcnt(0)
 ; GFX10-NEXT:    v_mov_b32_e32 v0, s0
-; GFX10-NEXT:    s_load_dwordx2 s[0:1], s[4:5], 0x0
+; GFX10-NEXT:    s_load_dwordx2 s[0:1], s[6:7], 0x0
 ; GFX10-NEXT:    s_waitcnt lgkmcnt(0)
 ; GFX10-NEXT:    ds_dec_rtn_u32 v0, v0, v1
 ; GFX10-NEXT:    s_waitcnt lgkmcnt(0)
@@ -74,11 +74,11 @@ define amdgpu_kernel void @lds_atomic_dec_ret_i32(ptr addrspace(1) %out, ptr add
 ;
 ; GFX11-LABEL: lds_atomic_dec_ret_i32:
 ; GFX11:       ; %bb.0:
-; GFX11-NEXT:    s_clause 0x1
-; GFX11-NEXT:    s_load_b32 s2, s[0:1], 0x8
-; GFX11-NEXT:    s_load_b64 s[0:1], s[0:1], 0x0
+; GFX11-NEXT:    s_load_b32 s0, s[2:3], 0x8
+; GFX11-NEXT:    s_waitcnt lgkmcnt(0)
+; GFX11-NEXT:    v_dual_mov_b32 v1, 42 :: v_dual_mov_b32 v0, s0
+; GFX11-NEXT:    s_load_b64 s[0:1], s[2:3], 0x0
 ; GFX11-NEXT:    s_waitcnt lgkmcnt(0)
-; GFX11-NEXT:    v_dual_mov_b32 v1, 42 :: v_dual_mov_b32 v0, s2
 ; GFX11-NEXT:    ds_dec_rtn_u32 v0, v0, v1
 ; GFX11-NEXT:    s_waitcnt lgkmcnt(0)
 ; GFX11-NEXT:    buffer_gl0_inv
@@ -95,8 +95,8 @@ define amdgpu_kernel void @lds_atomic_dec_ret_i32(ptr addrspace(1) %out, ptr add
 define amdgpu_kernel void @lds_atomic_dec_ret_i32_offset(ptr addrspace(1) %out, ptr addrspace(3) %ptr) #1 {
 ; CI-LABEL: lds_atomic_dec_ret_i32_offset:
 ; CI:       ; %bb.0:
-; CI-NEXT:    s_load_dword s2, s[4:5], 0x2
-; CI-NEXT:    s_load_dwordx2 s[0:1], s[4:5], 0x0
+; CI-NEXT:    s_load_dword s2, s[6:7], 0x2
+; CI-NEXT:    s_load_dwordx2 s[0:1], s[6:7], 0x0
 ; CI-NEXT:    v_mov_b32_e32 v0, 42
 ; CI-NEXT:    s_mov_b32 m0, -1
 ; CI-NEXT:    s_waitcnt lgkmcnt(0)
@@ -110,8 +110,8 @@ define amdgpu_kernel void @lds_atomic_dec_ret_i32_offset(ptr addrspace(1) %out,
 ;
 ; VI-LABEL: lds_atomic_dec_ret_i32_offset:
 ; VI:       ; %bb.0:
-; VI-NEXT:    s_load_dword s2, s[4:5], 0x8
-; VI-NEXT:    s_load_dwordx2 s[0:1], s[4:5], 0x0
+; VI-NEXT:    s_load_dword s2, s[6:7], 0x8
+; VI-NEXT:    s_load_dwordx2 s[0:1], s[6:7], 0x0
 ; VI-NEXT:    v_mov_b32_e32 v0, 42
 ; VI-NEXT:    s_mov_b32 m0, -1
 ; VI-NEXT:    s_waitcnt lgkmcnt(0)
@@ -125,8 +125,8 @@ define amdgpu_kernel void @lds_atomic_dec_ret_i32_offset(ptr addrspace(1) %out,
 ;
 ; GFX9-LABEL: lds_atomic_dec_ret_i32_offset:
 ; GFX9:       ; %bb.0:
-; GFX9-NEXT:    s_load_dword s2, s[4:5], 0x8
-; GFX9-NEXT:    s_load_dwordx2 s[0:1], s[4:5], 0x0
+; GFX9-NEXT:    s_load_dword s2, s[6:7], 0x8
+; GFX9-NEXT:    s_load_dwordx2 s[0:1], s[6:7], 0x0
 ; GFX9-NEXT:    v_mov_b32_e32 v0, 42
 ; GFX9-NEXT:    s_waitcnt lgkmcnt(0)
 ; GFX9-NEXT:    v_mov_b32_e32 v1, s2
@@ -138,11 +138,11 @@ define amdgpu_kernel void @lds_atomic_dec_ret_i32_offset(ptr addrspace(1) %out,
 ;
 ; GFX10-LABEL: lds_atomic_dec_ret_i32_offset:
 ; GFX10:       ; %bb.0:
-; GFX10-NEXT:    s_load_dword s0, s[4:5], 0x8
+; GFX10-NEXT:    s_load_dword s0, s[6:7], 0x8
 ; GFX10-NEXT:    v_mov_b32_e32 v0, 42
 ; GFX10-NEXT:    s_waitcnt lgkmcnt(0)
 ; GFX10-NEXT:    v_mov_b32_e32 v1, s0
-; GFX10-NEXT:    s_load_dwordx2 s[0:1], s[4:5], 0x0
+; GFX10-NEXT:    s_load_dwordx2 s[0:1], s[6:7], 0x0
 ; GFX10-NEXT:    s_waitcnt lgkmcnt(0)
 ; GFX10-NEXT:    ds_dec_rtn_u32 v0, v1, v0 offset:16
 ; GFX10-NEXT:    s_waitcnt lgkmcnt(0)
@@ -153,11 +153,11 @@ define amdgpu_kernel void @lds_atomic_dec_ret_i32_offset(ptr addrspace(1) %out,
 ;
 ; GFX11-LABEL: lds_atomic_dec_ret_i32_offset:
 ; GFX11:       ; %bb.0:
-; GFX11-NEXT:    s_load_b32 s2, s[0:1], 0x8
-; GFX11-NEXT:    v_mov_b32_e32 v0, 42
-; GFX11-NEXT:    s_load_b64 s[0:1], s[0:1], 0x0
+; GFX11-NEXT:    s_load_b32 s0, s[2:3], 0x8
+; GFX11-NEXT:    s_waitcnt lgkmcnt(0)
+; GFX11-NEXT:    v_dual_mov_b32 v0, 42 :: v_dual_mov_b32 v1, s0
+; GFX11-NEXT:    s_load_b64 s[0:1], s[2:3], 0x0
 ; GFX11-NEXT:    s_waitcnt lgkmcnt(0)
-; GFX11-NEXT:    v_mov_b32_e32 v1, s2
 ; GFX11-NEXT:    ds_dec_rtn_u32 v0, v1, v0 offset:16
 ; GFX11-NEXT:    s_waitcnt lgkmcnt(0)
 ; GFX11-NEXT:    buffer_gl0_inv
@@ -175,7 +175,7 @@ define amdgpu_kernel void @lds_atomic_dec_ret_i32_offset(ptr addrspace(1) %out,
 define amdgpu_kernel void @lds_atomic_dec_noret_i32(ptr addrspace(3) %ptr) #1 {
 ; CI-LABEL: lds_atomic_dec_noret_i32:
 ; CI:       ; %bb.0:
-; CI-NEXT:    s_load_dword s0, s[4:5], 0x0
+; CI-NEXT:    s_load_dword s0, s[6:7], 0x0
 ; CI-NEXT:    v_mov_b32_e32 v0, 42
 ; CI-NEXT:    s_mov_b32 m0, -1
 ; CI-NEXT:    s_waitcnt lgkmcnt(0)
@@ -186,7 +186,7 @@ define amdgpu_kernel void @lds_atomic_dec_noret_i32(ptr addrspace(3) %ptr) #1 {
 ;
 ; VI-LABEL: lds_atomic_dec_noret_i32:
 ; VI:       ; %bb.0:
-; VI-NEXT:    s_load_dword s0, s[4:5], 0x0
+; VI-NEXT:    s_load_dword s0, s[6:7], 0x0
 ; VI-NEXT:    v_mov_b32_e32 v0, 42
 ; VI-NEXT:    s_mov_b32 m0, -1
 ; VI-NEXT:    s_waitcnt lgkmcnt(0)
@@ -197,7 +197,7 @@ define amdgpu_kernel void @lds_atomic_dec_noret_i32(ptr addrspace(3) %ptr) #1 {
 ;
 ; GFX9-LABEL: lds_atomic_dec_noret_i32:
 ; GFX9:       ; %bb.0:
-; GFX9-NEXT:    s_load_dword s0, s[4:5], 0x0
+; GFX9-NEXT:    s_load_dword s0, s[6:7], 0x0
 ; GFX9-NEXT:    v_mov_b32_e32 v1, 42
 ; GFX9-NEXT:    s_waitcnt lgkmcnt(0)
 ; GFX9-NEXT:    v_mov_b32_e32 v0, s0
@@ -207,7 +207,7 @@ define amdgpu_kernel void @lds_atomic_dec_noret_i32(ptr addrspace(3) %ptr) #1 {
 ;
 ; GFX10-LABEL: lds_atomic_dec_noret_i32:
 ; GFX10:       ; %bb.0:
-; GFX10-NEXT:    s_load_dword s0, s[4:5], 0x0
+; GFX10-NEXT:    s_load_dword s0, s[6:7], 0x0
 ; GFX10-NEXT:    v_mov_b32_e32 v1, 42
 ; GFX10-NEXT:    s_waitcnt lgkmcnt(0)
 ; GFX10-NEXT:    v_mov_b32_e32 v0, s0
@@ -218,7 +218,7 @@ define amdgpu_kernel void @lds_atomic_dec_noret_i32(ptr addrspace(3) %ptr) #1 {
 ;
 ; GFX11-LABEL: lds_atomic_dec_noret_i32:
 ; GFX11:       ; %bb.0:
-; GFX11-NEXT:    s_load_b32 s0, s[0:1], 0x0
+; GFX11-NEXT:    s_load_b32 s0, s[2:3], 0x0
 ; GFX11-NEXT:    s_waitcnt lgkmcnt(0)
 ; GFX11-NEXT:    v_dual_mov_b32 v1, 42 :: v_dual_mov_b32 v0, s0
 ; GFX11-NEXT:    ds_dec_u32 v0, v1
@@ -232,7 +232,7 @@ define amdgpu_kernel void @lds_atomic_dec_noret_i32(ptr addrspace(3) %ptr) #1 {
 define amdgpu_kernel void @lds_atomic_dec_noret_i32_offset(ptr addrspace(3) %ptr) #1 {
 ; CI-LABEL: lds_atomic_dec_noret_i32_offset:
 ; CI:       ; %bb.0:
-; CI-NEXT:    s_load_dword s0, s[4:5], 0x0
+; CI-NEXT:    s_load_dword s0, s[6:7], 0x0
 ; CI-NEXT:    v_mov_b32_e32 v0, 42
 ; CI-NEXT:    s_mov_b32 m0, -1
 ; CI-NEXT:    s_waitcnt lgkmcnt(0)
@@ -243,7 +243,7 @@ define amdgpu_kernel void @lds_atomic_dec_noret_i32_offset(ptr addrspace(3) %ptr
 ;
 ; VI-LABEL: lds_atomic_dec_noret_i32_offset:
 ; VI:       ; %bb.0:
-; VI-NEXT:    s_load_dword s0, s[4:5], 0x0
+; VI-NEXT:    s_load_dword s0, s[6:7], 0x0
 ; VI-NEXT:    v_mov_b32_e32 v0, 42
 ; VI-NEXT:    s_mov_b32 m0, -1
 ; VI-NEXT:    s_waitcnt lgkmcnt(0)
@@ -254,7 +254,7 @@ define amdgpu_kernel void @lds_atomic_dec_noret_i32_offset(ptr addrspace(3) %ptr
 ;
 ; GFX9-LABEL: lds_atomic_dec_noret_i32_offset:
 ; GFX9:       ; %bb.0:
-; GFX9-NEXT:    s_load_dword s0, s[4:5], 0x0
+; GFX9-NEXT:    s_load_dword s0, s[6:7], 0x0
 ; GFX9-NEXT:    v_mov_b32_e32 v0, 42
 ; GFX9-NEXT:    s_waitcnt lgkmcnt(0)
 ; GFX9-NEXT:    v_mov_b32_e32 v1, s0
@@ -264,7 +264,7 @@ define amdgpu_kernel void @lds_atomic_dec_noret_i32_offset(ptr addrspace(3) %ptr
 ;
 ; GFX10-LABEL: lds_atomic_dec_noret_i32_offset:
 ; GFX10:       ; %bb.0:
-; GFX10-NEXT:    s_load_dword s0, s[4:5], 0x0
+; GFX10-NEXT:    s_load_dword s0, s[6:7], 0x0
 ; GFX10-NEXT:    v_mov_b32_e32 v0, 42
 ; GFX10-NEXT:    s_waitcnt lgkmcnt(0)
 ; GFX10-NEXT:    v_mov_b32_e32 v1, s0
@@ -275,7 +275,7 @@ define amdgpu_kernel void @lds_atomic_dec_noret_i32_offset(ptr addrspace(3) %ptr
 ;
 ; GFX11-LABEL: lds_atomic_dec_noret_i32_offset:
 ; GFX11:       ; %bb.0:
-; GFX11-NEXT:    s_load_b32 s0, s[0:1], 0x0
+; GFX11-NEXT:    s_load_b32 s0, s[2:3], 0x0
 ; GFX11-NEXT:    s_waitcnt lgkmcnt(0)
 ; GFX11-NEXT:    v_dual_mov_b32 v0, 42 :: v_dual_mov_b32 v1, s0
 ; GFX11-NEXT:    ds_dec_u32 v1, v0 offset:16
@@ -290,7 +290,7 @@ define amdgpu_kernel void @lds_atomic_dec_noret_i32_offset(ptr addrspace(3) %ptr
 define amdgpu_kernel void @global_atomic_dec_ret_i32(ptr addrspace(1) %out, ptr addrspace(1) %ptr) #1 {
 ; CI-LABEL: global_atomic_dec_ret_i32:
 ; CI:       ; %bb.0:
-; CI-NEXT:    s_load_dwordx4 s[0:3], s[4:5], 0x0
+; CI-NEXT:    s_load_dwordx4 s[0:3], s[6:7], 0x0
 ; CI-NEXT:    v_mov_b32_e32 v2, 42
 ; CI-NEXT:    s_waitcnt lgkmcnt(0)
 ; CI-NEXT:    v_mov_b32_e32 v0, s2
@@ -305,7 +305,7 @@ define amdgpu_kernel void @global_atomic_dec_ret_i32(ptr addrspace(1) %out, ptr
 ;
 ; VI-LABEL: global_atomic_dec_ret_i32:
 ; VI:       ; %bb.0:
-; VI-NEXT:    s_load_dwordx4 s[0:3], s[4:5], 0x0
+; VI-NEXT:    s_load_dwordx4 s[0:3], s[6:7], 0x0
 ; VI-NEXT:    v_mov_b32_e32 v2, 42
 ; VI-NEXT:    s_waitcnt lgkmcnt(0)
 ; VI-NEXT:    v_mov_b32_e32 v0, s2
@@ -320,7 +320,7 @@ define amdgpu_kernel void @global_atomic_dec_ret_i32(ptr addrspace(1) %out, ptr
 ;
 ; GFX9-LABEL: global_atomic_dec_ret_i32:
 ; GFX9:       ; %bb.0:
-; GFX9-NEXT:    s_load_dwordx4 s[0:3], s[4:5], 0x0
+; GFX9-NEXT:    s_load_dwordx4 s[0:3], s[6:7], 0x0
 ; GFX9-NEXT:    v_mov_b32_e32 v0, 42
 ; GFX9-NEXT:    v_mov_b32_e32 v1, 0
 ; GFX9-NEXT:    s_waitcnt lgkmcnt(0)
@@ -332,7 +332,7 @@ define amdgpu_kernel void @global_atomic_dec_ret_i32(ptr addrspace(1) %out, ptr
 ;
 ; GFX10-LABEL: global_atomic_dec_ret_i32:
 ; GFX10:       ; %bb.0:
-; GFX10-NEXT:    s_load_dwordx4 s[0:3], s[4:5], 0x0
+; GFX10-NEXT:    s_load_dwordx4 s[0:3], s[6:7], 0x0
 ; GFX10-NEXT:    v_mov_b32_e32 v0, 42
 ; GFX10-NEXT:    v_mov_b32_e32 v1, 0
 ; GFX10-NEXT:    s_waitcnt lgkmcnt(0)
@@ -345,7 +345,7 @@ define amdgpu_kernel void @global_atomic_dec_ret_i32(ptr addrspace(1) %out, ptr
 ;
 ; GFX11-LABEL: global_atomic_dec_ret_i32:
 ; GFX11:       ; %bb.0:
-; GFX11-NEXT:    s_load_b128 s[0:3], s[0:1], 0x0
+; GFX11-NEXT:    s_load_b128 s[0:3], s[2:3], 0x0
 ; GFX11-NEXT:    v_dual_mov_b32 v0, 42 :: v_dual_mov_b32 v1, 0
 ; GFX11-NEXT:    s_waitcnt lgkmcnt(0)
 ; GFX11-NEXT:    global_atomic_dec_u32 v0, v1, v0, s[2:3] glc
@@ -364,7 +364,7 @@ define amdgpu_kernel void @global_atomic_dec_ret_i32(ptr addrspace(1) %out, ptr
 define amdgpu_kernel void @global_atomic_dec_ret_i32_offset(ptr addrspace(1) %out, ptr addrspace(1) %ptr) #1 {
 ; CI-LABEL: global_atomic_dec_ret_i32_offset:
 ; CI:       ; %bb.0:
-; CI-NEXT:    s_load_dwordx4 s[0:3], s[4:5], 0x0
+; CI-NEXT:    s_load_dwordx4 s[0:3], s[6:7], 0x0
 ; CI-NEXT:    v_mov_b32_e32 v2, 42
 ; CI-NEXT:    s_waitcnt lgkmcnt(0)
 ; CI-NEXT:    s_add_u32 s2, s2, 16
@@ -381,7 +381,7 @@ define amdgpu_kernel void @global_atomic_dec_ret_i32_offset(ptr addrspace(1) %ou
 ;
 ; VI-LABEL: global_atomic_dec_ret_i32_offset:
 ; VI:       ; %bb.0:
-; VI-NEXT:    s_load_dwordx4 s[0:3], s[4:5], 0x0
+; VI-NEXT:    s_load_dwordx4 s[0:3], s[6:7], 0x0
 ; VI-NEXT:    v_mov_b32_e32 v2, 42
 ; VI-NEXT:    s_waitcnt lgkmcnt(0)
 ; VI-NEXT:    s_add_u32 s2, s2, 16
@@ -398,7 +398,7 @@ define amdgpu_kernel void @global_atomic_dec_ret_i32_offset(ptr addrspace(1) %ou
 ;
 ; GFX9-LABEL: global_atomic_dec_ret_i32_offset:
 ; GFX9:       ; %bb.0:
-; GFX9-NEXT:    s_load_dwordx4 s[0:3], s[4:5], 0x0
+; GFX9-NEXT:    s_load_dwordx4 s[0:3], s[6:7], 0x0
 ; GFX9-NEXT:    v_mov_b32_e32 v0, 42
 ; GFX9-NEXT:    v_mov_b32_e32 v1, 0
 ; GFX9-NEXT:    s_waitcnt lgkmcnt(0)
@@ -410,7 +410,7 @@ define amdgpu_kernel void @global_atomic_dec_ret_i32_offset(ptr addrspace(1) %ou
 ;
 ; GFX10-LABEL: global_atomic_dec_ret_i32_offset:
 ; GFX10:       ; %bb.0:
-; GFX10-NEXT:    s_load_dwordx4 s[0:3], s[4:5], 0x0
+; GFX10-NEXT:    s_load_dwordx4 s[0:3], s[6:7], 0x0
 ; GFX10-NEXT:    v_mov_b32_e32 v0, 42
 ; GFX10-NEXT:    v_mov_b32_e32 v1, 0
 ; GFX10-NEXT:    s_waitcnt lgkmcnt(0)
@@ -423,7 +423,7 @@ define amdgpu_kernel void @global_atomic_dec_ret_i32_offset(ptr addrspace(1) %ou
 ;
 ; GFX11-LABEL: global_atomic_dec_ret_i32_offset:
 ; GFX11:       ; %bb.0:
-; GFX11-NEXT:    s_load_b128 s[0:3], s[0:1], 0x0
+; GFX11-NEXT:    s_load_b128 s[0:3], s[2:3], 0x0
 ; GFX11-NEXT:    v_dual_mov_b32 v0, 42 :: v_dual_mov_b32 v1, 0
 ; GFX11-NEXT:    s_waitcnt lgkmcnt(0)
 ; GFX11-NEXT:    global_atomic_dec_u32 v0, v1, v0, s[2:3] offset:16 glc
@@ -443,7 +443,7 @@ define amdgpu_kernel void @global_atomic_dec_ret_i32_offset(ptr addrspace(1) %ou
 define amdgpu_kernel void @global_atomic_dec_ret_i32_offset_system(ptr addrspace(1) %out, ptr addrspace(1) %ptr) #1 {
 ; CI-LABEL: global_atomic_dec_ret_i32_offset_system:
 ; CI:       ; %bb.0:
-; CI-NEXT:    s_load_dwordx4 s[0:3], s[4:5], 0x0
+; CI-NEXT:    s_load_dwordx4 s[0:3], s[6:7], 0x0
 ; CI-NEXT:    v_mov_b32_e32 v2, 42
 ; CI-NEXT:    s_waitcnt lgkmcnt(0)
 ; CI-NEXT:    s_add_u32 s2, s2, 16
@@ -460,7 +460,7 @@ define amdgpu_kernel void @global_atomic_dec_ret_i32_offset_system(ptr addrspace
 ;
 ; VI-LABEL: global_atomic_dec_ret_i32_offset_system:
 ; VI:       ; %bb.0:
-; VI-NEXT:    s_load_dwordx4 s[0:3], s[4:5], 0x0
+; VI-NEXT:    s_load_dwordx4 s[0:3], s[6:7], 0x0
 ; VI-NEXT:    v_mov_b32_e32 v2, 42
 ; VI-NEXT:    s_waitcnt lgkmcnt(0)
 ; VI-NEXT:    s_add_u32 s2, s2, 16
@@ -477,7 +477,7 @@ define amdgpu_kernel void @global_atomic_dec_ret_i32_offset_system(ptr addrspace
 ;
 ; GFX9-LABEL: global_atomic_dec_ret_i32_offset_system:
 ; GFX9:       ; %bb.0:
-; GFX9-NEXT:    s_load_dwordx4 s[0:3], s[4:5], 0x0
+; GFX9-NEXT:    s_load_dwordx4 s[0:3], s[6:7], 0x0
 ; GFX9-NEXT:    v_mov_b32_e32 v0, 42
 ; GFX9-NEXT:    v_mov_b32_e32 v1, 0
 ; GFX9-NEXT:    s_waitcnt lgkmcnt(0)
@@ -489,7 +489,7 @@ define amdgpu_kernel void @global_atomic_dec_ret_i32_offset_system(ptr addrspace
 ;
 ; GFX10-LABEL: global_atomic_dec_ret_i32_offset_system:
 ; GFX10:       ; %bb.0:
-; GFX10-NEXT:    s_load_dwordx4 s[0:3], s[4:5], 0x0
+; GFX10-NEXT:    s_load_dwordx4 s[0:3], s[6:7], 0x0
 ; GFX10-NEXT:    v_mov_b32_e32 v0, 42
 ; GFX10-NEXT:    v_mov_b32_e32 v1, 0
 ; GFX10-NEXT:    s_waitcnt lgkmcnt(0)
@@ -502,7 +502,7 @@ define amdgpu_kernel void @global_atomic_dec_ret_i32_offset_system(ptr addrspace
 ;
 ; GFX11-LABEL: global_atomic_dec_ret_i32_offset_system:
 ; GFX11:       ; %bb.0:
-; GFX11-NEXT:    s_load_b128 s[0:3], s[0:1], 0x0
+; GFX11-NEXT:    s_load_b128 s[0:3], s[2:3], 0x0
 ; GFX11-NEXT:    v_dual_mov_b32 v0, 42 :: v_dual_mov_b32 v1, 0
 ; GFX11-NEXT:    s_waitcnt lgkmcnt(0)
 ; GFX11-NEXT:    global_atomic_dec_u32 v0, v1, v0, s[2:3] offset:16 glc
@@ -522,7 +522,7 @@ define amdgpu_kernel void @global_atomic_dec_ret_i32_offset...
[truncated]

Contributor

@shiltian shiltian left a comment

The non-test related changes look fairly straightforward. LGTM.


// FIXME: We can spill incoming arguments and restore at the end of the
// prolog.
if (!ScratchWaveOffsetReg)
Contributor

Is this change related?

Contributor Author

Kind of. One test hit the assert because of the SGPR-limiting attribute.

@rampitec
Collaborator

rampitec commented Feb 27, 2024

The main question, is it required for correctness now or not? I.e. can we use -O0 w/o the attributor?

If not required for -O0 then LGTM.

@arsenm
Contributor Author

arsenm commented Feb 28, 2024

> The main question, is it required for correctness now or not? I.e. can we use -O0 w/o the attributor?

No, it never was. The "required" parts were in the other pass, which has already been deleted.

@arsenm
Contributor Author

arsenm commented Mar 1, 2024

This is stuck because the OpenCL Blender test is failing in PSDB, almost certainly because perturbing anything exposes something else that was already broken.

arsenm added a commit to arsenm/llvm-project that referenced this pull request May 29, 2024
Explicitly mark the unused implicit arguments in the test, since this
should be sensitive to the number of free user SGPRs.

This is in preparation for llvm#83131.
arsenm added a commit that referenced this pull request May 30, 2024
Explicitly mark the unused implicit arguments in the test, since this
should be sensitive to the number of free user SGPRs.

This is in preparation for #83131.
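
A note for readers following the register changes in the diff: the kernarg segment pointer moves from s[4:5] to s[6:7] (s[0:1] to s[2:3] on GFX11). That is consistent with one additional 64-bit input being allocated ahead of it once llc no longer optimizes out implicit arguments. The sketch below is a simplified model of that numbering, not the backend's implementation. The struct and function names are hypothetical. The allocation order (private segment buffer, dispatch pointer, queue pointer, kernarg segment pointer) is an assumption based on the AMDGPU usage documentation. Which extra input is actually enabled in these tests is not stated on this page, so the dispatch pointer is used purely as an example.

```cpp
// Simplified, hypothetical model of user SGPR numbering for an AMDGPU kernel.
// Assumption: enabled inputs receive consecutive SGPRs in this fixed order:
//   private segment buffer (4 SGPRs), dispatch ptr (2), queue ptr (2),
//   kernarg segment ptr (2).
// The real backend tracks more inputs than this sketch does.
struct EnabledInputs {
  bool PrivateSegmentBuffer; // assumed absent for the GFX11 runs in the diff
  bool DispatchPtr;
  bool QueuePtr;
};

// Returns the index of the first SGPR that holds the kernarg segment pointer,
// i.e. the number of SGPRs consumed by the inputs allocated before it.
constexpr unsigned kernargPtrFirstSGPR(EnabledInputs In) {
  return (In.PrivateSegmentBuffer ? 4u : 0u) +
         (In.DispatchPtr ? 2u : 0u) +
         (In.QueuePtr ? 2u : 0u);
}

// Old CHECK lines: kernarg pointer in s[4:5] (s[0:1] on GFX11).
static_assert(kernargPtrFirstSGPR({true, false, false}) == 4, "s[4:5]");
static_assert(kernargPtrFirstSGPR({false, false, false}) == 0, "s[0:1]");
// New CHECK lines: one extra 64-bit input shifts it to s[6:7] (s[2:3]).
static_assert(kernargPtrFirstSGPR({true, true, false}) == 6, "s[6:7]");
static_assert(kernargPtrFirstSGPR({false, true, false}) == 2, "s[2:3]");
```

In this model, marking an input as unused with an attribute such as "amdgpu-no-dispatch-ptr" corresponds to clearing the matching flag, which is one way to see why a test that depends on the number of free user SGPRs needs those attributes spelled out explicitly.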
@arsenm arsenm mentioned this pull request Jun 6, 2024
@llvm-ci
Collaborator

llvm-ci commented Jul 14, 2024

LLVM Buildbot has detected a new failure on builder llvm-clang-x86_64-sie-ubuntu-fast running on sie-linux-worker while building clang,llvm at step 6 "test-build-unified-tree-check-all".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/144/builds/2221

Here is the relevant piece of the build log for the reference:

Step 6 (test-build-unified-tree-check-all) failure: test (failure)
******************** TEST 'Clang :: CodeGenHIP/default-attributes.hip' FAILED ********************
Exit Code: 1

Command Output (stderr):
--
RUN: at line 2: /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/build/bin/clang -cc1 -internal-isystem /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/build/lib/clang/19/include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/llvm-project/clang/test/CodeGenHIP/default-attributes.hip | /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/build/bin/FileCheck -check-prefix=OPTNONE /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/build/bin/clang -cc1 -internal-isystem /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/build/lib/clang/19/include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/build/bin/FileCheck -check-prefix=OPTNONE /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
RUN: at line 5: /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/build/bin/clang -cc1 -internal-isystem /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/build/lib/clang/19/include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/llvm-project/clang/test/CodeGenHIP/default-attributes.hip | /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/build/bin/FileCheck -check-prefix=OPT /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/build/bin/FileCheck -check-prefix=OPT /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/build/bin/clang -cc1 -internal-isystem /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/build/lib/clang/19/include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
/home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/llvm-project/clang/test/CodeGenHIP/default-attributes.hip:55:9: error: OPT: expected string not found in input
// OPT: attributes #[[ATTR0]] = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "amdgpu-no-agpr" "amdgpu-no-completion-action" "amdgpu-no-default-queue" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-lds-kernel-id" "amdgpu-no-multigrid-sync-arg" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "amdgpu-waves-per-eu"="4,10" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "uniform-work-group-size"="false" }
        ^
<stdin>:19:10: note: scanning from here
 ret void
         ^
<stdin>:19:10: note: with "ATTR0" equal to "0"
 ret void
         ^

Input file: <stdin>
Check file: /home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/llvm-project/clang/test/CodeGenHIP/default-attributes.hip

-dump-input=help explains the following input dump.

Input was:
<<<<<<
            1: ; ModuleID = '/home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/llvm-project/clang/test/CodeGenHIP/default-attributes.hip'
            2: source_filename = "/home/buildbot/buildbot-root/llvm-clang-x86_64-sie-ubuntu-fast/llvm-project/clang/test/CodeGenHIP/default-attributes.hip"
            3: target datalayout = "e-p:64:64-p1:64:64-p2:32:32-p3:32:32-p4:64:64-p5:32:32-p6:32:32-p7:160:256:256:32-p8:128:128-p9:192:256:256:32-i64:64-v16:16-v24:32-v32:32-v48:64-v96:128-v192:256-v256:256-v512:512-v1024:1024-v2048:2048-n32:64-S32-A5-G1-ni:7:8:9"
            4: target triple = "amdgcn-amd-amdhsa"
            5:
            6: @__hip_cuid_ = addrspace(1) global i8 0
check:16       ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            7: @__oclc_ABI_version = weak_odr hidden local_unnamed_addr addrspace(4) constant i32 500
check:17       ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            8: @llvm.compiler.used = appending addrspace(1) global [1 x ptr] [ptr addrspacecast (ptr addrspace(1) @__hip_cuid_ to ptr)], section "llvm.metadata"
check:18       ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            9:
           10: ; Function Attrs: mustprogress nofree norecurse nosync nounwind willreturn memory(none)
check:26         ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
           11: define dso_local void @_Z4funcv() local_unnamed_addr #0 {
label:27'0     ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
label:27'1     ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
same:28'0                                     ^~~~~~~~~~~~~~~~~~~~~~~~~~
same:28'1                                                            ^    captured var "ATTR0"
           12: entry:
next:29        ^~~~~~
           13:  ret void
...

@llvm-ci
Collaborator

llvm-ci commented Jul 14, 2024

LLVM Buildbot has detected a new failure on builder openmp-offload-amdgpu-runtime running on omp-vega20-0 while building clang,llvm at step 7 "Add check check-offload".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/30/builds/1923

Here is the relevant piece of the build log for the reference:

Step 7 (Add check check-offload) failure: test (failure)
******************** TEST 'libomptarget :: amdgcn-amd-amdhsa :: api/omp_dynamic_shared_memory_amdgpu.c' FAILED ********************
Exit Code: 1

Command Output (stdout):
--
# RUN: at line 1
/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/./bin/clang -fopenmp    -I /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.src/offload/test -I /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/openmp/runtime/src -L /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/offload -L /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/./lib -L /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/openmp/runtime/src  -nogpulib -Wl,-rpath,/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/offload -Wl,-rpath,/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/openmp/runtime/src -Wl,-rpath,/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/./lib  -fopenmp-targets=amdgcn-amd-amdhsa /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.src/offload/test/api/omp_dynamic_shared_memory_amdgpu.c -o /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/offload/test/amdgcn-amd-amdhsa/api/Output/omp_dynamic_shared_memory_amdgpu.c.tmp /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/./lib/libomptarget.devicertl.a -O1 -mllvm -openmp-opt-inline-device
# executed command: /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/./bin/clang -fopenmp -I /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.src/offload/test -I /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/openmp/runtime/src -L /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/offload -L /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/./lib -L /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/openmp/runtime/src -nogpulib -Wl,-rpath,/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/offload -Wl,-rpath,/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/openmp/runtime/src -Wl,-rpath,/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/./lib -fopenmp-targets=amdgcn-amd-amdhsa /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.src/offload/test/api/omp_dynamic_shared_memory_amdgpu.c -o /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/offload/test/amdgcn-amd-amdhsa/api/Output/omp_dynamic_shared_memory_amdgpu.c.tmp /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/./lib/libomptarget.devicertl.a -O1 -mllvm -openmp-opt-inline-device
# .---command stderr------------
# | clang-linker-wrapper: /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.src/llvm/lib/Target/AMDGPU/AMDGPUResourceUsageAnalysis.cpp:158: virtual bool llvm::AMDGPUResourceUsageAnalysis::runOnModule(llvm::Module&): Assertion `MF && "function must have been generated already"' failed.
# | PLEASE submit a bug report to https://github.com/llvm/llvm-project/issues/ and include the crash backtrace.
# | Stack dump:
# | 0.	Program arguments: /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/bin/clang-linker-wrapper --opt-level=O1 --host-triple=x86_64-unknown-linux-gnu -mllvm -openmp-opt-inline-device --linker-path=/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/bin/ld.lld -z relro --hash-style=gnu --eh-frame-hdr -m elf_x86_64 -pie -dynamic-linker /lib64/ld-linux-x86-64.so.2 -o /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/offload/test/amdgcn-amd-amdhsa/api/Output/omp_dynamic_shared_memory_amdgpu.c.tmp /lib/x86_64-linux-gnu/Scrt1.o /lib/x86_64-linux-gnu/crti.o /usr/lib/gcc/x86_64-linux-gnu/9/crtbeginS.o -L/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/offload -L/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/./lib -L/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/openmp/runtime/src -L/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/lib/clang/19/lib/x86_64-unknown-linux-gnu -L/usr/lib/gcc/x86_64-linux-gnu/9 -L/usr/lib/gcc/x86_64-linux-gnu/9/../../../../lib64 -L/lib/x86_64-linux-gnu -L/lib/../lib64 -L/usr/lib/x86_64-linux-gnu -L/usr/lib/../lib64 -L/lib -L/usr/lib -rpath /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/offload -rpath /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/openmp/runtime/src -rpath /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/./lib /tmp/lit-tmp-8gk6lv48/omp_dynamic_shared_memory_amdgpu-49581b.o /home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/./lib/libomptarget.devicertl.a -lomp -lomptarget -L/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/lib -lgcc --as-needed -lgcc_s --no-as-needed -lpthread -lc -lgcc --as-needed -lgcc_s --no-as-needed /usr/lib/gcc/x86_64-linux-gnu/9/crtendS.o /lib/x86_64-linux-gnu/crtn.o
# | 1.	Running pass 'Function register usage analysis' on module 'ld-temp.o'.
# |  #0 0x000055c720c0927f llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) (/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/bin/clang-linker-wrapper+0x1b2427f)
# |  #1 0x000055c720c067c4 SignalHandler(int) Signals.cpp:0:0
# |  #2 0x00007f54cd623420 __restore_rt (/lib/x86_64-linux-gnu/libpthread.so.0+0x14420)
# |  #3 0x00007f54cd0f000b raise (/lib/x86_64-linux-gnu/libc.so.6+0x4300b)
# |  #4 0x00007f54cd0cf859 abort (/lib/x86_64-linux-gnu/libc.so.6+0x22859)
# |  #5 0x00007f54cd0cf729 (/lib/x86_64-linux-gnu/libc.so.6+0x22729)
# |  #6 0x00007f54cd0e0fd6 (/lib/x86_64-linux-gnu/libc.so.6+0x33fd6)
# |  #7 0x000055c71fbf5106 llvm::AMDGPUResourceUsageAnalysis::runOnModule(llvm::Module&) (/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/bin/clang-linker-wrapper+0xb10106)
# |  #8 0x000055c7203407a5 llvm::legacy::PassManagerImpl::run(llvm::Module&) (/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/bin/clang-linker-wrapper+0x125b7a5)
# |  #9 0x000055c7212bffad codegen(llvm::lto::Config const&, llvm::TargetMachine*, std::function<llvm::Expected<std::unique_ptr<llvm::CachedFileStream, std::default_delete<llvm::CachedFileStream>>> (unsigned int, llvm::Twine const&)>, unsigned int, llvm::Module&, llvm::ModuleSummaryIndex const&) LTOBackend.cpp:0:0
# | #10 0x000055c7212c195d llvm::lto::backend(llvm::lto::Config const&, std::function<llvm::Expected<std::unique_ptr<llvm::CachedFileStream, std::default_delete<llvm::CachedFileStream>>> (unsigned int, llvm::Twine const&)>, unsigned int, llvm::Module&, llvm::ModuleSummaryIndex&) (/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/bin/clang-linker-wrapper+0x21dc95d)
# | #11 0x000055c7212b604a llvm::lto::LTO::runRegularLTO(std::function<llvm::Expected<std::unique_ptr<llvm::CachedFileStream, std::default_delete<llvm::CachedFileStream>>> (unsigned int, llvm::Twine const&)>) (/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/bin/clang-linker-wrapper+0x21d104a)
# | #12 0x000055c7212b658c llvm::lto::LTO::run(std::function<llvm::Expected<std::unique_ptr<llvm::CachedFileStream, std::default_delete<llvm::CachedFileStream>>> (unsigned int, llvm::Twine const&)>, std::function<llvm::Expected<std::function<llvm::Expected<std::unique_ptr<llvm::CachedFileStream, std::default_delete<llvm::CachedFileStream>>> (unsigned int, llvm::Twine const&)>> (unsigned int, llvm::StringRef, llvm::Twine const&)>) (/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/bin/clang-linker-wrapper+0x21d158c)
# | #13 0x000055c71f78d2c0 llvm::Error (anonymous namespace)::linkAndWrapDeviceFiles(llvm::SmallVectorImpl<llvm::SmallVector<llvm::object::OffloadFile, 3u>>&, llvm::opt::InputArgList const&, char**, int)::'lambda'(auto&)::operator()<llvm::SmallVector<llvm::object::OffloadFile, 3u>>(auto&) const ClangLinkerWrapper.cpp:0:0
# | #14 0x000055c71f791fe6 (anonymous namespace)::linkAndWrapDeviceFiles(llvm::SmallVectorImpl<llvm::SmallVector<llvm::object::OffloadFile, 3u>>&, llvm::opt::InputArgList const&, char**, int) ClangLinkerWrapper.cpp:0:0
# | #15 0x000055c71f6b338f main (/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/bin/clang-linker-wrapper+0x5ce38f)
# | #16 0x00007f54cd0d1083 __libc_start_main (/lib/x86_64-linux-gnu/libc.so.6+0x24083)
# | #17 0x000055c71f771b5e _start (/home/ompworker/bbot/openmp-offload-amdgpu-runtime/llvm.build/bin/clang-linker-wrapper+0x68cb5e)
...

@llvm-ci
Collaborator

llvm-ci commented Jul 14, 2024

LLVM Buildbot has detected a new failure on builder clang-ve-ninja running on hpce-ve-main while building clang,llvm at step 4 "annotate".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/12/builds/1814

Here is the relevant piece of the build log for reference:

Step 4 (annotate) failure: 'python ../llvm-zorg/zorg/buildbot/builders/annotated/ve-linux.py ...' (failure)
...
[297/303] Linking CXX executable tools/clang/unittests/Driver/ClangDriverTests
[298/303] Linking CXX executable tools/clang/unittests/CodeGen/ClangCodeGenTests
[299/303] Linking CXX executable tools/clang/unittests/Frontend/FrontendTests
[300/303] Linking CXX executable tools/clang/unittests/Interpreter/ExceptionTests/ClangReplInterpreterExceptionTests
[301/303] Linking CXX executable tools/clang/unittests/Interpreter/ClangReplInterpreterTests
[302/303] Linking CXX executable tools/clang/unittests/Tooling/ToolingTests
[302/303] Running the Clang regression tests
-- Testing: 20827 tests, 48 workers --
llvm-lit: /scratch/buildbot/bothome/clang-ve-ninja/llvm-project/llvm/utils/lit/lit/llvm/config.py:508: note: using clang: /scratch/buildbot/bothome/clang-ve-ninja/build/build_llvm/bin/clang
Testing:  0.. 10.. 20.. 30.. 40
FAIL: Clang :: CodeGenHIP/default-attributes.hip (9284 of 20827)
******************** TEST 'Clang :: CodeGenHIP/default-attributes.hip' FAILED ********************
Exit Code: 1

Command Output (stderr):
--
RUN: at line 2: /scratch/buildbot/bothome/clang-ve-ninja/build/build_llvm/bin/clang -cc1 -internal-isystem /scratch/buildbot/bothome/clang-ve-ninja/build/build_llvm/lib/clang/19/include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - /scratch/buildbot/bothome/clang-ve-ninja/llvm-project/clang/test/CodeGenHIP/default-attributes.hip | /scratch/buildbot/bothome/clang-ve-ninja/build/build_llvm/bin/FileCheck -check-prefix=OPTNONE /scratch/buildbot/bothome/clang-ve-ninja/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /scratch/buildbot/bothome/clang-ve-ninja/build/build_llvm/bin/clang -cc1 -internal-isystem /scratch/buildbot/bothome/clang-ve-ninja/build/build_llvm/lib/clang/19/include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - /scratch/buildbot/bothome/clang-ve-ninja/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /scratch/buildbot/bothome/clang-ve-ninja/build/build_llvm/bin/FileCheck -check-prefix=OPTNONE /scratch/buildbot/bothome/clang-ve-ninja/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
RUN: at line 5: /scratch/buildbot/bothome/clang-ve-ninja/build/build_llvm/bin/clang -cc1 -internal-isystem /scratch/buildbot/bothome/clang-ve-ninja/build/build_llvm/lib/clang/19/include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - /scratch/buildbot/bothome/clang-ve-ninja/llvm-project/clang/test/CodeGenHIP/default-attributes.hip | /scratch/buildbot/bothome/clang-ve-ninja/build/build_llvm/bin/FileCheck -check-prefix=OPT /scratch/buildbot/bothome/clang-ve-ninja/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /scratch/buildbot/bothome/clang-ve-ninja/build/build_llvm/bin/clang -cc1 -internal-isystem /scratch/buildbot/bothome/clang-ve-ninja/build/build_llvm/lib/clang/19/include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - /scratch/buildbot/bothome/clang-ve-ninja/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /scratch/buildbot/bothome/clang-ve-ninja/build/build_llvm/bin/FileCheck -check-prefix=OPT /scratch/buildbot/bothome/clang-ve-ninja/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
/scratch/buildbot/bothome/clang-ve-ninja/llvm-project/clang/test/CodeGenHIP/default-attributes.hip:55:9: error: OPT: expected string not found in input
// OPT: attributes #[[ATTR0]] = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "amdgpu-no-agpr" "amdgpu-no-completion-action" "amdgpu-no-default-queue" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-lds-kernel-id" "amdgpu-no-multigrid-sync-arg" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "amdgpu-waves-per-eu"="4,10" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "uniform-work-group-size"="false" }
        ^
<stdin>:19:10: note: scanning from here
 ret void
         ^
<stdin>:19:10: note: with "ATTR0" equal to "0"
 ret void
         ^

Input file: <stdin>
Check file: /scratch/buildbot/bothome/clang-ve-ninja/llvm-project/clang/test/CodeGenHIP/default-attributes.hip

-dump-input=help explains the following input dump.

Input was:
<<<<<<
            .
            .
            .
           14: } 
           15:  
           16: ; Function Attrs: mustprogress nofree norecurse nosync nounwind willreturn memory(none) 
           17: define dso_local amdgpu_kernel void @_Z6kernelv() local_unnamed_addr #1 { 
           18: entry: 
           19:  ret void 
check:55'0              X error: no match found
Step 8 (check-llvm) failure: check-llvm (failure)

@llvm-ci
Collaborator

llvm-ci commented Jul 14, 2024

LLVM Buildbot has detected a new failure on builder arc-builder running on arc-worker while building clang,llvm at step 6 "test-build-unified-tree-check-all".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/3/builds/1447

Here is the relevant piece of the build log for reference:

Step 6 (test-build-unified-tree-check-all) failure: test (failure)
******************** TEST 'Clang :: CodeGenHIP/default-attributes.hip' FAILED ********************
Exit Code: 1

Command Output (stderr):
--
RUN: at line 2: /buildbot/worker/arc-folder/build/bin/clang -cc1 -internal-isystem /buildbot/worker/arc-folder/build/lib/clang/19/include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - /buildbot/worker/arc-folder/llvm-project/clang/test/CodeGenHIP/default-attributes.hip | /buildbot/worker/arc-folder/build/bin/FileCheck -check-prefix=OPTNONE /buildbot/worker/arc-folder/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /buildbot/worker/arc-folder/build/bin/clang -cc1 -internal-isystem /buildbot/worker/arc-folder/build/lib/clang/19/include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - /buildbot/worker/arc-folder/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /buildbot/worker/arc-folder/build/bin/FileCheck -check-prefix=OPTNONE /buildbot/worker/arc-folder/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
RUN: at line 5: /buildbot/worker/arc-folder/build/bin/clang -cc1 -internal-isystem /buildbot/worker/arc-folder/build/lib/clang/19/include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - /buildbot/worker/arc-folder/llvm-project/clang/test/CodeGenHIP/default-attributes.hip | /buildbot/worker/arc-folder/build/bin/FileCheck -check-prefix=OPT /buildbot/worker/arc-folder/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /buildbot/worker/arc-folder/build/bin/FileCheck -check-prefix=OPT /buildbot/worker/arc-folder/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /buildbot/worker/arc-folder/build/bin/clang -cc1 -internal-isystem /buildbot/worker/arc-folder/build/lib/clang/19/include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - /buildbot/worker/arc-folder/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
/buildbot/worker/arc-folder/llvm-project/clang/test/CodeGenHIP/default-attributes.hip:55:9: error: OPT: expected string not found in input
// OPT: attributes #[[ATTR0]] = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "amdgpu-no-agpr" "amdgpu-no-completion-action" "amdgpu-no-default-queue" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-lds-kernel-id" "amdgpu-no-multigrid-sync-arg" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "amdgpu-waves-per-eu"="4,10" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "uniform-work-group-size"="false" }
        ^
<stdin>:19:10: note: scanning from here
 ret void
         ^
<stdin>:19:10: note: with "ATTR0" equal to "0"
 ret void
         ^

Input file: <stdin>
Check file: /buildbot/worker/arc-folder/llvm-project/clang/test/CodeGenHIP/default-attributes.hip

-dump-input=help explains the following input dump.

Input was:
<<<<<<
            .
            .
            .
           14: } 
           15:  
           16: ; Function Attrs: mustprogress nofree norecurse nosync nounwind willreturn memory(none) 
           17: define dso_local amdgpu_kernel void @_Z6kernelv() local_unnamed_addr #1 { 
           18: entry: 
           19:  ret void 
check:55'0              X error: no match found
check:55'1                with "ATTR0" equal to "0"
           20: } 
check:55'0     ~~
           21:  
check:55'0     ~
           22: attributes #0 = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "no-trapping-math"="true" "stack-protector-buffer-size"="8" } 
check:55'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
           23: attributes #1 = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "amdgpu-flat-work-group-size"="1,1024" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "uniform-work-group-size"="true" } 
check:55'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
           24:  
check:55'0     ~
            .
...

@llvm-ci
Collaborator

llvm-ci commented Jul 14, 2024

LLVM Buildbot has detected a new failure on builder clang-hip-vega20 running on hip-vega20-0 while building clang,llvm at step 3 "annotate".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/123/builds/1764

Here is the relevant piece of the build log for reference:

Step 3 (annotate) failure: '../llvm-zorg/zorg/buildbot/builders/annotated/hip-build.sh --jobs=' (failure)
...
[36/38] : && /buildbot/hip-vega20-0/clang-hip-vega20/llvm/bin/clang++ -O3 -DNDEBUG  External/HIP/CMakeFiles/InOneWeekend-hip-6.0.2.dir/workload/ray-tracing/InOneWeekend/main.cc.o -o External/HIP/InOneWeekend-hip-6.0.2  --rocm-path=/buildbot/Externals/hip/rocm-6.0.2 --hip-link -rtlib=compiler-rt -unwindlib=libgcc -frtlib-add-rpath && cd /buildbot/hip-vega20-0/clang-hip-vega20/test-suite-build/External/HIP && /usr/local/bin/cmake -E create_symlink /buildbot/llvm-test-suite/External/HIP/InOneWeekend.reference_output /buildbot/hip-vega20-0/clang-hip-vega20/test-suite-build/External/HIP/InOneWeekend.reference_output-hip-6.0.2
[37/38] /buildbot/hip-vega20-0/clang-hip-vega20/llvm/bin/clang++ -DNDEBUG  -O3 -DNDEBUG   -w -Werror=date-time --rocm-path=/buildbot/Externals/hip/rocm-6.0.2 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx1030 --offload-arch=gfx1100 -xhip -mfma -MD -MT External/HIP/CMakeFiles/TheNextWeek-hip-6.0.2.dir/workload/ray-tracing/TheNextWeek/main.cc.o -MF External/HIP/CMakeFiles/TheNextWeek-hip-6.0.2.dir/workload/ray-tracing/TheNextWeek/main.cc.o.d -o External/HIP/CMakeFiles/TheNextWeek-hip-6.0.2.dir/workload/ray-tracing/TheNextWeek/main.cc.o -c /buildbot/llvm-test-suite/External/HIP/workload/ray-tracing/TheNextWeek/main.cc
[38/38] : && /buildbot/hip-vega20-0/clang-hip-vega20/llvm/bin/clang++ -O3 -DNDEBUG  External/HIP/CMakeFiles/TheNextWeek-hip-6.0.2.dir/workload/ray-tracing/TheNextWeek/main.cc.o -o External/HIP/TheNextWeek-hip-6.0.2  --rocm-path=/buildbot/Externals/hip/rocm-6.0.2 --hip-link -rtlib=compiler-rt -unwindlib=libgcc -frtlib-add-rpath && cd /buildbot/hip-vega20-0/clang-hip-vega20/test-suite-build/External/HIP && /usr/local/bin/cmake -E create_symlink /buildbot/llvm-test-suite/External/HIP/TheNextWeek.reference_output /buildbot/hip-vega20-0/clang-hip-vega20/test-suite-build/External/HIP/TheNextWeek.reference_output-hip-6.0.2
+ build_step 'Testing HIP test-suite'
+ echo '@@@BUILD_STEP Testing HIP test-suite@@@'
+ ninja -v check-hip-simple
@@@BUILD_STEP Testing HIP test-suite@@@
[0/1] cd /buildbot/hip-vega20-0/clang-hip-vega20/test-suite-build/External/HIP && /buildbot/hip-vega20-0/clang-hip-vega20/llvm/bin/llvm-lit -sv empty-hip-6.0.2.test with-fopenmp-hip-6.0.2.test saxpy-hip-6.0.2.test InOneWeekend-hip-6.0.2.test TheNextWeek-hip-6.0.2.test blender.test
-- Testing: 6 tests, 6 workers --
Testing:  0.. 10.. 20.. 30.. 40.. 
FAIL: test-suite :: External/HIP/InOneWeekend-hip-6.0.2.test (4 of 6)
******************** TEST 'test-suite :: External/HIP/InOneWeekend-hip-6.0.2.test' FAILED ********************

/buildbot/hip-vega20-0/clang-hip-vega20/test-suite-build/tools/timeit-target --timeout 7200 --limit-core 0 --limit-cpu 7200 --limit-file-size 209715200 --limit-rss-size 838860800 --append-exitstatus --redirect-output /buildbot/hip-vega20-0/clang-hip-vega20/test-suite-build/External/HIP/Output/InOneWeekend-hip-6.0.2.test.out --redirect-input /dev/null --summary /buildbot/hip-vega20-0/clang-hip-vega20/test-suite-build/External/HIP/Output/InOneWeekend-hip-6.0.2.test.time /buildbot/hip-vega20-0/clang-hip-vega20/test-suite-build/External/HIP/InOneWeekend-hip-6.0.2
cd /buildbot/hip-vega20-0/clang-hip-vega20/test-suite-build/External/HIP ; /buildbot/hip-vega20-0/clang-hip-vega20/test-suite-build/tools/fpcmp-target /buildbot/hip-vega20-0/clang-hip-vega20/test-suite-build/External/HIP/Output/InOneWeekend-hip-6.0.2.test.out InOneWeekend.reference_output-hip-6.0.2

+ cd /buildbot/hip-vega20-0/clang-hip-vega20/test-suite-build/External/HIP
+ /buildbot/hip-vega20-0/clang-hip-vega20/test-suite-build/tools/fpcmp-target /buildbot/hip-vega20-0/clang-hip-vega20/test-suite-build/External/HIP/Output/InOneWeekend-hip-6.0.2.test.out InOneWeekend.reference_output-hip-6.0.2
/buildbot/hip-vega20-0/clang-hip-vega20/test-suite-build/tools/fpcmp-target: Comparison failed, textual difference between 'M' and 'i'

********************
/usr/bin/strip: /bin/bash.stripped: Bad file descriptor
Testing:  0.. 10.. 20.. 30.. 40.. 50.. 60.. 70.. 80.. 90.. 
********************
Failed Tests (1):
  test-suite :: External/HIP/InOneWeekend-hip-6.0.2.test


Testing Time: 382.10s

Total Discovered Tests: 6
  Passed: 5 (83.33%)
  Failed: 1 (16.67%)
FAILED: External/HIP/CMakeFiles/check-hip-simple-hip-6.0.2 
cd /buildbot/hip-vega20-0/clang-hip-vega20/test-suite-build/External/HIP && /buildbot/hip-vega20-0/clang-hip-vega20/llvm/bin/llvm-lit -sv empty-hip-6.0.2.test with-fopenmp-hip-6.0.2.test saxpy-hip-6.0.2.test InOneWeekend-hip-6.0.2.test TheNextWeek-hip-6.0.2.test blender.test
ninja: build stopped: subcommand failed.
Step 12 (Testing HIP test-suite) failure: Testing HIP test-suite (failure)
program finished with exit code 1
elapsedTime=500.701520

@llvm-ci
Collaborator

llvm-ci commented Jul 14, 2024

LLVM Buildbot has detected a new failure on builder openmp-offload-libc-amdgpu-runtime running on omp-vega20-1 while building clang,llvm at step 10 "Add check check-offload".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/73/builds/1794

Here is the relevant piece of the build log for reference:

Step 10 (Add check check-offload) failure: test (failure)
******************** TEST 'libomptarget :: amdgcn-amd-amdhsa :: api/omp_dynamic_shared_memory_mixed_amdgpu.c' FAILED ********************
Exit Code: 1

Command Output (stdout):
--
# RUN: at line 1
/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/./bin/clang -fopenmp    -I /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.src/offload/test -I /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/openmp/runtime/src -L /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/offload -L /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/./lib -L /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/openmp/runtime/src  -nogpulib -Wl,-rpath,/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/offload -Wl,-rpath,/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/openmp/runtime/src -Wl,-rpath,/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/./lib  -fopenmp-targets=amdgcn-amd-amdhsa /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.src/offload/test/api/omp_dynamic_shared_memory_mixed_amdgpu.c -o /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/offload/test/amdgcn-amd-amdhsa/api/Output/omp_dynamic_shared_memory_mixed_amdgpu.c.tmp /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/./lib/libcgpu-amdgpu.a /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/./lib/libomptarget.devicertl.a -O1 -mllvm -openmp-opt-inline-device -I /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.src/offload/test/api
# executed command: /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/./bin/clang -fopenmp -I /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.src/offload/test -I /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/openmp/runtime/src -L /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/offload -L /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/./lib -L /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/openmp/runtime/src -nogpulib -Wl,-rpath,/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/offload -Wl,-rpath,/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/openmp/runtime/src -Wl,-rpath,/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/./lib -fopenmp-targets=amdgcn-amd-amdhsa /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.src/offload/test/api/omp_dynamic_shared_memory_mixed_amdgpu.c -o /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/offload/test/amdgcn-amd-amdhsa/api/Output/omp_dynamic_shared_memory_mixed_amdgpu.c.tmp /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/./lib/libcgpu-amdgpu.a /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/./lib/libomptarget.devicertl.a -O1 -mllvm -openmp-opt-inline-device -I /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.src/offload/test/api
# .---command stderr------------
# | clang-linker-wrapper: /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.src/llvm/lib/Target/AMDGPU/AMDGPUResourceUsageAnalysis.cpp:158: virtual bool llvm::AMDGPUResourceUsageAnalysis::runOnModule(llvm::Module&): Assertion `MF && "function must have been generated already"' failed.
# | PLEASE submit a bug report to https://github.com/llvm/llvm-project/issues/ and include the crash backtrace.
# | Stack dump:
# | 0.	Program arguments: /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/bin/clang-linker-wrapper --opt-level=O1 --host-triple=x86_64-unknown-linux-gnu -mllvm -openmp-opt-inline-device --linker-path=/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/bin/ld.lld -z relro --hash-style=gnu --eh-frame-hdr -m elf_x86_64 -pie -dynamic-linker /lib64/ld-linux-x86-64.so.2 -o /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/offload/test/amdgcn-amd-amdhsa/api/Output/omp_dynamic_shared_memory_mixed_amdgpu.c.tmp /lib/x86_64-linux-gnu/Scrt1.o /lib/x86_64-linux-gnu/crti.o /usr/lib/gcc/x86_64-linux-gnu/9/crtbeginS.o -L/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/offload -L/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/./lib -L/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/openmp/runtime/src -L/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/lib/clang/19/lib/x86_64-unknown-linux-gnu -L/usr/lib/gcc/x86_64-linux-gnu/9 -L/usr/lib/gcc/x86_64-linux-gnu/9/../../../../lib64 -L/lib/x86_64-linux-gnu -L/lib/../lib64 -L/usr/lib/x86_64-linux-gnu -L/usr/lib/../lib64 -L/lib -L/usr/lib -rpath /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/offload -rpath /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/runtimes/runtimes-bins/openmp/runtime/src -rpath /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/./lib /tmp/lit-tmp-7s3tyfa7/omp_dynamic_shared_memory_mixed_amdgpu-df1148.o /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/./lib/libcgpu-amdgpu.a /home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/./lib/libomptarget.devicertl.a -lomp -lomptarget -L/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/lib -lgcc --as-needed -lgcc_s --no-as-needed -lpthread -lc -lgcc --as-needed 
-lgcc_s --no-as-needed /usr/lib/gcc/x86_64-linux-gnu/9/crtendS.o /lib/x86_64-linux-gnu/crtn.o
# | 1.	Running pass 'Function register usage analysis' on module 'ld-temp.o'.
# |  #0 0x000055fab1a1e27f llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) (/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/bin/clang-linker-wrapper+0x1b2427f)
# |  #1 0x000055fab1a1b7c4 SignalHandler(int) Signals.cpp:0:0
# |  #2 0x00007ff56bdf8420 __restore_rt (/lib/x86_64-linux-gnu/libpthread.so.0+0x14420)
# |  #3 0x00007ff56b8c500b raise /build/glibc-LcI20x/glibc-2.31/signal/../sysdeps/unix/sysv/linux/raise.c:51:1
# |  #4 0x00007ff56b8a4859 abort /build/glibc-LcI20x/glibc-2.31/stdlib/abort.c:81:7
# |  #5 0x00007ff56b8a4729 get_sysdep_segment_value /build/glibc-LcI20x/glibc-2.31/intl/loadmsgcat.c:509:8
# |  #6 0x00007ff56b8a4729 _nl_load_domain /build/glibc-LcI20x/glibc-2.31/intl/loadmsgcat.c:970:34
# |  #7 0x00007ff56b8b5fd6 (/lib/x86_64-linux-gnu/libc.so.6+0x33fd6)
# |  #8 0x000055fab0a0a106 llvm::AMDGPUResourceUsageAnalysis::runOnModule(llvm::Module&) (/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/bin/clang-linker-wrapper+0xb10106)
# |  #9 0x000055fab11557a5 llvm::legacy::PassManagerImpl::run(llvm::Module&) (/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/bin/clang-linker-wrapper+0x125b7a5)
# | #10 0x000055fab20d50bd codegen(llvm::lto::Config const&, llvm::TargetMachine*, std::function<llvm::Expected<std::unique_ptr<llvm::CachedFileStream, std::default_delete<llvm::CachedFileStream>>> (unsigned int, llvm::Twine const&)>, unsigned int, llvm::Module&, llvm::ModuleSummaryIndex const&) LTOBackend.cpp:0:0
# | #11 0x000055fab20d6a6d llvm::lto::backend(llvm::lto::Config const&, std::function<llvm::Expected<std::unique_ptr<llvm::CachedFileStream, std::default_delete<llvm::CachedFileStream>>> (unsigned int, llvm::Twine const&)>, unsigned int, llvm::Module&, llvm::ModuleSummaryIndex&) (/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/bin/clang-linker-wrapper+0x21dca6d)
# | #12 0x000055fab20cb15a llvm::lto::LTO::runRegularLTO(std::function<llvm::Expected<std::unique_ptr<llvm::CachedFileStream, std::default_delete<llvm::CachedFileStream>>> (unsigned int, llvm::Twine const&)>) (/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/bin/clang-linker-wrapper+0x21d115a)
# | #13 0x000055fab20cb69c llvm::lto::LTO::run(std::function<llvm::Expected<std::unique_ptr<llvm::CachedFileStream, std::default_delete<llvm::CachedFileStream>>> (unsigned int, llvm::Twine const&)>, std::function<llvm::Expected<std::function<llvm::Expected<std::unique_ptr<llvm::CachedFileStream, std::default_delete<llvm::CachedFileStream>>> (unsigned int, llvm::Twine const&)>> (unsigned int, llvm::StringRef, llvm::Twine const&)>) (/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/bin/clang-linker-wrapper+0x21d169c)
# | #14 0x000055fab05a22c0 llvm::Error (anonymous namespace)::linkAndWrapDeviceFiles(llvm::SmallVectorImpl<llvm::SmallVector<llvm::object::OffloadFile, 3u>>&, llvm::opt::InputArgList const&, char**, int)::'lambda'(auto&)::operator()<llvm::SmallVector<llvm::object::OffloadFile, 3u>>(auto&) const ClangLinkerWrapper.cpp:0:0
# | #15 0x000055fab05a6fe6 (anonymous namespace)::linkAndWrapDeviceFiles(llvm::SmallVectorImpl<llvm::SmallVector<llvm::object::OffloadFile, 3u>>&, llvm::opt::InputArgList const&, char**, int) ClangLinkerWrapper.cpp:0:0
# | #16 0x000055fab04c838f main (/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/bin/clang-linker-wrapper+0x5ce38f)
# | #17 0x00007ff56b8a6083 __libc_start_main /build/glibc-LcI20x/glibc-2.31/csu/../csu/libc-start.c:342:3
# | #18 0x000055fab0586b5e _start (/home/ompworker/bbot/openmp-offload-libc-amdgpu-runtime/llvm.build/bin/clang-linker-wrapper+0x68cb5e)
...

@llvm-ci
Collaborator

llvm-ci commented Jul 14, 2024

LLVM Buildbot has detected a new failure on builder llvm-clang-x86_64-sie-win running on sie-win-worker while building clang,llvm at step 7 "test-build-unified-tree-check-all".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/46/builds/1587

Here is the relevant piece of the build log for the reference:

Step 7 (test-build-unified-tree-check-all) failure: test (failure)
******************** TEST 'Clang :: CodeGenHIP/default-attributes.hip' FAILED ********************
Exit Code: 1

Command Output (stdout):
--
# RUN: at line 2
z:\b\llvm-clang-x86_64-sie-win\build\bin\clang.exe -cc1 -internal-isystem Z:\b\llvm-clang-x86_64-sie-win\build\lib\clang\19\include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - Z:\b\llvm-clang-x86_64-sie-win\llvm-project\clang\test\CodeGenHIP\default-attributes.hip | z:\b\llvm-clang-x86_64-sie-win\build\bin\filecheck.exe -check-prefix=OPTNONE Z:\b\llvm-clang-x86_64-sie-win\llvm-project\clang\test\CodeGenHIP\default-attributes.hip
# executed command: 'z:\b\llvm-clang-x86_64-sie-win\build\bin\clang.exe' -cc1 -internal-isystem 'Z:\b\llvm-clang-x86_64-sie-win\build\lib\clang\19\include' -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - 'Z:\b\llvm-clang-x86_64-sie-win\llvm-project\clang\test\CodeGenHIP\default-attributes.hip'
# executed command: 'z:\b\llvm-clang-x86_64-sie-win\build\bin\filecheck.exe' -check-prefix=OPTNONE 'Z:\b\llvm-clang-x86_64-sie-win\llvm-project\clang\test\CodeGenHIP\default-attributes.hip'
# RUN: at line 5
z:\b\llvm-clang-x86_64-sie-win\build\bin\clang.exe -cc1 -internal-isystem Z:\b\llvm-clang-x86_64-sie-win\build\lib\clang\19\include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - Z:\b\llvm-clang-x86_64-sie-win\llvm-project\clang\test\CodeGenHIP\default-attributes.hip | z:\b\llvm-clang-x86_64-sie-win\build\bin\filecheck.exe -check-prefix=OPT Z:\b\llvm-clang-x86_64-sie-win\llvm-project\clang\test\CodeGenHIP\default-attributes.hip
# executed command: 'z:\b\llvm-clang-x86_64-sie-win\build\bin\clang.exe' -cc1 -internal-isystem 'Z:\b\llvm-clang-x86_64-sie-win\build\lib\clang\19\include' -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - 'Z:\b\llvm-clang-x86_64-sie-win\llvm-project\clang\test\CodeGenHIP\default-attributes.hip'
# executed command: 'z:\b\llvm-clang-x86_64-sie-win\build\bin\filecheck.exe' -check-prefix=OPT 'Z:\b\llvm-clang-x86_64-sie-win\llvm-project\clang\test\CodeGenHIP\default-attributes.hip'
# .---command stderr------------
# | Z:\b\llvm-clang-x86_64-sie-win\llvm-project\clang\test\CodeGenHIP\default-attributes.hip:55:9: error: OPT: expected string not found in input
# | // OPT: attributes #[[ATTR0]] = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "amdgpu-no-agpr" "amdgpu-no-completion-action" "amdgpu-no-default-queue" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-lds-kernel-id" "amdgpu-no-multigrid-sync-arg" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "amdgpu-waves-per-eu"="4,10" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "uniform-work-group-size"="false" }
# |         ^
# | <stdin>:19:10: note: scanning from here
# |  ret void
# |          ^
# | <stdin>:19:10: note: with "ATTR0" equal to "0"
# |  ret void
# |          ^
# |
# | Input file: <stdin>
# | Check file: Z:\b\llvm-clang-x86_64-sie-win\llvm-project\clang\test\CodeGenHIP\default-attributes.hip
# | 
# | -dump-input=help explains the following input dump.
# | 
# | Input was:
# | <<<<<<
# |             1: ; ModuleID = 'Z:\b\llvm-clang-x86_64-sie-win\llvm-project\clang\test\CodeGenHIP\default-attributes.hip'
# |             2: source_filename = "Z:\\b\\llvm-clang-x86_64-sie-win\\llvm-project\\clang\\test\\CodeGenHIP\\default-attributes.hip"
# |             3: target datalayout = "e-p:64:64-p1:64:64-p2:32:32-p3:32:32-p4:64:64-p5:32:32-p6:32:32-p7:160:256:256:32-p8:128:128-p9:192:256:256:32-i64:64-v16:16-v24:32-v32:32-v48:64-v96:128-v192:256-v256:256-v512:512-v1024:1024-v2048:2048-n32:64-S32-A5-G1-ni:7:8:9"
# |             4: target triple = "amdgcn-amd-amdhsa"
# |             5:
# |             6: @__hip_cuid_ = addrspace(1) global i8 0
# | check:16       ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |             7: @__oclc_ABI_version = weak_odr hidden local_unnamed_addr addrspace(4) constant i32 500
# | check:17       ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |             8: @llvm.compiler.used = appending addrspace(1) global [1 x ptr] [ptr addrspacecast (ptr addrspace(1) @__hip_cuid_ to ptr)], section "llvm.metadata"
# | check:18       ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |             9:
# |            10: ; Function Attrs: mustprogress nofree norecurse nosync nounwind willreturn memory(none)
# | check:26         ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |            11: define dso_local void @_Z4funcv() local_unnamed_addr #0 {
# | label:27'0     ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# | label:27'1     ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# | same:28'0                                     ^~~~~~~~~~~~~~~~~~~~~~~~~~
# | same:28'1                                                            ^    captured var "ATTR0"
...

@llvm-ci
Collaborator

llvm-ci commented Jul 14, 2024

LLVM Buildbot has detected a new failure on builder clang-cmake-x86_64-avx512-linux running on avx512-intel64 while building clang,llvm at step 7 "ninja check 1".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/133/builds/1521

Here is the relevant piece of the build log for the reference:

Step 7 (ninja check 1) failure: stage 1 checked (failure)
******************** TEST 'Clang :: CodeGenHIP/default-attributes.hip' FAILED ********************
Exit Code: 1

Command Output (stderr):
--
RUN: at line 2: /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/stage1/bin/clang -cc1 -internal-isystem /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/stage1/lib/clang/19/include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/llvm/clang/test/CodeGenHIP/default-attributes.hip | /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/stage1/bin/FileCheck -check-prefix=OPTNONE /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/llvm/clang/test/CodeGenHIP/default-attributes.hip
+ /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/stage1/bin/clang -cc1 -internal-isystem /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/stage1/lib/clang/19/include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/llvm/clang/test/CodeGenHIP/default-attributes.hip
+ /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/stage1/bin/FileCheck -check-prefix=OPTNONE /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/llvm/clang/test/CodeGenHIP/default-attributes.hip
RUN: at line 5: /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/stage1/bin/clang -cc1 -internal-isystem /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/stage1/lib/clang/19/include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/llvm/clang/test/CodeGenHIP/default-attributes.hip | /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/stage1/bin/FileCheck -check-prefix=OPT /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/llvm/clang/test/CodeGenHIP/default-attributes.hip
+ /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/stage1/bin/clang -cc1 -internal-isystem /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/stage1/lib/clang/19/include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/llvm/clang/test/CodeGenHIP/default-attributes.hip
+ /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/stage1/bin/FileCheck -check-prefix=OPT /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/llvm/clang/test/CodeGenHIP/default-attributes.hip
/localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/llvm/clang/test/CodeGenHIP/default-attributes.hip:55:9: error: OPT: expected string not found in input
// OPT: attributes #[[ATTR0]] = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "amdgpu-no-agpr" "amdgpu-no-completion-action" "amdgpu-no-default-queue" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-lds-kernel-id" "amdgpu-no-multigrid-sync-arg" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "amdgpu-waves-per-eu"="4,10" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "uniform-work-group-size"="false" }
        ^
<stdin>:19:10: note: scanning from here
 ret void
         ^
<stdin>:19:10: note: with "ATTR0" equal to "0"
 ret void
         ^

Input file: <stdin>
Check file: /localdisk2/buildbot/llvm-worker/clang-cmake-x86_64-avx512-linux/llvm/clang/test/CodeGenHIP/default-attributes.hip

-dump-input=help explains the following input dump.

Input was:
<<<<<<
            .
            .
            .
           14: } 
           15:  
           16: ; Function Attrs: mustprogress nofree norecurse nosync nounwind willreturn memory(none) 
           17: define dso_local amdgpu_kernel void @_Z6kernelv() local_unnamed_addr #1 { 
           18: entry: 
           19:  ret void 
check:55'0              X error: no match found
check:55'1                with "ATTR0" equal to "0"
           20: } 
check:55'0     ~~
           21:  
check:55'0     ~
           22: attributes #0 = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "no-trapping-math"="true" "stack-protector-buffer-size"="8" } 
check:55'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
           23: attributes #1 = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "amdgpu-flat-work-group-size"="1,1024" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "uniform-work-group-size"="true" } 
check:55'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
           24:  
check:55'0     ~
            .
...

@llvm-ci
Collaborator

llvm-ci commented Jul 14, 2024

LLVM Buildbot has detected a new failure on builder llvm-clang-aarch64-darwin running on doug-worker-4 while building clang,llvm at step 6 "test-build-unified-tree-check-all".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/190/builds/1798

Here is the relevant piece of the build log for the reference:

Step 6 (test-build-unified-tree-check-all) failure: test (failure)
******************** TEST 'Clang :: CodeGenHIP/default-attributes.hip' FAILED ********************
Exit Code: 1

Command Output (stderr):
--
RUN: at line 2: /Users/buildbot/buildbot-root/aarch64-darwin/build/bin/clang -cc1 -internal-isystem /Users/buildbot/buildbot-root/aarch64-darwin/build/lib/clang/19/include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - /Users/buildbot/buildbot-root/aarch64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip | /Users/buildbot/buildbot-root/aarch64-darwin/build/bin/FileCheck -check-prefix=OPTNONE /Users/buildbot/buildbot-root/aarch64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /Users/buildbot/buildbot-root/aarch64-darwin/build/bin/clang -cc1 -internal-isystem /Users/buildbot/buildbot-root/aarch64-darwin/build/lib/clang/19/include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - /Users/buildbot/buildbot-root/aarch64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /Users/buildbot/buildbot-root/aarch64-darwin/build/bin/FileCheck -check-prefix=OPTNONE /Users/buildbot/buildbot-root/aarch64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
RUN: at line 5: /Users/buildbot/buildbot-root/aarch64-darwin/build/bin/clang -cc1 -internal-isystem /Users/buildbot/buildbot-root/aarch64-darwin/build/lib/clang/19/include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - /Users/buildbot/buildbot-root/aarch64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip | /Users/buildbot/buildbot-root/aarch64-darwin/build/bin/FileCheck -check-prefix=OPT /Users/buildbot/buildbot-root/aarch64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /Users/buildbot/buildbot-root/aarch64-darwin/build/bin/clang -cc1 -internal-isystem /Users/buildbot/buildbot-root/aarch64-darwin/build/lib/clang/19/include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - /Users/buildbot/buildbot-root/aarch64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /Users/buildbot/buildbot-root/aarch64-darwin/build/bin/FileCheck -check-prefix=OPT /Users/buildbot/buildbot-root/aarch64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
/Users/buildbot/buildbot-root/aarch64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip:55:9: error: OPT: expected string not found in input
// OPT: attributes #[[ATTR0]] = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "amdgpu-no-agpr" "amdgpu-no-completion-action" "amdgpu-no-default-queue" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-lds-kernel-id" "amdgpu-no-multigrid-sync-arg" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "amdgpu-waves-per-eu"="4,10" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "uniform-work-group-size"="false" }
        ^
<stdin>:19:10: note: scanning from here
 ret void
         ^
<stdin>:19:10: note: with "ATTR0" equal to "0"
 ret void
         ^

Input file: <stdin>
Check file: /Users/buildbot/buildbot-root/aarch64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip

-dump-input=help explains the following input dump.

Input was:
<<<<<<
            1: ; ModuleID = '/Users/buildbot/buildbot-root/aarch64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip'
            2: source_filename = "/Users/buildbot/buildbot-root/aarch64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip"
            3: target datalayout = "e-p:64:64-p1:64:64-p2:32:32-p3:32:32-p4:64:64-p5:32:32-p6:32:32-p7:160:256:256:32-p8:128:128-p9:192:256:256:32-i64:64-v16:16-v24:32-v32:32-v48:64-v96:128-v192:256-v256:256-v512:512-v1024:1024-v2048:2048-n32:64-S32-A5-G1-ni:7:8:9"
            4: target triple = "amdgcn-amd-amdhsa"
            5:
            6: @__hip_cuid_ = addrspace(1) global i8 0
check:16       ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            7: @__oclc_ABI_version = weak_odr hidden local_unnamed_addr addrspace(4) constant i32 500
check:17       ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            8: @llvm.compiler.used = appending addrspace(1) global [1 x ptr] [ptr addrspacecast (ptr addrspace(1) @__hip_cuid_ to ptr)], section "llvm.metadata"
check:18       ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            9:
           10: ; Function Attrs: mustprogress nofree norecurse nosync nounwind willreturn memory(none)
check:26         ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
           11: define dso_local void @_Z4funcv() local_unnamed_addr #0 {
label:27'0     ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
label:27'1     ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
same:28'0                                     ^~~~~~~~~~~~~~~~~~~~~~~~~~
same:28'1                                                            ^    captured var "ATTR0"
           12: entry:
next:29        ^~~~~~
           13:  ret void
...

@llvm-ci
Collaborator

llvm-ci commented Jul 14, 2024

LLVM Buildbot has detected a new failure on builder clang-armv8-quick running on linaro-clang-armv8-quick while building clang,llvm at step 5 "ninja check 1".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/154/builds/1345

Here is the relevant piece of the build log for the reference:

Step 5 (ninja check 1) failure: stage 1 checked (failure)
******************** TEST 'Clang :: CodeGenHIP/default-attributes.hip' FAILED ********************
Exit Code: 1

Command Output (stderr):
--
RUN: at line 2: /home/tcwg-buildbot/worker/clang-armv8-quick/stage1/bin/clang -cc1 -internal-isystem /home/tcwg-buildbot/worker/clang-armv8-quick/stage1/lib/clang/19/include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - /home/tcwg-buildbot/worker/clang-armv8-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip | /home/tcwg-buildbot/worker/clang-armv8-quick/stage1/bin/FileCheck -check-prefix=OPTNONE /home/tcwg-buildbot/worker/clang-armv8-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip
+ /home/tcwg-buildbot/worker/clang-armv8-quick/stage1/bin/clang -cc1 -internal-isystem /home/tcwg-buildbot/worker/clang-armv8-quick/stage1/lib/clang/19/include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - /home/tcwg-buildbot/worker/clang-armv8-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip
+ /home/tcwg-buildbot/worker/clang-armv8-quick/stage1/bin/FileCheck -check-prefix=OPTNONE /home/tcwg-buildbot/worker/clang-armv8-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip
RUN: at line 5: /home/tcwg-buildbot/worker/clang-armv8-quick/stage1/bin/clang -cc1 -internal-isystem /home/tcwg-buildbot/worker/clang-armv8-quick/stage1/lib/clang/19/include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - /home/tcwg-buildbot/worker/clang-armv8-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip | /home/tcwg-buildbot/worker/clang-armv8-quick/stage1/bin/FileCheck -check-prefix=OPT /home/tcwg-buildbot/worker/clang-armv8-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip
+ /home/tcwg-buildbot/worker/clang-armv8-quick/stage1/bin/clang -cc1 -internal-isystem /home/tcwg-buildbot/worker/clang-armv8-quick/stage1/lib/clang/19/include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - /home/tcwg-buildbot/worker/clang-armv8-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip
+ /home/tcwg-buildbot/worker/clang-armv8-quick/stage1/bin/FileCheck -check-prefix=OPT /home/tcwg-buildbot/worker/clang-armv8-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip
/home/tcwg-buildbot/worker/clang-armv8-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip:55:9: error: OPT: expected string not found in input
// OPT: attributes #[[ATTR0]] = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "amdgpu-no-agpr" "amdgpu-no-completion-action" "amdgpu-no-default-queue" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-lds-kernel-id" "amdgpu-no-multigrid-sync-arg" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "amdgpu-waves-per-eu"="4,10" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "uniform-work-group-size"="false" }
        ^
<stdin>:19:10: note: scanning from here
 ret void
         ^
<stdin>:19:10: note: with "ATTR0" equal to "0"
 ret void
         ^

Input file: <stdin>
Check file: /home/tcwg-buildbot/worker/clang-armv8-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip

-dump-input=help explains the following input dump.

Input was:
<<<<<<
            .
            .
            .
           14: } 
           15:  
           16: ; Function Attrs: mustprogress nofree norecurse nosync nounwind willreturn memory(none) 
           17: define dso_local amdgpu_kernel void @_Z6kernelv() local_unnamed_addr #1 { 
           18: entry: 
           19:  ret void 
check:55'0              X error: no match found
check:55'1                with "ATTR0" equal to "0"
           20: } 
check:55'0     ~~
           21:  
check:55'0     ~
           22: attributes #0 = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "no-trapping-math"="true" "stack-protector-buffer-size"="8" } 
check:55'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
           23: attributes #1 = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "amdgpu-flat-work-group-size"="1,1024" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "uniform-work-group-size"="true" } 
check:55'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
           24:  
check:55'0     ~
            .
...

@llvm-ci
Collaborator

llvm-ci commented Jul 14, 2024

LLVM Buildbot has detected a new failure on builder clang-aarch64-quick running on linaro-clang-aarch64-quick while building clang,llvm at step 5 "ninja check 1".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/65/builds/1354

Here is the relevant piece of the build log for the reference:

Step 5 (ninja check 1) failure: stage 1 checked (failure)
******************** TEST 'Clang :: CodeGenHIP/default-attributes.hip' FAILED ********************
Exit Code: 1

Command Output (stderr):
--
RUN: at line 2: /home/tcwg-buildbot/worker/clang-aarch64-quick/stage1/bin/clang -cc1 -internal-isystem /home/tcwg-buildbot/worker/clang-aarch64-quick/stage1/lib/clang/19/include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - /home/tcwg-buildbot/worker/clang-aarch64-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip | /home/tcwg-buildbot/worker/clang-aarch64-quick/stage1/bin/FileCheck -check-prefix=OPTNONE /home/tcwg-buildbot/worker/clang-aarch64-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip
+ /home/tcwg-buildbot/worker/clang-aarch64-quick/stage1/bin/clang -cc1 -internal-isystem /home/tcwg-buildbot/worker/clang-aarch64-quick/stage1/lib/clang/19/include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - /home/tcwg-buildbot/worker/clang-aarch64-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip
+ /home/tcwg-buildbot/worker/clang-aarch64-quick/stage1/bin/FileCheck -check-prefix=OPTNONE /home/tcwg-buildbot/worker/clang-aarch64-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip
RUN: at line 5: /home/tcwg-buildbot/worker/clang-aarch64-quick/stage1/bin/clang -cc1 -internal-isystem /home/tcwg-buildbot/worker/clang-aarch64-quick/stage1/lib/clang/19/include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - /home/tcwg-buildbot/worker/clang-aarch64-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip | /home/tcwg-buildbot/worker/clang-aarch64-quick/stage1/bin/FileCheck -check-prefix=OPT /home/tcwg-buildbot/worker/clang-aarch64-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip
+ /home/tcwg-buildbot/worker/clang-aarch64-quick/stage1/bin/clang -cc1 -internal-isystem /home/tcwg-buildbot/worker/clang-aarch64-quick/stage1/lib/clang/19/include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - /home/tcwg-buildbot/worker/clang-aarch64-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip
+ /home/tcwg-buildbot/worker/clang-aarch64-quick/stage1/bin/FileCheck -check-prefix=OPT /home/tcwg-buildbot/worker/clang-aarch64-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip
/home/tcwg-buildbot/worker/clang-aarch64-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip:55:9: error: OPT: expected string not found in input
// OPT: attributes #[[ATTR0]] = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "amdgpu-no-agpr" "amdgpu-no-completion-action" "amdgpu-no-default-queue" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-lds-kernel-id" "amdgpu-no-multigrid-sync-arg" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "amdgpu-waves-per-eu"="4,10" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "uniform-work-group-size"="false" }
        ^
<stdin>:19:10: note: scanning from here
 ret void
         ^
<stdin>:19:10: note: with "ATTR0" equal to "0"
 ret void
         ^

Input file: <stdin>
Check file: /home/tcwg-buildbot/worker/clang-aarch64-quick/llvm/clang/test/CodeGenHIP/default-attributes.hip

-dump-input=help explains the following input dump.

Input was:
<<<<<<
            .
            .
            .
           14: } 
           15:  
           16: ; Function Attrs: mustprogress nofree norecurse nosync nounwind willreturn memory(none) 
           17: define dso_local amdgpu_kernel void @_Z6kernelv() local_unnamed_addr #1 { 
           18: entry: 
           19:  ret void 
check:55'0              X error: no match found
check:55'1                with "ATTR0" equal to "0"
           20: } 
check:55'0     ~~
           21:  
check:55'0     ~
           22: attributes #0 = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "no-trapping-math"="true" "stack-protector-buffer-size"="8" } 
check:55'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
           23: attributes #1 = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "amdgpu-flat-work-group-size"="1,1024" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "uniform-work-group-size"="true" } 
check:55'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
           24:  
check:55'0     ~
            .
...
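
To see at a glance what the failing `OPT` check wanted versus what clang emitted, the two `attributes` lines from the log above can be compared as sets. This is only a minimal sketch, not part of the test suite: `parse_attrs` is a hypothetical helper, and the two strings are copied from the FileCheck output with `#[[ATTR0]]` already substituted by `0`.

```python
import re


def parse_attrs(line: str) -> set:
    """Return the attribute tokens found between the braces of an `attributes` line."""
    body = line[line.index("{") + 1 : line.rindex("}")]
    # A token is either a quoted key with an optional ="value" part, or a bare
    # word optionally followed by a parenthesized argument such as memory(none).
    return set(re.findall(r'"[^"]*"(?:="[^"]*")?|[\w-]+(?:\([^)]*\))?', body))


# The CHECK line from default-attributes.hip:55, as quoted in the log.
expected = (
    'attributes #0 = { mustprogress nofree norecurse nosync nounwind willreturn '
    'memory(none) "amdgpu-no-agpr" "amdgpu-no-completion-action" '
    '"amdgpu-no-default-queue" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" '
    '"amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" '
    '"amdgpu-no-lds-kernel-id" "amdgpu-no-multigrid-sync-arg" "amdgpu-no-queue-ptr" '
    '"amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" '
    '"amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" '
    '"amdgpu-waves-per-eu"="4,10" "no-trapping-math"="true" '
    '"stack-protector-buffer-size"="8" "uniform-work-group-size"="false" }'
)

# Line 22 of the input dump, i.e. what the -O3 run actually produced for #0.
actual = (
    'attributes #0 = { mustprogress nofree norecurse nosync nounwind willreturn '
    'memory(none) "no-trapping-math"="true" "stack-protector-buffer-size"="8" }'
)

missing = sorted(parse_attrs(expected) - parse_attrs(actual))
unexpected = sorted(parse_attrs(actual) - parse_attrs(expected))

print("expected but not emitted:")
for attr in missing:
    print("  ", attr)
print("emitted but not expected:", unexpected)
```

For this log the difference is the 17 `"amdgpu-no-*"` strings plus `"amdgpu-waves-per-eu"="4,10"` and `"uniform-work-group-size"="false"`; nothing in the emitted `#0` group is unexpected.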

@llvm-ci
Collaborator

llvm-ci commented Jul 14, 2024

LLVM Buildbot has detected a new failure on builder fuchsia-x86_64-linux running on fuchsia-debian-64-us-central1-a-1 while building clang,llvm at step 4 "annotate".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/11/builds/1668

Here is the relevant piece of the build log for the reference:

Step 4 (annotate) failure: 'python ../llvm-zorg/zorg/buildbot/builders/annotated/fuchsia-linux.py ...' (failure)
...
[1310/1312] Linking CXX executable tools/clang/unittests/Tooling/ToolingTests
[1311/1312] Running the Clang regression tests
llvm-lit: /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/utils/lit/lit/llvm/config.py:508: note: using clang: /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-_rzgovtk/bin/clang
llvm-lit: /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/utils/lit/lit/llvm/subst.py:126: note: Did not find clang-repl in /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-_rzgovtk/bin:/var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-_rzgovtk/bin
llvm-lit: /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/utils/lit/lit/llvm/config.py:508: note: using ld.lld: /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-_rzgovtk/bin/ld.lld
llvm-lit: /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/utils/lit/lit/llvm/config.py:508: note: using lld-link: /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-_rzgovtk/bin/lld-link
llvm-lit: /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/utils/lit/lit/llvm/config.py:508: note: using ld64.lld: /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-_rzgovtk/bin/ld64.lld
llvm-lit: /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/utils/lit/lit/llvm/config.py:508: note: using wasm-ld: /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-_rzgovtk/bin/wasm-ld
-- Testing: 20952 tests, 60 workers --
Testing:  0.. 10.. 20.. 30.. 40
FAIL: Clang :: CodeGenHIP/default-attributes.hip (9238 of 20952)
******************** TEST 'Clang :: CodeGenHIP/default-attributes.hip' FAILED ********************
Exit Code: 1

Command Output (stderr):
--
RUN: at line 2: /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-_rzgovtk/bin/clang -cc1 -internal-isystem /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-_rzgovtk/lib/clang/19/include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/clang/test/CodeGenHIP/default-attributes.hip | /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-_rzgovtk/bin/FileCheck -check-prefix=OPTNONE /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-_rzgovtk/bin/clang -cc1 -internal-isystem /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-_rzgovtk/lib/clang/19/include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-_rzgovtk/bin/FileCheck -check-prefix=OPTNONE /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
RUN: at line 5: /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-_rzgovtk/bin/clang -cc1 -internal-isystem /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-_rzgovtk/lib/clang/19/include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/clang/test/CodeGenHIP/default-attributes.hip | /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-_rzgovtk/bin/FileCheck -check-prefix=OPT /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-_rzgovtk/bin/clang -cc1 -internal-isystem /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-_rzgovtk/lib/clang/19/include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-_rzgovtk/bin/FileCheck -check-prefix=OPT /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
/var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/clang/test/CodeGenHIP/default-attributes.hip:55:9: error: OPT: expected string not found in input
// OPT: attributes #[[ATTR0]] = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "amdgpu-no-agpr" "amdgpu-no-completion-action" "amdgpu-no-default-queue" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-lds-kernel-id" "amdgpu-no-multigrid-sync-arg" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "amdgpu-waves-per-eu"="4,10" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "uniform-work-group-size"="false" }
        ^
<stdin>:19:10: note: scanning from here
 ret void
         ^
<stdin>:19:10: note: with "ATTR0" equal to "0"
 ret void
         ^

Input file: <stdin>
Check file: /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/clang/test/CodeGenHIP/default-attributes.hip

-dump-input=help explains the following input dump.

Input was:
<<<<<<
            .
            .
            .
           14: } 
           15:  
           16: ; Function Attrs: mustprogress nofree norecurse nosync nounwind willreturn memory(none) 
           17: define dso_local amdgpu_kernel void @_Z6kernelv() local_unnamed_addr #1 { 
           18: entry: 
           19:  ret void 
check:55'0              X error: no match found
Step 7 (check) failure: check (failure)

@llvm-ci
Collaborator

llvm-ci commented Jul 14, 2024

LLVM Buildbot has detected a new failure on builder llvm-clang-x86_64-darwin running on doug-worker-3 while building clang,llvm at step 6 "test-build-unified-tree-check-all".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/23/builds/966

Here is the relevant piece of the build log for the reference:

Step 6 (test-build-unified-tree-check-all) failure: test (failure)
******************** TEST 'Clang :: CodeGenHIP/default-attributes.hip' FAILED ********************
Exit Code: 1

Command Output (stderr):
--
RUN: at line 2: /Volumes/RAMDisk/buildbot-root/x86_64-darwin/build/bin/clang -cc1 -internal-isystem /Volumes/RAMDisk/buildbot-root/x86_64-darwin/build/lib/clang/19/include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - /Volumes/RAMDisk/buildbot-root/x86_64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip | /Volumes/RAMDisk/buildbot-root/x86_64-darwin/build/bin/FileCheck -check-prefix=OPTNONE /Volumes/RAMDisk/buildbot-root/x86_64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /Volumes/RAMDisk/buildbot-root/x86_64-darwin/build/bin/clang -cc1 -internal-isystem /Volumes/RAMDisk/buildbot-root/x86_64-darwin/build/lib/clang/19/include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - /Volumes/RAMDisk/buildbot-root/x86_64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /Volumes/RAMDisk/buildbot-root/x86_64-darwin/build/bin/FileCheck -check-prefix=OPTNONE /Volumes/RAMDisk/buildbot-root/x86_64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
RUN: at line 5: /Volumes/RAMDisk/buildbot-root/x86_64-darwin/build/bin/clang -cc1 -internal-isystem /Volumes/RAMDisk/buildbot-root/x86_64-darwin/build/lib/clang/19/include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - /Volumes/RAMDisk/buildbot-root/x86_64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip | /Volumes/RAMDisk/buildbot-root/x86_64-darwin/build/bin/FileCheck -check-prefix=OPT /Volumes/RAMDisk/buildbot-root/x86_64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /Volumes/RAMDisk/buildbot-root/x86_64-darwin/build/bin/clang -cc1 -internal-isystem /Volumes/RAMDisk/buildbot-root/x86_64-darwin/build/lib/clang/19/include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - /Volumes/RAMDisk/buildbot-root/x86_64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
+ /Volumes/RAMDisk/buildbot-root/x86_64-darwin/build/bin/FileCheck -check-prefix=OPT /Volumes/RAMDisk/buildbot-root/x86_64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip
/Volumes/RAMDisk/buildbot-root/x86_64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip:55:9: error: OPT: expected string not found in input
// OPT: attributes #[[ATTR0]] = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "amdgpu-no-agpr" "amdgpu-no-completion-action" "amdgpu-no-default-queue" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-lds-kernel-id" "amdgpu-no-multigrid-sync-arg" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "amdgpu-waves-per-eu"="4,10" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "uniform-work-group-size"="false" }
        ^
<stdin>:19:10: note: scanning from here
 ret void
         ^
<stdin>:19:10: note: with "ATTR0" equal to "0"
 ret void
         ^

Input file: <stdin>
Check file: /Volumes/RAMDisk/buildbot-root/x86_64-darwin/llvm-project/clang/test/CodeGenHIP/default-attributes.hip

-dump-input=help explains the following input dump.

Input was:
<<<<<<
            .
            .
            .
           14: } 
           15:  
           16: ; Function Attrs: mustprogress nofree norecurse nosync nounwind willreturn memory(none) 
           17: define dso_local amdgpu_kernel void @_Z6kernelv() local_unnamed_addr #1 { 
           18: entry: 
           19:  ret void 
check:55'0              X error: no match found
check:55'1                with "ATTR0" equal to "0"
           20: } 
check:55'0     ~~
           21:  
check:55'0     ~
           22: attributes #0 = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "no-trapping-math"="true" "stack-protector-buffer-size"="8" } 
check:55'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
           23: attributes #1 = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "amdgpu-flat-work-group-size"="1,1024" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "uniform-work-group-size"="true" } 
check:55'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
           24:  
check:55'0     ~
            .
...

@llvm-ci
Collaborator

llvm-ci commented Jul 14, 2024

LLVM Buildbot has detected a new failure on builder clang-arm64-windows-msvc running on linaro-armv8-windows-msvc-04 while building clang,llvm at step 5 "ninja check 1".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/161/builds/575

Here is the relevant piece of the build log for the reference:

Step 5 (ninja check 1) failure: stage 1 checked (failure)
******************** TEST 'Clang :: CodeGenHIP/default-attributes.hip' FAILED ********************
Exit Code: 1

Command Output (stdout):
--
# RUN: at line 2
c:\users\tcwg\llvm-worker\clang-arm64-windows-msvc\stage1\bin\clang.exe -cc1 -internal-isystem C:\Users\tcwg\llvm-worker\clang-arm64-windows-msvc\stage1\lib\clang\19\include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - C:\Users\tcwg\llvm-worker\clang-arm64-windows-msvc\llvm\clang\test\CodeGenHIP\default-attributes.hip | c:\users\tcwg\llvm-worker\clang-arm64-windows-msvc\stage1\bin\filecheck.exe -check-prefix=OPTNONE C:\Users\tcwg\llvm-worker\clang-arm64-windows-msvc\llvm\clang\test\CodeGenHIP\default-attributes.hip
# executed command: 'c:\users\tcwg\llvm-worker\clang-arm64-windows-msvc\stage1\bin\clang.exe' -cc1 -internal-isystem 'C:\Users\tcwg\llvm-worker\clang-arm64-windows-msvc\stage1\lib\clang\19\include' -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - 'C:\Users\tcwg\llvm-worker\clang-arm64-windows-msvc\llvm\clang\test\CodeGenHIP\default-attributes.hip'
# executed command: 'c:\users\tcwg\llvm-worker\clang-arm64-windows-msvc\stage1\bin\filecheck.exe' -check-prefix=OPTNONE 'C:\Users\tcwg\llvm-worker\clang-arm64-windows-msvc\llvm\clang\test\CodeGenHIP\default-attributes.hip'
# RUN: at line 5
c:\users\tcwg\llvm-worker\clang-arm64-windows-msvc\stage1\bin\clang.exe -cc1 -internal-isystem C:\Users\tcwg\llvm-worker\clang-arm64-windows-msvc\stage1\lib\clang\19\include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - C:\Users\tcwg\llvm-worker\clang-arm64-windows-msvc\llvm\clang\test\CodeGenHIP\default-attributes.hip | c:\users\tcwg\llvm-worker\clang-arm64-windows-msvc\stage1\bin\filecheck.exe -check-prefix=OPT C:\Users\tcwg\llvm-worker\clang-arm64-windows-msvc\llvm\clang\test\CodeGenHIP\default-attributes.hip
# executed command: 'c:\users\tcwg\llvm-worker\clang-arm64-windows-msvc\stage1\bin\clang.exe' -cc1 -internal-isystem 'C:\Users\tcwg\llvm-worker\clang-arm64-windows-msvc\stage1\lib\clang\19\include' -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - 'C:\Users\tcwg\llvm-worker\clang-arm64-windows-msvc\llvm\clang\test\CodeGenHIP\default-attributes.hip'
# executed command: 'c:\users\tcwg\llvm-worker\clang-arm64-windows-msvc\stage1\bin\filecheck.exe' -check-prefix=OPT 'C:\Users\tcwg\llvm-worker\clang-arm64-windows-msvc\llvm\clang\test\CodeGenHIP\default-attributes.hip'
# .---command stderr------------
# | C:\Users\tcwg\llvm-worker\clang-arm64-windows-msvc\llvm\clang\test\CodeGenHIP\default-attributes.hip:55:9: error: OPT: expected string not found in input
# | // OPT: attributes #[[ATTR0]] = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "amdgpu-no-agpr" "amdgpu-no-completion-action" "amdgpu-no-default-queue" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-lds-kernel-id" "amdgpu-no-multigrid-sync-arg" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "amdgpu-waves-per-eu"="4,10" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "uniform-work-group-size"="false" }
# |         ^
# | <stdin>:19:10: note: scanning from here
# |  ret void
# |          ^
# | <stdin>:19:10: note: with "ATTR0" equal to "0"
# |  ret void
# |          ^
# | 
# | Input file: <stdin>
# | Check file: C:\Users\tcwg\llvm-worker\clang-arm64-windows-msvc\llvm\clang\test\CodeGenHIP\default-attributes.hip
# | 
# | -dump-input=help explains the following input dump.
# | 
# | Input was:
# | <<<<<<
# |             .
# |             .
# |             .
# |            14: } 
# |            15:  
# |            16: ; Function Attrs: mustprogress nofree norecurse nosync nounwind willreturn memory(none) 
# |            17: define dso_local amdgpu_kernel void @_Z6kernelv() local_unnamed_addr #1 { 
# |            18: entry: 
# |            19:  ret void 
# | check:55'0              X error: no match found
# | check:55'1                with "ATTR0" equal to "0"
# |            20: } 
# | check:55'0     ~~
# |            21:  
# | check:55'0     ~
# |            22: attributes #0 = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "no-trapping-math"="true" "stack-protector-buffer-size"="8" } 
# | check:55'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |            23: attributes #1 = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "amdgpu-flat-work-group-size"="1,1024" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "uniform-work-group-size"="true" } 
# | check:55'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
...

@llvm-ci
Collaborator

llvm-ci commented Jul 14, 2024

LLVM Buildbot has detected a new failure on builder llvm-clang-win-x-aarch64 running on as-builder-2 while building clang,llvm at step 9 "test-check-clang".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/193/builds/940

Here is the relevant piece of the build log for the reference:

Step 9 (test-check-clang) failure: Test just built components for check-clang completed (failure)
******************** TEST 'Clang :: CodeGenHIP/default-attributes.hip' FAILED ********************
Exit Code: 1

Command Output (stdout):
--
# RUN: at line 2
c:\buildbot\as-builder-2\x-aarch64\build\bin\clang.exe -cc1 -internal-isystem C:\buildbot\as-builder-2\x-aarch64\build\lib\clang\19\include -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - C:\buildbot\as-builder-2\x-aarch64\llvm-project\clang\test\CodeGenHIP\default-attributes.hip | c:\buildbot\as-builder-2\x-aarch64\build\bin\filecheck.exe -check-prefix=OPTNONE C:\buildbot\as-builder-2\x-aarch64\llvm-project\clang\test\CodeGenHIP\default-attributes.hip
# executed command: 'c:\buildbot\as-builder-2\x-aarch64\build\bin\clang.exe' -cc1 -internal-isystem 'C:\buildbot\as-builder-2\x-aarch64\build\lib\clang\19\include' -nostdsysteminc -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - 'C:\buildbot\as-builder-2\x-aarch64\llvm-project\clang\test\CodeGenHIP\default-attributes.hip'
# executed command: 'c:\buildbot\as-builder-2\x-aarch64\build\bin\filecheck.exe' -check-prefix=OPTNONE 'C:\buildbot\as-builder-2\x-aarch64\llvm-project\clang\test\CodeGenHIP\default-attributes.hip'
# RUN: at line 5
c:\buildbot\as-builder-2\x-aarch64\build\bin\clang.exe -cc1 -internal-isystem C:\buildbot\as-builder-2\x-aarch64\build\lib\clang\19\include -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device     -emit-llvm -o - C:\buildbot\as-builder-2\x-aarch64\llvm-project\clang\test\CodeGenHIP\default-attributes.hip | c:\buildbot\as-builder-2\x-aarch64\build\bin\filecheck.exe -check-prefix=OPT C:\buildbot\as-builder-2\x-aarch64\llvm-project\clang\test\CodeGenHIP\default-attributes.hip
# executed command: 'c:\buildbot\as-builder-2\x-aarch64\build\bin\clang.exe' -cc1 -internal-isystem 'C:\buildbot\as-builder-2\x-aarch64\build\lib\clang\19\include' -nostdsysteminc -O3 -triple amdgcn-amd-amdhsa -x hip -fno-ident -fcuda-is-device -emit-llvm -o - 'C:\buildbot\as-builder-2\x-aarch64\llvm-project\clang\test\CodeGenHIP\default-attributes.hip'
# executed command: 'c:\buildbot\as-builder-2\x-aarch64\build\bin\filecheck.exe' -check-prefix=OPT 'C:\buildbot\as-builder-2\x-aarch64\llvm-project\clang\test\CodeGenHIP\default-attributes.hip'
# .---command stderr------------
# | C:\buildbot\as-builder-2\x-aarch64\llvm-project\clang\test\CodeGenHIP\default-attributes.hip:55:9: error: OPT: expected string not found in input
# | // OPT: attributes #[[ATTR0]] = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "amdgpu-no-agpr" "amdgpu-no-completion-action" "amdgpu-no-default-queue" "amdgpu-no-dispatch-id" "amdgpu-no-dispatch-ptr" "amdgpu-no-heap-ptr" "amdgpu-no-hostcall-ptr" "amdgpu-no-implicitarg-ptr" "amdgpu-no-lds-kernel-id" "amdgpu-no-multigrid-sync-arg" "amdgpu-no-queue-ptr" "amdgpu-no-workgroup-id-x" "amdgpu-no-workgroup-id-y" "amdgpu-no-workgroup-id-z" "amdgpu-no-workitem-id-x" "amdgpu-no-workitem-id-y" "amdgpu-no-workitem-id-z" "amdgpu-waves-per-eu"="4,10" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "uniform-work-group-size"="false" }
# |         ^
# | <stdin>:19:10: note: scanning from here
# |  ret void
# |          ^
# | <stdin>:19:10: note: with "ATTR0" equal to "0"
# |  ret void
# |          ^
# | 
# | Input file: <stdin>
# | Check file: C:\buildbot\as-builder-2\x-aarch64\llvm-project\clang\test\CodeGenHIP\default-attributes.hip
# | 
# | -dump-input=help explains the following input dump.
# | 
# | Input was:
# | <<<<<<
# |             .
# |             .
# |             .
# |            14: } 
# |            15:  
# |            16: ; Function Attrs: mustprogress nofree norecurse nosync nounwind willreturn memory(none) 
# |            17: define dso_local amdgpu_kernel void @_Z6kernelv() local_unnamed_addr #1 { 
# |            18: entry: 
# |            19:  ret void 
# | check:55'0              X error: no match found
# | check:55'1                with "ATTR0" equal to "0"
# |            20: } 
# | check:55'0     ~~
# |            21:  
# | check:55'0     ~
# |            22: attributes #0 = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "no-trapping-math"="true" "stack-protector-buffer-size"="8" } 
# | check:55'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |            23: attributes #1 = { mustprogress nofree norecurse nosync nounwind willreturn memory(none) "amdgpu-flat-work-group-size"="1,1024" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "uniform-work-group-size"="true" } 
# | check:55'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
...

aaryanshukla pushed a commit to aaryanshukla/llvm-project that referenced this pull request Jul 14, 2024
Removing it from the codegen pipeline induces a lot of test churn
because llc is no longer optimizing out implicit arguments to kernels.

Mostly mechanical, but there are some creative test updates. I preferred
to take the changes as-is in tests where the ABI isn't relevant. In
cases where it's more relevant, or the optimize out logic was too
ingrained in the test, I pre-run the optimization. Some cases manually
add attributes to disable inputs.
@dyung
Collaborator

dyung commented Jul 14, 2024

@arsenm are you aware that a test broken by this change is still failing about 16 hours later? Can you please take a look, and revert if you need time to investigate, so that we can get the bots back to green?

Some failing bots:

  1. https://lab.llvm.org/buildbot/#/builders/144/builds/2221
  2. https://lab.llvm.org/buildbot/#/builders/46/builds/1587
  3. https://lab.llvm.org/buildbot/#/builders/190/builds/1798

@petrhosek
Member

I think we should go ahead and revert this change given that it's been almost an entire day and the builders are still broken.

dyung added a commit to dyung/llvm-project that referenced this pull request Jul 15, 2024
…" and follow up commit "clang/AMDGPU: Defeat attribute optimization in attribute test"

This reverts commits 677cc15 and 78bc1b6.

The test CodeGenHIP/default-attributes.hip is failing on multiple bots.
arsenm added a commit that referenced this pull request Jul 15, 2024
… and follow up commit "clang/AMDGPU: Defeat attribute optimization in attribute test" (#98851)

This reverts commit adaff46.

Drop the -O3 checks from default-attributes.hip. I don't know why they
are different on some bots but reverting this is far too disruptive.
@jplehr
Contributor

jplehr commented Jul 15, 2024

Hey @arsenm this broke all AMDGPU OpenMP Offload buildbots (e.g., https://lab.llvm.org/buildbot/#/builders/30).

Any chance you can fix these issues?

@arsenm
Contributor Author

arsenm commented Jul 15, 2024

> Hey @arsenm this broke all AMDGPU OpenMP Offload buildbots (e.g., https://lab.llvm.org/buildbot/#/builders/30).
>
> Any chance you can fix these issues?

Can you attach before/after IR and just XFAIL them for now?

antiagainst pushed a commit to triton-lang/triton that referenced this pull request Sep 5, 2024
In a recent change to the [LLVM AMD
backend](llvm/llvm-project#83131), we moved the
`AMDGPUAttributor` pass into the optimization pipeline (as opposed to
the codegen pipeline).

Since this pass is specific to `AMD` targets, we want to pass the
`TargetMachine` when building the pipeline, i.e., during the call to
`optimize_module`.

Failing to do so increases the number of registers used.
We also spoke with our LLVM backend team, and they advised always
passing the `TargetMachine` when building the LLVM optimization pipeline.

This PR addresses the issue as follows:
- I added optional parameters to the `optimize_module` function (similar
to those passed to `translate_to_asm`)
- If those parameters are passed in, we create the `TargetMachine`
and pass it to the `PassBuilder`
- Otherwise the `TargetMachine` is still `nullptr` (as it was
before)

Please note that, as it stands now, this change only affects the AMD
backend.
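
The control flow described in the list above can be sketched roughly as follows. This is only an illustration, not the actual Triton or LLVM code: `TargetMachine`, `PassBuilder`, and `make_pass_builder` are hypothetical stand-ins, the `arch`/`features`/`flags` parameters are assumed names, and treating a non-empty `arch` as "the parameters were passed in" is an assumption as well.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class TargetMachine:
    """Hypothetical stand-in for an LLVM TargetMachine."""
    arch: str
    features: str
    flags: tuple


@dataclass
class PassBuilder:
    """Hypothetical stand-in for an LLVM PassBuilder.

    It only records which target machine (if any) it was given.
    """
    target_machine: Optional[TargetMachine]

    def target_specific_passes_enabled(self) -> bool:
        # In this sketch, target-specific passes can only be scheduled
        # when the builder knows the target machine.
        return self.target_machine is not None


def make_pass_builder(arch: str = "", features: str = "",
                      flags: Sequence[str] = ()) -> PassBuilder:
    """Create a pass builder, with a target machine only if one was requested.

    Assumption: an empty `arch` means the optional parameters were omitted.
    """
    if arch:
        target_machine = TargetMachine(arch, features, tuple(flags))
    else:
        # Previous behaviour: no target machine (nullptr in the C++ code).
        target_machine = None
    return PassBuilder(target_machine)
```

Calling `make_pass_builder()` with no arguments keeps the old behaviour, while a call such as `make_pass_builder("gfx942")` (the architecture name is only illustrative) yields a builder that knows its target.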
vlad-penkin pushed a commit to intel/intel-xpu-backend-for-triton that referenced this pull request Sep 6, 2024
searlmc1 pushed a commit to ROCm/llvm-project that referenced this pull request Nov 22, 2024
…lvm#83131)" and follow up commit "clang/AMDGPU: Defeat attribute optimization in attribute test" (llvm#98851)"

This reverts commit b1bcb7c.

Change-Id: Ia262230003989ed152f82ea475364b42d2592090
bertmaher pushed a commit to bertmaher/triton that referenced this pull request Dec 10, 2024

8 participants