Open
Description
Performance comes back when reverting 7ef77eb
Compilation command: flang -Ofast -mcpu=neoverse-v1 linpk.f90 -frtlib-add-rpath
Before the change:
2124.44 msec task-clock # 0.997 CPUs utilized
5510839431 cycles # 2.594 GHz
6419270661 instructions # 1.16 insn per cycle
After the change:
3101.06 msec task-clock # 0.997 CPUs utilized
8045294105 cycles # 2.594 GHz
5452469574 instructions # 0.68 insn per cycle
Attaching LLVM IR dumps before and after the change, and the assembly.